Adrien987k

PhD Student at Meta AI & Inria

MetaParis

Adrien987k's Stars

facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.4k600
stack-of-tasks/pinocchio
A fast and flexible implementation of Rigid Body Dynamics algorithms and their analytical derivatives
Language:C++1.8k383
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python2.9k207
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python2.7k245
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Language:Python79660
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
Language:Python2.8k354
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
Language:Python2.6k251
state-spaces/s4
Structured state space sequence models
Language:Jupyter Notebook2.4k285
antoyang/VidChapters
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
Language:Jupyter Notebook17519
google-deepmind/perception_test
Language:Jupyter Notebook18515
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Language:Python2.2k177
tatp22/multidim-positional-encoding
An implementation of 1D, 2D, and 3D positional encoding in Pytorch and TensorFlow
Language:Python53535
SwinTransformer/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
Language:Python1.4k199
facebookresearch/active_indexing
Official implementation of "Active Image Indexing"
Language:Jupyter Notebook585
xvjiarui/VFS
Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)
Language:Python14411
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.6k2.5k
lightly-ai/lightly
A python library for self-supervised learning on images.
Language:Python3.1k258
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python38.7k5k
valeoai/sfrik
Official code for "Self-supervised learning with rotation-invariant kernels"
Language:Python12
XuelianCheng/LEAStereo
Hierarchical Neural Architecture Searchfor Deep Stereo Matching (NeurIPS 2020)
Language:Python25651
antoyang/FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Language:Python15323
facebookresearch/VICRegL
VICRegL official code base
Language:Python22324
facebookresearch/msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
Language:Python44634
zhyever/Monocular-Depth-Estimation-Toolbox
Monocular Depth Estimation Toolbox based on MMSegmentation.
Language:Python903104
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook67.8k10.1k
lliuz/ARFlow
The official PyTorch implementation of the paper "Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation".
Language:Python25150
princeton-vl/RAFT
Language:Python3.2k628
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language:Python1.3k134
bytedance/ibot
iBOT :robot:: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
Language:Jupyter Notebook67177
facebookresearch/metaseq
Repo for external large-scale work
Language:Python6.5k724

Adrien987k

Adrien987k's Stars

facebookresearch/xformers

stack-of-tasks/pinocchio

PKU-YuanGroup/Video-LLaVA

DAMO-NLP-SG/Video-LLaMA

OpenGVLab/VideoMamba

facebookresearch/ijepa

facebookresearch/jepa

state-spaces/s4

antoyang/VidChapters

google-deepmind/perception_test

webdataset/webdataset

tatp22/multidim-positional-encoding

SwinTransformer/Video-Swin-Transformer

facebookresearch/active_indexing

xvjiarui/VFS

microsoft/unilm

lightly-ai/lightly

Stability-AI/stablediffusion

valeoai/sfrik

XuelianCheng/LEAStereo

antoyang/FrozenBiLM

facebookresearch/VICRegL

facebookresearch/msn

zhyever/Monocular-Depth-Estimation-Toolbox

CompVis/stable-diffusion

lliuz/ARFlow

princeton-vl/RAFT

MCG-NJU/VideoMAE

bytedance/ibot

facebookresearch/metaseq