yztongzhan

Researcher at Ant Research.

KU Leuven, Leuven, Belgium

yztongzhan's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 45.2k 299 6505.3k
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language: Python 35.3k 1k 1853.4k
meta-llama/codellama
Inference code for CodeLlama models
Language: Python 15.4k 176 1901.8k
NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
Language: C 14.2k 172 3181.2k
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Language: Python 4.8k 74 79389
OpenGVLab/InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
Language: Python 3.2k 43 49229
TigerResearch/TigerBot
TigerBot: A multi-language multi-task LLM
Language: Python 2.2k 31 125194
ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Language: Python 2k 17 112155
fenglinglwb/MAT
MAT: Mask-Aware Transformer for Large Hole Image Inpainting
Language: Python 717 11 11379
OpenGVLab/VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Language: Python 441 6 5143
tinatiansjz/hmr-survey
[TPAMI 2023] Recovering 3D Human Mesh from Monocular Images: A Survey
333 15 511
ShoufaChen/AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
Language: Python 305 7 3517
MCG-NJU/SparseBEV
[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
Language: Python 303 9 7720
chaytonmin/Occupancy-MAE
Official implementation of our TIV'23 paper: Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders
Language: Python 241 7 3020
implus/UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Language: Jupyter Notebook 234 5 2220
fenglinglwb/EDT
On Efficient Transformer-Based Image Pre-training for Low-Level Vision
Language: Python 127 14 118
MCG-NJU/SportsMOT
[ICCV 2023] SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes
Language: Python 122 5 154
AILab-CVC/GroupMixFormer
GroupMixAttention and GroupMixFormer
Language: Python 107 9 411
zhaoyue-zephyrus/AVION
Code release for "Training a Large Video Model on a Single Machine in a Day"
Language: Python 99 1 104
zhaoyue-zephyrus/TeSTra
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
Language: Python 94 2 117
ChongjianGE/MetaBEV
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation
86 6 66
showlab/sparseformer
(ICLR 2024, CVPR 2024) SparseFormer
Language: Python 61 9 31
MCG-NJU/DDM
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Language: Python 47 2 103
MCG-NJU/STMixer
[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
Language: Python 47 1 44
MCG-NJU/VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Language: Python 46 2 63
MCG-NJU/EVAD
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
Language: Python 20 2 43
leexinhao/ZeroI2V
Official implementation of "ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video"
Language: Python 13 3 70
ChongjianGE/SNCLR
[ICLR 2023] Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning
Language: Python 11 2 21
sebgao/chatgpt_mini_helper
My customized GPT 3.5 helper
Language: Python 7 2 01
yztongzhan/VideoMAE
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python 1 1 00
