XiongyiCai's Stars
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Troivyn/HazeCLIP
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
jyrao/MatchTime
[EMNLP 2024 Oral] MatchTime: Towards Automatic Soccer Game Commentary Generation
mx-mark/VideoTransformer-pytorch
PyTorch implementation of a collection of scalable Video Transformer benchmarks.
princeton-vl/RAFT
haoningwu3639/StoryGen
[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
google-deepmind/tapnet
Tracking Any Point (TAP)
f/awesome-chatgpt-prompts
This repo curates ChatGPT prompts to help you use ChatGPT better.
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
dougsouza/pytorch-sync-batchnorm-example
How to use Cross Replica / Synchronized Batchnorm in PyTorch
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
facebookresearch/detr
End-to-End Object Detection with Transformers
o0o0o0o0o0o0o/image-processing-from-scratch
This project contains some interesting image processing algorithms written from scratch in Python and C++.
facebookresearch/MeMViT
Code release for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
aharley/pips
Particle Video Revisited
megvii-research/SOLQ
"SOLQ: Segmenting Objects by Learning Queries". SOLQ is an end-to-end, Transformer-based instance segmentation framework.
UMJI-Advising-Center/Workshops
Archives for experience-sharing workshops held by Advising Center.