Pinned Repositories
CT-Net
[ICLR2021] Official implementation of CT-Net
Nightcrawler
Top-2 Solution for CVPR UG2+ Track2
SaoImage
A web app about image style transfer
seg-for-fun
Top-1 Solution for CCF BDCI Seg
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
UniFormer
[ICLR2022] Official implementation of UniFormer
Andy1621's Repositories
Andy1621/CT-Net
[ICLR2021] Official implementation of CT-Net
Andy1621/Nightcrawler
Top-2 Solution for CVPR UG2+ Track2
Andy1621/CS231n-Notes
Notes and resources for CS231n
Andy1621/CrossFormer
The official code for the paper: https://arxiv.org/pdf/2108.00154.pdf
Andy1621/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc.
Andy1621/Kinetics-TPS-evaluation
Fork of xiadingZ/Kinetics-TPS-evaluation
Andy1621/Andy1621
Andy1621/Andy1621.github.io
Andy1621/anuraghazra
Andy1621/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
Andy1621/catalyst
Accelerated deep learning R&D
Andy1621/ConvNeXt
Code release for ConvNeXt model
Andy1621/CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
Andy1621/deit
Official DeiT repository
Andy1621/grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
Andy1621/Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion - Detect, Segment and Generate Anything with Text Inputs
Andy1621/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Andy1621/MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Andy1621/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Andy1621/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Andy1621/PaddleViT
🤖 PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Andy1621/save_file
Andy1621/sd-webui-segment-anything
Segment Anything for Stable Diffusion Webui
Andy1621/SKNet-PyTorch
Nearly Perfect & Easily Understandable PyTorch Implementation of SKNet
Andy1621/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Andy1621/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Andy1621/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Andy1621/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Andy1621/VideoMAE
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Andy1621/visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models