Pinned Repositories
LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
goku
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
Awesome-Diffusion-Transformers
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
clone-anonymous4open
clone/download codes from https://anonymous.4open.science/
CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
WOO
[ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"
ShoufaChen's Repositories
ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
ShoufaChen/AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
ShoufaChen/CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
ShoufaChen/Awesome-Diffusion-Transformers
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
ShoufaChen/WOO
[ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"
ShoufaChen/clone-anonymous4open
clone/download codes from https://anonymous.4open.science/
ShoufaChen/gradio-box
ShoufaChen/Grounded-Segment-Anything-patch
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect , Segment and Generate Anything with Image and Text Inputs
ShoufaChen/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
ShoufaChen/COMP3340_Transformer_MLP
ShoufaChen/accelerate-patch
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
ShoufaChen/Awesome-Anything-patch
AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask
ShoufaChen/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
ShoufaChen/diffusers-dev
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
ShoufaChen/pytorch-grad-cam
Many Class Activation Map methods implemented in Pytorch for CNNs and Vision Transformers. Examples for classification, object detection, segmentation, embedding networks and more. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM
ShoufaChen/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
ShoufaChen/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
ShoufaChen/Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based paper in computer vision and vision-language learning.
ShoufaChen/COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
ShoufaChen/detr-patch
End-to-End Object Detection with Transformers
ShoufaChen/DiffDock-patch
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
ShoufaChen/gpu-burn
Multi-GPU CUDA stress test
ShoufaChen/jekyll
:globe_with_meridians: Jekyll is a blog-aware static site generator in Ruby
ShoufaChen/langchain-patch
⚡ Building applications with LLMs through composability ⚡
ShoufaChen/lqae
Language Quantized AutoEncoders
ShoufaChen/mdetr-1
ShoufaChen/minisora-patch
The Mini Sora project aims to explore the implementation path and future development direction of Sora.
ShoufaChen/mmcv
OpenMMLab Computer Vision Foundation
ShoufaChen/torchdrug
A powerful and flexible machine learning platform for drug discovery
ShoufaChen/waymo-open-dataset
Waymo Open Dataset