ShoufaChen

Ph.D. student, The University of Hong Kong

The University of Hong KongHong Kong

Pinned Repositories

LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.9k 21 7985
goku
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Language:Python2.9k 145 0312
grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
Language:Jupyter Notebook416 4 1022
AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
Language:Python368 7 3521
Awesome-Diffusion-Transformers
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
Language:HTML137 6 18
clone-anonymous4open
clone/download codes from https://anonymous.4open.science/
Language:Python33 2 58
CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
Language:Python290 3 1727
DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Language:Python2.2k 17 115172
WOO
[ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"
44 14 70

ShoufaChen's Repositories

ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Language:Python2.2k 17 115172
ShoufaChen/AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
Language:Python368 7 3521
ShoufaChen/CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
Language:Python290 3 1727
ShoufaChen/Awesome-Diffusion-Transformers
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
Language:HTML137 6 18
ShoufaChen/WOO
[ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"
44 14 70
ShoufaChen/clone-anonymous4open
clone/download codes from https://anonymous.4open.science/
Language:Python33 2 58
ShoufaChen/gradio-box
Language:Python18 2 21
ShoufaChen/Grounded-Segment-Anything-patch
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect , Segment and Generate Anything with Image and Text Inputs
Language:Jupyter Notebook13 1 02
ShoufaChen/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python4 1 00
ShoufaChen/COMP3340_Transformer_MLP
Language:Python3 3 42
ShoufaChen/accelerate-patch
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python1 1 0
ShoufaChen/Awesome-Anything-patch
AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask
1 1 00
ShoufaChen/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
1 1 0
ShoufaChen/diffusers-dev
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language:Python1 1 0
ShoufaChen/pytorch-grad-cam
Many Class Activation Map methods implemented in Pytorch for CNNs and Vision Transformers. Examples for classification, object detection, segmentation, embedding networks and more. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM
Language:Python1 2 00
ShoufaChen/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Language:Python1 2 00
ShoufaChen/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Language:Python1 0
ShoufaChen/Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based paper in computer vision and vision-language learning.
1 0
ShoufaChen/COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
1 05
ShoufaChen/detr-patch
End-to-End Object Detection with Transformers
Language:Python1 0
ShoufaChen/DiffDock-patch
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
Language:Python1 0
ShoufaChen/gpu-burn
Multi-GPU CUDA stress test
Language:C++1 0
ShoufaChen/jekyll
:globe_with_meridians: Jekyll is a blog-aware static site generator in Ruby
Language:Ruby1 0
ShoufaChen/langchain-patch
⚡ Building applications with LLMs through composability ⚡
Language:Python1 0
ShoufaChen/lqae
Language Quantized AutoEncoders
Language:Python1 0
ShoufaChen/mdetr-1
Language:Python1 0
ShoufaChen/minisora-patch
The Mini Sora project aims to explore the implementation path and future development direction of Sora.
Language:Python1 0
ShoufaChen/mmcv
OpenMMLab Computer Vision Foundation
Language:Python2 0
ShoufaChen/torchdrug
A powerful and flexible machine learning platform for drug discovery
Language:Python1 0
ShoufaChen/waymo-open-dataset
Waymo Open Dataset
Language:C++2 0