JosephZZ's Stars
zhenzhiwang/HumanVid
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
LTH14/mage
A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
cientgu/VQ-Diffusion
lcysyzxdxc/MISC
Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model
Q-Future/CMC-Bench
[LMM + codec] A new paradigm of visual signal compression!
scenarios/WeMM
nicklashansen/puppeteer
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
bytedance/GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
buoyancy99/diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
valeoai/Maskgit-pytorch
Kwai-Kolors/Kolors
Kolors Team
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
FoundationVision/VAR
[GPT beats diffusion] [scaling laws in visual generation] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
ytongbai/LVM
openai/transformer-debugger
wilson1yan/VideoGPT
songweige/TATS
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
sail-sg/lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
project-baize/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.