JosephZZ

JosephZZ's Stars

zhenzhiwang/HumanVid
Language:Python2033
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
Language:Python47169
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language: Python · 50.6k stars · 5.3k forks
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Language:Python85539
LTH14/mage
A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
Language:Python50726
cientgu/VQ-Diffusion
Language:Python43243
lcysyzxdxc/MISC
Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model
Language:Jupyter Notebook18
Q-Future/CMC-Bench
[LMM + codec] A new paradigm of visual signal compression!
Language:Python25
scenarios/WeMM
Language:Python8211
nicklashansen/puppeteer
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
Language:Python1407
bytedance/GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
Language:Python953
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Language: Python · 2.3k stars · 107 forks
buoyancy99/diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Language:Python49519
valeoai/Maskgit-pytorch
Language:Jupyter Notebook14515
Kwai-Kolors/Kolors
Kolors Team
Language: Python · 3.5k stars · 225 forks
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language: Jupyter Notebook · 5k stars · 321 forks
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Python · 4k stars · 302 forks
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language: Python · 2.4k stars · 192 forks
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
Language:Python78938
ytongbai/LVM
Language: Python · 1.7k stars · 54 forks
openai/transformer-debugger
Language: Python · 4k stars · 231 forks
wilson1yan/VideoGPT
Language:Jupyter Notebook962117
songweige/TATS
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)
Language:Python26317
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
Language:Python93842
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python · 2k stars · 164 forks
sail-sg/lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Language:Python57235
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.4k stars · 157 forks
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook · 1.4k stars · 224 forks
project-baize/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
Language: Python · 3.2k stars · 280 forks
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python · 36.4k stars · 4.5k forks