ChrisLiu6

ChrisLiu6's Stars

black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python15.5k 136 1471.1k
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python10.5k 103 360866
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language:Python6.1k 74 5361k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python5.9k 52 593459
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language:Jupyter Notebook5.8k 76 2201.1k
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python4.2k 115 81309
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Language:Python2.5k 43 387154
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
1.8k 53 1490
apple/ml-4m
4M: Massively Multimodal Masked Modeling
Language:Python1.6k 33 2194
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.3k 22 6054
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Language:Python957 18 6350
ckkelvinchan/RealBasicVSR
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
Language:Python915 13 87135
Mukosame/Zooming-Slow-Mo-CVPR-2020
Fast and Accurate One-Stage Space-Time Video Super-Resolution (accepted in CVPR 2020)
Language:Python915 31 69164
GAIR-NLP/anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
Language:Python667 10 4436
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Language:Python492 6 2920
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Language:Python445 19 2824
AILab-CVC/SEED-X
Multimodal Models in Real World
Language:Jupyter Notebook396 19 2616
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
Language:Python388 7 1136
Meituan-AutoML/VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
Language:Python364 23 610
mira-space/MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
Language:Python363 14 159
sail-sg/zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
Language:Python277 7 2615
Picsart-AI-Research/VideoINR-Continuous-Space-Time-Super-Resolution
[CVPR 2022] VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
Language:Python273 5 1927
NJU-PCALab/OpenVid-1M
Language:Python186 3 144
valeoai/Maskgit-pytorch
unofficial MaskGIT reproduction in PyTorch
Language:Jupyter Notebook161 6 1815
gladia-research-group/multi-source-diffusion-models
Language:Python150 12 1112
danier97/LDMVFI
[AAAI'2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull
Language:Python138 6 2514
XiaolongTang23/HPNet
[CVPR 2024] HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention
Language:Python131 2 1917
liuggchen/wechatDatDecode
微信dat文件解码，Windows系统下载exe文件可直接使用。
Language:Go100 3 220
PhyscalX/gradio-image-prompter
Image Prompter for Gradio
Language:JavaScript72 2 712
LiuDongyang6/METR
A Simple Romance Between Multi-Exit Vision Transformer and Token Reduction (ICLR 2024)
Language:Python1 2 00