hblflybird's Stars
AFeng-x/PixWizard
cswry/OSEDiff
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
nerfstudio-project/gsplat
CUDA accelerated rasterization of gaussian splatting
ybkurt/VIFT
mingukkang/elatentlpips
Author's Implementation for E-LatentLPIPS
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
StevenBaby/doudizhu
KupynOrest/head_detector
Official repo for VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads.
XLabs-AI/x-flux
black-forest-labs/flux
Official inference repo for FLUX.1 models
THUDM/CogVideo
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
johndpope/IMF
Implicit Motion Function - (unofficial) Microsoft recreation
krennic999/STAR
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷 Providing higher-quality, richer, and more "digestible" data for large models!
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
fusiming3/MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
roboflow/supervision
We write your reusable computer vision tools. 💜
Vchitect/LaVie
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
G-U-N/Motion-I2V
[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
gojasper/flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
lucidrains/gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
zh460045050/VQGAN-LC
ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
AvLab-CV/PASL
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation