hblflybird's Stars

AFeng-x/PixWizard
74
cswry/OSEDiff
Language:Python1509
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Language: Python, 1.5k stars, 116 forks
nerfstudio-project/gsplat
CUDA accelerated rasterization of gaussian splatting
Language: Cuda, 2k stars, 248 forks
ybkurt/VIFT
7
mingukkang/elatentlpips
Author's Implementation for E-LatentLPIPS
Language:Python621
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Language:Python82741
StevenBaby/doudizhu
Language:Python31
KupynOrest/head_detector
Official repo for VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads.
Language:Python1356
XLabs-AI/x-flux
Language: Python, 1.4k stars, 102 forks
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python, 14.5k stars, 1k forks
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python, 7.9k stars, 732 forks
johndpope/IMF
Implicit Motion Function - (unofficial) Microsoft recreation
Language:Python91
krennic999/STAR
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
1131
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Language: Python, 11.3k stars, 1k forks
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷 Provides higher-quality, richer, and more "digestible" data for large models!
Language: Python, 2.6k stars, 166 forks
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Language:Python41723
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language: Python, 3.7k stars, 584 forks
fusiming3/MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
812
roboflow/supervision
We write your reusable computer vision tools. 💜
Language: Python, 22.9k stars, 1.7k forks
Vchitect/LaVie
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
Language:Python85260
G-U-N/Motion-I2V
[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
Language:Python918
gojasper/flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Language:Python45733
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
Language:Python39531
lucidrains/gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
Language: Python, 1.8k stars, 104 forks
zh460045050/VQGAN-LC
Language:Python946
ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language: Python, 1.6k stars, 190 forks
AvLab-CV/PASL
Language:Python91
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language: Python, 10.2k stars, 1k forks
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Language: Python, 5.2k stars, 436 forks