cbstyle's Stars
360CVGroup/Bridge_Diffusion_Model
Latent diffusion method for non-English language native Text-to-Image generation
XLabs-AI/x-flux
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
SonyResearch/micro_diffusion
Code for our research paper on micro-budget training of large-scale diffusion models.
feizc/DiT-MoE
Scaling Diffusion Transformers with Mixture of Experts
InsaneLife/ChineseNLPCorpus
Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
sayakpaul/cmmd-pytorch
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
TIGER-AI-Lab/Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
baofff/U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Anima-Lab/MaskDiT
Code for Fast Training of Diffusion Models with Masked Transformers
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
NVlabs/stylegan
StyleGAN - Official TensorFlow Implementation
NVlabs/stylegan3
Official PyTorch implementation of StyleGAN3
amusi/ECCV2024-Papers-with-Code
A collection of ECCV 2024 papers and open-source projects. Issues sharing ECCV 2024 papers and open-source projects are welcome.
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
black-forest-labs/flux
Official inference repo for FLUX.1 models
mira-space/MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
frank-xwang/InstanceDiffusion
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
hustvl/DiG
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Kwai-Kolors/Kolors
Kolors Team
alipay/Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
rafaelpadilla/Object-Detection-Metrics
Most popular metrics used to evaluate object detection algorithms.
silent-chen/layout-guidance
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance