cbstyle

cbstyle's Stars

360CVGroup/Bridge_Diffusion_Model
Latent diffusion method for non-English language native Text-to-Image generation
Language:Python31
XLabs-AI/x-flux
Language:Python1.5k110
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
Language:TeX37720
SonyResearch/micro_diffusion
Repo is required for the code of our research paper on micro-budget training of large scale diffusion model.
1521
feizc/DiT-MoE
Scaling Diffusion Transformers with Mixture of Experts
Language:Python1968
InsaneLife/ChineseNLPCorpus
中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。
Language:Python4.3k784
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language:Python2.7k175
sayakpaul/cmmd-pytorch
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
Language:Python935
TIGER-AI-Lab/Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
Language:Python17114
baofff/U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
Language:Jupyter Notebook90660
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Language:Python48219
Anima-Lab/MaskDiT
Code for Fast Training of Diffusion Models with Masked Transformers
Language:Python36013
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language:Python1.2k94
NVlabs/stylegan
StyleGAN - Official TensorFlow Implementation
Language:Python14.1k3.2k
NVlabs/stylegan3
Official PyTorch implementation of StyleGAN3
Language:Python6.4k1.1k
amusi/ECCV2024-Papers-with-Code
ECCV 2024 论文和开源项目合集，同时欢迎各位大佬提交issue，分享ECCV 2024论文和开源项目
1.9k255
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
Language:Python63872
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python15k1.1k
mira-space/MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
Language:Python3589
frank-xwang/InstanceDiffusion
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
Language:Python49125
hustvl/DiG
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Language:Python1133
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language:Python2k86
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Python3.4k289
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Language:Python1.1k56
Kwai-Kolors/Kolors
Kolors Team
Language:Python3.7k248
alipay/Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
Language:Python1155
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Language:Python1.6k79
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
1.7k74
rafaelpadilla/Object-Detection-Metrics
Most popular metrics used to evaluate object detection algorithms.
Language:Python5k1k
silent-chen/layout-guidance
[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance
Language:Python23212