TinyZeaMays's Stars
unity-research/IP-Adapter-Instruct
IP Adapter Instruct
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
agentic-learning-ai-lab/procreate-diffusion
Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"
black-forest-labs/flux
Official inference repo for FLUX.1 models
pytorch/tensordict
TensorDict is a pytorch dedicated tensor container.
LituRout/RB-Modulation
Reference-Based Modulation (RB-Modulation)
mrflogs/LoRA-Pro
Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "
karpathy/LLM101n
LLM101n: Let's build a Storyteller
evilsocket/cake
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
SeonmiP/KineTy
Official Code for "Kinetic Typography Diffusion Model (ECCV 2024)"
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
lucidrains/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
hehonghui/awesome-english-ebooks
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
antgroup/echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Fangkang515/CE3D
Official Implementation of ECCV2024 paper: Chat Edit 3D: Interactive 3D Scene Editing via Text Prompts
Kwai-Kolors/Kolors
Kolors Team
Zulko/moviepy
Video editing with Python
mayuelala/FollowYourEmoji
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
OPPO-Mente-Lab/GlyphDraw2
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
muzishen/RCDMs
[AAAI 2025] Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong semantic and temporal consistency, integrating rich contextual conditions and enabling one-pass inference for enhanced coherence.
theEricMa/ScaleDreamer
This is the official repository for ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation [ECCV2024]
evahuman/EVA
KwaiVGI/LivePortrait
Bring portraits to life!
Character-Adapter/Character-Adapter
sjtuplayer/SuperSVG
[CVPR 2024] SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
snap-research/4Real
Towards Photorealistic 4D Scene Generation via Video Diffusion Models
VITA-Group/4DGen
"4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei
maswang32/hearinganythinganywhere
Hearing Anything Anywhere Code Release
czg1225/AsyncDiff
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising