coolbunnyx's Stars
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
hako-mikan/sd-webui-regional-prompter
set prompt to divided region
jbilcke-hf/ai-comic-factory
Generate comic panels using a LLM + SDXL. Powered by Hugging Face 🤗
KovenYu/WonderJourney
zideliu/StyleDrop-PyTorch
Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)
weixi-feng/LayoutGPT
Official repo for LayoutGPT
FuxiaoLiu/LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
AILab-CVC/TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters
HL-hanlin/VideoDirectorGPT
official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning (COLM 2024)
microsoft/ReCo
ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023
YangLing0818/RealCompo
[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
SooLab/Free-Bloom
[NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator