Pinned Repositories
hand-to-diffusion
Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation"
DiT4Edit
anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
MoLE
An official pytorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts"
Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
convnet
convnet
mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
vd001.github.io
mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
vd001's Repositories
vd001/convnet
convnet
vd001/Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
vd001/mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
vd001/vd001.github.io