vd001

Pinned Repositories

hand-to-diffusion
Official PyTorch Implementation of "Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation"
25 7 10
DiT4Edit
25 7 20
anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
Language:Python704 11 4536
MoLE
An official pytorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts"
Language:Python26 3 40
Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
0 0 00
convnet
convnet
Language:HTML4 1 01
mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Language:Python0 0 00
vd001.github.io
0 1 00
mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Language:Python2.4k 30 233177

vd001's Repositories

vd001/convnet
convnet
Language:HTML4 1 01
vd001/Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
0 0 00
vd001/mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Language:Python0 0 00
vd001/vd001.github.io
0 1 00