maobenz

NUS ECE phd student

Beijing China

maobenz's Stars

showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Language:Python4.3k 51 97386
thu-ml/prolificdreamer
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
Language:Python1.5k 110 2745
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python1.1k 15 4746
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Language:Python1k 10 3374
Tangshitao/MVDiffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion, NeurIPS 2023 (spotlight)
Language:Python510 22 5227
showlab/Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
276 16 011
showlab/sparseformer
(ICLR 2024, CVPR 2024) SparseFormer
Language:Python65 9 32
showlab/ShowRoom3D
This is the project page of ShowRoom3D
25 4 32