mobled37

Multimodal AI Lab

Hanyang UniversitySeoul

Pinned Repositories

algorithm
Language:Python0 1 00
Anim-Director
The codes of Siggraph Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation"
Language:Python00
animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
Language:Python00
AnimateDiff
Official implementation of AnimateDiff.
Language:Python0 0 00
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language:Python0 0 00
Auffusion
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Language:Jupyter Notebook0 0 00
ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Language:Python0 0 00
cvml_omnipose
Language:Python1 1 00
publications
Kwanyoung Lee paper publications
0 1 00
utils
Deeplearning utils for multimodal research
Language:Python0 1 00

mobled37's Repositories

mobled37/cvml_omnipose
Language:Python1 1 00
mobled37/algorithm
Language:Python0 1 00
mobled37/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
Language:Python00
mobled37/AnimateDiff
Official implementation of AnimateDiff.
Language:Python0 0 00
mobled37/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language:Python0 0 00
mobled37/Auffusion
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Language:Jupyter Notebook0 0 00
mobled37/ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Language:Python0 0 00
mobled37/ControlNet
Let us control diffusion models!
Language:Python0 0 00
mobled37/CuAP
Language:Python0 1 00
mobled37/publications
Kwanyoung Lee paper publications
0 1 00
mobled37/utils
Deeplearning utils for multimodal research
Language:Python0 1 00
mobled37/CycleNet
Official Code for NeurIPS 2023 Paper: CycleNet: Rethinking Cycle Consistent in Text‑Guided Diffusion for Image Manipulation
mobled37/deep-learning-from-scratch
Language:Jupyter Notebook1 0
mobled37/Deep-Learning-Notes
Studying
1 0
mobled37/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python0 0
mobled37/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion
mobled37/GlueGen
Language:Python
mobled37/grok-1
Grok open release
mobled37/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python0 0
mobled37/InfEdit
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
Language:Python0 0
mobled37/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook0 0
mobled37/LipVoicer
Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"
Language:Python0 0
mobled37/MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Language:Python0 0
mobled37/SonicDiffusion
Language:Jupyter Notebook0 0
mobled37/StableVITON
Language:Python0 0
mobled37/StreamingAvatar
Language:JavaScript0 0
mobled37/StreamingAvatarSDK
Streaming Avatar SDK
Language:TypeScript0 0
mobled37/talc
mobled37/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python0 0