Pinned Repositories
algorithm
Anim-Director
Code for the SIGGRAPH Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation"
animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
AnimateDiff
Official implementation of AnimateDiff.
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Auffusion
Official code and models for the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
cvml_omnipose
publications
Kwanyoung Lee paper publications
utils
Deep learning utils for multimodal research
mobled37's Repositories
mobled37/cvml_omnipose
mobled37/algorithm
mobled37/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
mobled37/AnimateDiff
Official implementation of AnimateDiff.
mobled37/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
mobled37/Auffusion
Official code and models for the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
mobled37/ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
mobled37/ControlNet
Let us control diffusion models!
mobled37/CuAP
mobled37/publications
Kwanyoung Lee paper publications
mobled37/utils
Deep learning utils for multimodal research
mobled37/CycleNet
Official code for the NeurIPS 2023 paper "CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation"
mobled37/deep-learning-from-scratch
mobled37/Deep-Learning-Notes
Studying
mobled37/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
mobled37/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion
mobled37/GlueGen
mobled37/grok-1
Grok open release
mobled37/ImageBind
ImageBind: One Embedding Space to Bind Them All
mobled37/InfEdit
[CVPR 2024] Official implementation of "Inversion-Free Image Editing with Natural Language"
mobled37/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
mobled37/LipVoicer
Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"
mobled37/MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
mobled37/SonicDiffusion
mobled37/StableVITON
mobled37/StreamingAvatar
mobled37/StreamingAvatarSDK
Streaming Avatar SDK
mobled37/talc
mobled37/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch