Pinned Repositories
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
AnimateDiff
Official implementation of AnimateDiff.
AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
AnyV2V
A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
BrushNet
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
ColossalAI
Making large AI models cheaper, faster and more accessible
ConsistentID
Customized ID Consistent for human
TongHengcheng's Repositories
TongHengcheng/AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
TongHengcheng/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
TongHengcheng/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
TongHengcheng/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
TongHengcheng/ColossalAI
Making large AI models cheaper, faster and more accessible
TongHengcheng/ConsistentID
Customized ID Consistent for human
TongHengcheng/ControlNet_Plus_Plus
Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
TongHengcheng/Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
TongHengcheng/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
TongHengcheng/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
TongHengcheng/Dough
Dough is a open source tool for steering AI animations with precision.
TongHengcheng/facefusion
Next generation face swapper and enhancer
TongHengcheng/InstanceDiffusion
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
TongHengcheng/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
TongHengcheng/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
TongHengcheng/Lumina-T2X
Lumina-T2X is a model for Text to Any Modality Generation
TongHengcheng/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
TongHengcheng/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level MLLM on Your Phone
TongHengcheng/MiniGemini
Official implementation for Mini-Gemini
TongHengcheng/Monkey
【CVPR 2024】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
TongHengcheng/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
TongHengcheng/omniglue
Code release for CVPR'24 submission 'OmniGlue'
TongHengcheng/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
TongHengcheng/PowerPaint
TongHengcheng/PuLID
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
TongHengcheng/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
TongHengcheng/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
TongHengcheng/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
TongHengcheng/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
TongHengcheng/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information