TongHengcheng

Aire

Pinned Repositories

AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
Language:C++
AnimateDiff
Official implementation of AnimateDiff.
Language:Python
AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
Language:Python
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language:Python
AnyV2V
A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Language:Jupyter Notebook
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
BrushNet
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language:Python
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python
ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python
ConsistentID
Customized ID Consistent for human
Language:Python

TongHengcheng's Repositories

TongHengcheng/AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
Language:Python
TongHengcheng/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language:Python
TongHengcheng/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
TongHengcheng/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python
TongHengcheng/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python
TongHengcheng/ConsistentID
Customized ID Consistent for human
Language:Python
TongHengcheng/ControlNet_Plus_Plus
Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Language:Python
TongHengcheng/Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Language:Python
TongHengcheng/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language:Python
TongHengcheng/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Language:Jupyter Notebook
TongHengcheng/Dough
Dough is an open-source tool for steering AI animations with precision.
Language:Python
TongHengcheng/facefusion
Next generation face swapper and enhancer
Language:Python
TongHengcheng/InstanceDiffusion
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
TongHengcheng/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
TongHengcheng/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model with performance approaching GPT-4V.
TongHengcheng/Lumina-T2X
Lumina-T2X is a model for Text to Any Modality Generation
TongHengcheng/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
TongHengcheng/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level MLLM on Your Phone
Language:Python
TongHengcheng/MiniGemini
Official implementation for Mini-Gemini
TongHengcheng/Monkey
[CVPR 2024] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python
TongHengcheng/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
TongHengcheng/omniglue
Code release for CVPR'24 submission 'OmniGlue'
TongHengcheng/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We sincerely hope the whole open-source community can contribute to this project.
TongHengcheng/PowerPaint
TongHengcheng/PuLID
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
TongHengcheng/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
TongHengcheng/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
TongHengcheng/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
TongHengcheng/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
TongHengcheng/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Language:Python