junleen's Stars
2noise/ChatTTS
A generative speech model for daily dialogue.
s0md3v/roop
one-click face swap
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack inference, training, and deployment capabilities.
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
hustvl/4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
dvlab-research/ControlNeXt
Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
open-mmlab/mmhuman3d
OpenMMLab 3D Human Parametric Model Toolbox and Benchmark
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
IDEA-Research/X-Pose
[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"
lisiyao21/Bailando
Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"
FacePerceiver/facer
Face analysis tools for modern research, equipped with state-of-the-art Face Parsing and Face Alignment
IDEA-Research/HumanTOMATO
[ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation
IDEA-Research/MotionLLM
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos
RenderMe-360/RenderMe-360
RenderMe-360: Large Digital Asset Library and Benchmark Towards High-fidelity Head Avatars
kepengxu/PGTFormer
[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer
GAP-LAB-CUHK-SZ/MVHumanNet
cvlab-kaist/MoDiTalker
oneThousand1000/Portrait3D
(SIGGRAPH 2024) Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior
tobias-kirschstein/diffusion-avatars
haoz19/MagicPose4D
Code for MagicPose4D: Crafting Articulated Models with Appearance and Motion Control
ZhengdiYu/SignAvatars
SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
HG-ha/SenseVoice-Api
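A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX, with a quantized model included and GPU support. Supports speech recognition from files given by URL.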
junleen/ViCoFace