Pinned Repositories
2th_practice
practice more , study more
3th_practice
act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
AI-generated-characters
AI-generated-character
Audio2Head
code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021
clash_for_windows_pkg
A Windows/macOS/Linux GUI based on Clash
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
DH_live
每个人都能用的数字人
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
EDTalk
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
sangsongzhen's Repositories
sangsongzhen/2th_practice
practice more , study more
sangsongzhen/3th_practice
sangsongzhen/act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
sangsongzhen/AI-generated-characters
AI-generated-character
sangsongzhen/Audio2Head
code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021
sangsongzhen/clash_for_windows_pkg
A Windows/macOS/Linux GUI based on Clash
sangsongzhen/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
sangsongzhen/DH_live
每个人都能用的数字人
sangsongzhen/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
sangsongzhen/EDTalk
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
sangsongzhen/egl_probe
A helpful module for listing available GPUs for EGL rendering.
sangsongzhen/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
sangsongzhen/emotional-vits
无需情感标注的情感可控语音合成模型,基于VITS
sangsongzhen/FACEGOOD-Audio2Face
http://www.facegood.cc
sangsongzhen/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
sangsongzhen/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
sangsongzhen/line-solver
Queueing Theory Algorithms for Python, Java, and MATLAB
sangsongzhen/LiveSpeechPortraits
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
sangsongzhen/mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
sangsongzhen/practice
sangsongzhen/QueueingTheorySimulation-Matlab-Supermarket
这是一个使用Matlab对超市排队系统进行模拟仿真项目
sangsongzhen/robomimic
robomimic: A Modular Framework for Robot Learning from Demonstration
sangsongzhen/sangsongzhen
Config files for my GitHub profile.
sangsongzhen/Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
sangsongzhen/SpeechEnhancement
基于深度学习的语音增强工具(Speech Enhancement Tools Based on Deep Learning)
sangsongzhen/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference