ZhaoZeqing's Stars
google/styleguide
Style guides for Google-originated open-source projects
google/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
WeNeedHome/SummaryOfLoanSuspension
Summary of loan-suspension notices from provinces and cities across China
yeemachine/kalidokit
Blendshape and kinematics calculator for MediaPipe/TensorFlow.js Face, Eyes, Pose, and Finger tracking models.
xiangyuecn/Recorder
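HTML5 JS audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats; supports PC, some Android and iOS browsers, Hybrid Apps (Android and iOS app source code provided), and WeChat; provides ASR speech-to-text, an H5 voice call/chat example, and DTMF encoding/decoding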
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
kwai/DouZero
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | DouDizhu AI
baidu-research/warp-ctc
Fast parallel CTC.
clovaai/stargan-v2
StarGAN v2 - Official PyTorch Implementation (CVPR 2020)
onnx/tutorials
Tutorials for creating and using ONNX models
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
FACEGOOD/FACEGOOD-Audio2Face
http://www.facegood.cc
microsoft/NeuralSpeech
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
TencentGameMate/chinese_speech_pretrain
Chinese speech pretrained models
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
soubhiksanyal/RingNet
Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision
EvelynFan/FaceFormer
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
jia-zhuang/pytorch-multi-gpu-training
A summary of methods and principles for single-machine multi-GPU training in PyTorch
cnlinxi/book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
JimWest/MeFaMo
TimoBolkart/TF_FLAME
TensorFlow framework for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 2D or 3D keypoints, and generate textured head meshes from images.
yl4579/StyleTTS
Official Implementation of StyleTTS
facebookresearch/CPC_audio
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
mindslab-ai/assem-vc
Official Code for Assem-VC @ICASSP2022
tencent-ailab/bddm
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
kahnchana/opl
Official repository for "Orthogonal Projection Loss" (ICCV'21)
rishikksh20/gmvae_tacotron
Gaussian Mixture VAE Tacotron
benject/mirror
Mirror: a Maya facial capture animation toolkit based on MediaPipe