Pinned Repositories
CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
CS-Notes
Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networks, system design
DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
flash-attention
Fast and memory-efficient exact attention
KAN-TTS
mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
merlin
This is now the official location of the Merlin project.
ML-NLP
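This project covers the knowledge points and code implementations frequently tested in machine learning and NLP interviews, which are also the theoretical fundamentals every algorithm engineer must know.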
PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
emmacirl's Repositories
emmacirl/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
emmacirl/CS-Notes
Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networks, system design
emmacirl/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
emmacirl/flash-attention
Fast and memory-efficient exact attention
emmacirl/KAN-TTS
emmacirl/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
emmacirl/merlin
This is now the official location of the Merlin project.
emmacirl/ML-NLP
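This project covers the knowledge points and code implementations frequently tested in machine learning and NLP interviews, which are also the theoretical fundamentals every algorithm engineer must know.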
emmacirl/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
emmacirl/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
emmacirl/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
emmacirl/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model
emmacirl/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
emmacirl/tensorflow
Computation using data flow graphs for scalable machine learning
emmacirl/TensorFlow-Book
Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
emmacirl/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
emmacirl/Transformer-TTS
TTS model based on Transformer.
emmacirl/tutorials
Machine learning tutorials
emmacirl/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
emmacirl/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
emmacirl/whisper
Robust Speech Recognition via Large-Scale Weak Supervision