Pinned Repositories
AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
AIchessgame
AudioLDM2_TTS
AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
Deploying-RAG-on-Kubernetes-with-Jenkins-for-Legal-Document-Retrieval
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Flask-Web
Train_Hifigan_XTTS
An implementation for training the HiFi-GAN component of the XTTSv2 model using Coqui/TTS.
xtts-streaming-server
tuanh123789's Repositories
tuanh123789/AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
tuanh123789/Train_Hifigan_XTTS
An implementation for training the HiFi-GAN component of the XTTSv2 model using Coqui/TTS.
tuanh123789/AudioLDM2_TTS
tuanh123789/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
tuanh123789/Flask-Web
tuanh123789/xtts-streaming-server
tuanh123789/AIchessgame
tuanh123789/AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
tuanh123789/Deploying-RAG-on-Kubernetes-with-Jenkins-for-Legal-Document-Retrieval
tuanh123789/face
tuanh123789/FastSpeech2_S
tuanh123789/Hey_siri_wuw
tuanh123789/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
tuanh123789/F5-TTS-inference
tuanh123789/f5-tts-trtllm
tuanh123789/high-frequency-vocabulary
30,000 most common English words with Chinese dictionary explanations in order of frequency.
tuanh123789/PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
tuanh123789/PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
tuanh123789/segmentation
tuanh123789/split_text
tuanh123789/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
tuanh123789/Textclassifier
tuanh123789/train_ngram_model
tuanh123789/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
tuanh123789/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
tuanh123789/tuanh123789.github.io
tuanh123789/vietnamese_text_normalize
tuanh123789/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
tuanh123789/xtts-cpp-python
Python bindings for xtts.cpp using ggml-python
tuanh123789/xtts-finetune-webui
Slightly improved version of the official tool for fine-tuning XTTS