hifi-gan
There are 30 repositories under the hifi-gan topic.
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
ivcylc/qa-mdt
OpenMusic: SOTA Text-to-music (TTM) Generation
keonlee9420/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
NTT123/vietTTS
Vietnamese Text to Speech library
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.
rishikksh20/Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
nipponjo/tts-arabic-pytorch
TTS models for Arabic (Tacotron2, FastPitch)
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
keonlee9420/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to improve the robustness and efficiency of the model.
hwRG/End-to-End-TTS-Fine-Tune
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
lucadellalib/discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
NTT123/hifigan-tpu
Train HiFi-GAN on TPU
jik876/hifi-gan-demo
Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"
manhph2211/ViTTS
In this repo, I developed a step-by-step pipeline for a standard multi-speaker Text-to-Speech system. In general, I used PortaSpeech as the acoustic model and iSTFTNet as the vocoder...
ssmlkl/MnTTS2
This is the experimental description of MnTTS2.
nipponjo/tts-german-pytorch
TTS (FastPitch) for German
34j/neural-source-filter
Python package for NSF and NSF-HiFi-GAN (unofficial)
PeechApp/tts-peech
DelightfulTTS with HiFi-GAN and UnivNet vocoders
mehdihosseinimoghadam/Catalan-Text-to-Speech
Catalan Text to Speech
watchstep/glow-tts-jejueo
Jeju language speech synthesis (work in progress)
blaisewf/HiFi-SAN
HiFi-SAN: Slicing Adversarial Networks for Efficient and High Fidelity Speech Synthesis
khaykingleb/HiFi-GAN
Vocoder for TTS
lordzuko/SpeakingStyle
Aligning latent space of speaking style with human perception using a re-embedding strategy
andrew264/AudioExpts
Doing devious stuff with audio
claire-1125/POSCO_Academy_AI_Project
POSCO Youth AI·Big Data Academy - AI Project
hwRG/HiFi-GAN-Pytorch
If you have a wav file and a transcript, you can train HiFi-GAN right now.
RALYHDB/ASV-spoofing
This repository contains the code and resources associated with my Bachelor's Thesis. The project evaluates the performance of various automatic speaker verification (ASV) systems against identity spoofing attacks generated using text-to-speech (TTS) synthesis technologies.
free001style/HiFiGAN
HiFiGAN Implementation