Pinned Repositories
phonemizer
Simple text-to-phones converter for multiple languages
voxceleb_trainer
In defence of metric learning for speaker recognition
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
python-audio-separator
Easy-to-use stem (e.g. instrumental/vocals) separation from the CLI or as a Python package, using a variety of amazing pre-trained models (primarily from UVR)
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Speech-Editing-Toolkit
It's a repository for implementations of neural speech editing algorithms.
shreeshailgan's Repositories
shreeshailgan doesn’t have any repositories yet.