Pinned Repositories
ACSSR
bark
🔊 Text-Prompted Generative Audio Model
faster-whisper
Faster Whisper transcription with CTranslate2
LLRT_whisper
A not very efficient attempt to create a real-time openai/whisper (audio-to-text transcriber)
Multilingual-PR
Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. In this project, we specifically compared three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022), each pretrained on a corpus of English speech, which we use in various ways to perform phoneme recognition for different languages with
Paper-Reading
📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).
PhySO
Physical Symbolic Optimization
torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
whisper_real_time
Real-time transcription with OpenAI Whisper.
haloha123's Repositories
haloha123/faster-whisper
Faster Whisper transcription with CTranslate2
haloha123/ACSSR
haloha123/bark
🔊 Text-Prompted Generative Audio Model
haloha123/LLRT_whisper
A not very efficient attempt to create a real-time openai/whisper (audio-to-text transcriber)
haloha123/Multilingual-PR
Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. In this project, we specifically compared three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022), each pretrained on a corpus of English speech, which we use in various ways to perform phoneme recognition for different languages with
haloha123/Paper-Reading
📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).
haloha123/PhySO
Physical Symbolic Optimization
haloha123/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
haloha123/whisper_real_time
Real-time transcription with OpenAI Whisper.