haloha123

Pinned Repositories

ACSSR
Language:Python0 0 00
bark
🔊 Text-Prompted Generative Audio Model
Language:Python0 0 00
faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python1 0 00
LLRT_whisper
A not very efficient attempt to create a real time openai/whisper (Audio to Text Transcriber)
Language:Python00
Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with
Language:Python0 0 00
Paper-Reading
📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).
0 0 00
PhySO
Physical Symbolic Optimization
Language:Python00
torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Language:Python0 0 00
whisper_real_time
Real time transcription with OpenAI Whisper.
Language:Python0 0 00

haloha123's Repositories

haloha123/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python1 0 00
haloha123/ACSSR
Language:Python0 0 00
haloha123/bark
🔊 Text-Prompted Generative Audio Model
Language:Python0 0 00
haloha123/LLRT_whisper
A not very efficient attempt to create a real time openai/whisper (Audio to Text Transcriber)
Language:Python00
haloha123/Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with
Language:Python0 0 00
haloha123/Paper-Reading
📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).
0 0 00
haloha123/PhySO
Physical Symbolic Optimization
Language:Python00
haloha123/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Language:Python0 0 00
haloha123/whisper_real_time
Real time transcription with OpenAI Whisper.
Language:Python0 0 00