shauncassini's Stars
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
google-deepmind/graphcast
huggingface/parler-tts
Inference and training library for high-quality TTS models.
mjpost/sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
RaivoKoot/Video-Dataset-Loading-Pytorch
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
mlco2/impact
ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the results to your paper with our generated LaTeX template.
umer-sheikh/bird-whisperer
[InterSpeech 2024] Official code repository for the paper "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification".
just-ai/speechflow
facebookresearch/emphassess
This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAssess paper (de Seyssel et al., 2023).
nicolamendini/thinkythreads
Networks of Notes. Also available for Android on the Google Play Store.