shauncassini

Into AI, puzzles and chaos theory

The University of SheffieldSheffield, UK

shauncassini's Stars

m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python12.5k 139 7161.3k
google-deepmind/graphcast
Language:Python4.6k 80 92588
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python4.6k 55 115471
mjpost/sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
Language:Python1.1k 19 157164
RaivoKoot/Video-Dataset-Loading-Pytorch
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Language:Python449 5 1243
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
254 27 225
mlco2/impact
ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the results to your paper with our generated latex template
Language:HTML208 6 1737
umer-sheikh/bird-whisperer
[InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification" accepted in InterSpeech 2024 conference.
Language:Python30 2 03
just-ai/speechflow
Language:Python15 4 03
facebookresearch/emphassess
This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses paper (de Seyssel et al., 2023).
Language:Python13 4 21
nicolamendini/thinkythreads
Networks of Notes. Also available for Android on Google Play Store
Language:JavaScript1

shauncassini

shauncassini's Stars

m-bain/whisperX

google-deepmind/graphcast

huggingface/parler-tts

mjpost/sacrebleu

RaivoKoot/Video-Dataset-Loading-Pytorch

kahne/SpeechTransProgress

mlco2/impact

umer-sheikh/bird-whisperer

just-ai/speechflow

facebookresearch/emphassess

nicolamendini/thinkythreads