Pinned Repositories
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
angfederi.comments
asr-server
FastCGI support for Kaldi ASR
avalanche
Avalanche: a End-to-End Library for Continual Learning.
bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Dialogical-Emotion-Decoding
[ICASSP20] A Dialogical Emotion Decoder For Speech Emotion Recognition in Spoken Dialog
docker-mirakurun-epgstation
flipside_ph
A Kaldi-based set of experimental recipes for Filipino ASR
feddybear's Repositories
feddybear/flipside_ph
A Kaldi-based set of experimental recipes for Filipino ASR
feddybear/docker-mirakurun-epgstation
feddybear/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
feddybear/angfederi.comments
feddybear/avalanche
Avalanche: a End-to-End Library for Continual Learning.
feddybear/bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
feddybear/cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
feddybear/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
feddybear/Dialogical-Emotion-Decoding
[ICASSP20] A Dialogical Emotion Decoder For Speech Emotion Recognition in Spoken Dialog
feddybear/dynitag
Collaborative audio annotation tool
feddybear/espnet
End-to-End Speech Processing Toolkit
feddybear/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
feddybear/flutter_sherpa_onnx
Flutter plugin wrapping the Sherpa-ONNX runtime
feddybear/grpc-web
gRPC for Web Clients
feddybear/jovo-framework
🔈 The Open Source Voice Layer: Build Voice Experiences for Alexa, Google Assistant, Samsung Bixby, Web Apps, and much more
feddybear/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
feddybear/label-studio-frontend
Data labeling react app that is backend agnostic and can be embedded into your applications — distributed as an NPM package
feddybear/lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
feddybear/mysite
My site.
feddybear/ParallelWaveGAN
Unofficial Parallel WaveGAN implementation with Pytorch
feddybear/pychain
PyTorch implementation of LF-MMI for End-to-end ASR
feddybear/returnn
The RWTH extensible training framework for universal recurrent neural networks
feddybear/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
feddybear/sew
feddybear/speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
feddybear/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
feddybear/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
feddybear/tongrams
The C++ library implementing the compressed data structures described in the paper "Efficient Data Structures for Massive N-Gram Datasets", by Giulio Ermanno Pibiri and Rossano Venturini, published in ACM SIGIR 2017.
feddybear/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
feddybear/wavesurfer.js
Navigable waveform built on Web Audio and Canvas