feddybear

Nothing much to see here.

Tokyo, Japan

Pinned Repositories

academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript0 1 00
angfederi.comments
0 1 00
asr-server
FastCGI support for Kaldi ASR
Language:C++0 1 00
avalanche
Avalanche: a End-to-End Library for Continual Learning.
Language:Python0 1 00
bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Language:Shell0 1 00
coolgpus_kyawa2fork
GPU fan control for headless Linux
Language:Python00
cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
Language:Python0 1 00
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language:Python0 0 00
docker-mirakurun-epgstation
Language:TypeScript1 1 00
flipside_ph
A Kaldi-based set of experimental recipes for Filipino ASR
Language:Jupyter Notebook7 4 31

feddybear's Repositories

feddybear/flipside_ph
A Kaldi-based set of experimental recipes for Filipino ASR
Language:Jupyter Notebook7 4 31
feddybear/docker-mirakurun-epgstation
Language:TypeScript1 1 00
feddybear/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript0 1 00
feddybear/angfederi.comments
0 1 00
feddybear/avalanche
Avalanche: a End-to-End Library for Continual Learning.
Language:Python0 1 00
feddybear/bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Language:Shell0 1 00
feddybear/coolgpus_kyawa2fork
GPU fan control for headless Linux
Language:Python00
feddybear/cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
Language:Python0 1 00
feddybear/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language:Python0 0 00
feddybear/Dialogical-Emotion-Decoding
[ICASSP20] A Dialogical Emotion Decoder For Speech Emotion Recognition in Spoken Dialog
Language:Python0 1 00
feddybear/dynitag
Collaborative audio annotation tool
Language:JavaScript0 1 00
feddybear/espnet
End-to-End Speech Processing Toolkit
Language:Python0 0 00
feddybear/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Language:Python1 0
feddybear/flutter_sherpa_onnx
Flutter plugin wrapping the Sherpa-ONNX runtime
Language:Dart0 0
feddybear/grpc-web
gRPC for Web Clients
Language:C++1 0
feddybear/jovo-framework
🔈 The Open Source Voice Layer: Build Voice Experiences for Alexa, Google Assistant, Samsung Bixby, Web Apps, and much more
Language:TypeScript1 0
feddybear/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language:Python1 0
feddybear/label-studio-frontend
Data labeling react app that is backend agnostic and can be embedded into your applications — distributed as an NPM package
Language:JavaScript1 0
feddybear/lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
Language:Python1 0
feddybear/mysite
My site.
Language:HTML2 0
feddybear/ParallelWaveGAN
Unofficial Parallel WaveGAN implementation with Pytorch
Language:Python1 0
feddybear/pychain
PyTorch implementation of LF-MMI for End-to-end ASR
Language:C++1 0
feddybear/returnn
The RWTH extensible training framework for universal recurrent neural networks
Language:Python1 0
feddybear/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
feddybear/sew
Language:Python1 0
feddybear/speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Language:JavaScript1 0
feddybear/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
Language:Python1 0
feddybear/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Language:Python1 0
feddybear/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Language:C++1 0
feddybear/wavesurfer.js
Navigable waveform built on Web Audio and Canvas
Language:JavaScript1 0