Pinned Repositories
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
awesome-speech-recognition-speech-synthesis-papers
automatic speech recognition paper roadmap, including HMM, DNN, RNN, CNN, Seq2Seq, Attention
CommonCorrections
Easily fix common corrections in speech!
freecyclescraper
Tells you by voice the free stuff near you from FREECYCLE as soon as it arrives (so you can be first)
kDS2iOS
KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
pyaudioconvert
Simple utility to convert audio from one form to another (via sox)
SimplePythonWER
Simple WER calculation in python that's pip installable without dependencies
SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
robmsmt's Repositories
robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
robmsmt/ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
robmsmt/SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
robmsmt/kDS2iOS
robmsmt/CommonCorrections
Easily fix common corrections in speech!
robmsmt/pyaudioconvert
Simple utility to convert audio from one form to another (via sox)
robmsmt/ReflectLog
robmsmt/SimplePythonWER
Simple WER calculation in python that's pip installable without dependencies
robmsmt/atlas
Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)
robmsmt/audio
robmsmt/cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
robmsmt/DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
robmsmt/DeepSpeech-1
A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.
robmsmt/docker-datasci-cpu
Generic ubuntu based Docker image using CPU libs for DS & ML
robmsmt/English-to-IPA
Converts English text to IPA notation
robmsmt/float-toy
Use this to build intuition for the IEEE floating-point format
robmsmt/gantts
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
robmsmt/glove-gensim
Converting GloVe vectors into word2vec format for easy usage with Gensim
robmsmt/kafka-py27-base-docker
robmsmt/KerasResNetAPI
robmsmt/llm.c
LLM training in simple, raw C/CUDA
robmsmt/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
robmsmt/orbit
Dumb planets
robmsmt/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
robmsmt/python-ftfy
Fixes mojibake and other glitches in Unicode text, after the fact.
robmsmt/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
robmsmt/robmsmt.github.io
robmsmt/spaCy
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
robmsmt/Squeezeformer
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
robmsmt/zsh-utils
A minimal, opinionated set of ZSH plugins designed to be small, simple, and focused.