kahne's Stars
algorithm-visualizer/algorithm-visualizer
:fireworks:Interactive Online Platform that Visualizes Algorithms from Code
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
VowpalWabbit/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
jason718/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
syhw/wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
plasticityai/magnitude
A fast, efficient universal vector embedding utility package.
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
lucasjinreal/alfred
alfred-py: A deep learning utility library for **human**, more detail about the usage of lib to: https://zhuanlan.zhihu.com/p/341446046
dmort27/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
facebookresearch/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
Unbabel/COMET
A Neural Framework for MT Evaluation
AdolfVonKleist/Phonetisaurus
Phonetisaurus G2P
as-ideas/DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
facebookresearch/covost
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
kakaobrain/g2pm
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
facebookresearch/DME
Dynamic Meta-Embeddings for Improved Sentence Representations
kahne/NonAutoregGenProgress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
bytedance/neurst
Neural end-to-end Speech Translation Toolkit
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
kahne/fastwer
A PyPI package for fast word/character error rate (WER/CER) calculation
kamperh/eskmeans
Embedded segmental K-means (ES-KMeans) in Python.