wsr692's Stars
wsr692/doks
Hugo theme helping you build modern documentation websites.
dmlc/dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
ioannist/dynamodb-faiss-builder
Lambda function that creates a Faiss index for a DynamoDB table
cptcrunchy/sound-board
facebookresearch/textlesslib
Library for Textless Spoken Language Processing
dobby-seo/kosr
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
ramonsanabria/speech-reading-list
A speech recognition research reading list maintained by Ramon Sanabria
jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
kamranahmedse/design-patterns-for-humans
An ultra-simplified explanation to design patterns
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
ratsgo/speechbook
articles about speech recognition
espnet/espnet
End-to-End Speech Processing Toolkit
katspaugh/wavesurfer.js
Audio waveform player
DeepSE/deeplearning-models
A collection of various deep learning architectures, models, and tips
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
auspicious3000/autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
hoaaoh/Audio2Vec
Audio2Vec with multi lingual