GaryGao99's Stars
google-research/lottery-ticket-hypothesis
A reimplementation of "The Lottery Ticket Hypothesis" (Frankle and Carbin) on MNIST.
ARM-software/keyword-transformer
Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769
sovrasov/flops-counter.pytorch
Flops counter for convolutional networks in pytorch framework
Ephrem-ETH/E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
janson9192/autokws2021
phracker/MacOSX-SDKs
A collection of those pesky SDK folders: MacOSX10.1.5.sdk thru MacOSX11.3.sdk
DushyantaDhyani/kdtf
Knowledge Distillation using Tensorflow
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
miracleyoo/pytorch-lightning-template
An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much easier using this template, and keep your freedom to edit all the functions as well. Big-project-friendly as well. No need to rewrite your config in hydra.
Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
teddykoker/tinyloader
castorini/honk
PyTorch implementations of neural network models for keyword spotting
OAID/Tengine
Tengine is a lite, high performance, modular inference engine for embedded device
Tencent/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
Kitt-AI/snowboy
Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy
cupy/cupy
NumPy & SciPy for GPU
tts-tutorial/survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
roy-ht/editdistance
Fast implementation of the edit distance(Levenshtein distance)
SpeechColab/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
lowerquality/gentle
gentle forced aligner
feelins/Praat_Scripts
Some basic praat scripts.
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
philipperemy/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
AdolfVonKleist/Phonetisaurus
Phonetisaurus G2P
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
tmquan/RefineGAN