iwaterxt's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
google-research/bert
TensorFlow code and pre-trained models for BERT
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
acheong08/ChatGPT
Reverse engineered ChatGPT API
mli/paper-reading
深度学习经典、新论文逐段精读
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
YannickJadoul/Parselmouth
Praat in Python, the Pythonic way
fbcotter/pytorch_wavelets
Pytorch implementation of 2D Discrete Wavelet (DWT) and Dual Tree Complex Wavelet Transforms (DTCWT) and a DTCWT based ScatterNet
k2-fsa/icefall
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
chenkui164/FastASR
这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。
SpeechColab/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
zhusleep/pytorch_chinese_lm_pretrain
pytorch中文语言模型预训练
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
lingjzhu/charsiu
Charsiu: A neural phonetic aligner.
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
sequitur-g2p/sequitur-g2p
This is a github repository of the abandonware Sequitur G2P by Bisani & Ney
mlyg/unified-focal-loss
Daisyqk/Automatic-Prosody-Annotation
JoungheeKim/Non-Attentive-Tacotron
This is Pytorch Implementation of Google's Non-attentive Tacotron.
google-research-datasets/WikipediaHomographData
Labeled data for homograph disambiguation
choiHkk/pitch-control-vits
Nathan-Roll1/PSST
Prosodic Speech Segmentation with Transformers
ishine/PnG-BERT
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS