iwaterxt

iwaterxt's Stars

hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python39k 386 1.7k4.3k
google-research/bert
TensorFlow code and pre-trained models for BERT
Language:Python38.5k 1k 1.1k9.6k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.8k 425 4.2k6.4k
acheong08/ChatGPT
Reverse engineered ChatGPT API
Language:Python28.1k 290 8054.5k
mli/paper-reading
深度学习经典、新论文逐段精读
27.7k 734 02.5k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.7k 291 432.3k
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook12.2k 98 3481.6k
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.6k 70 1.3k803
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python4.6k 51 292472
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.3k 46 400490
YannickJadoul/Parselmouth
Praat in Python, the Pythonic way
Language:C++1.1k 22 77117
fbcotter/pytorch_wavelets
Pytorch implementation of 2D Discrete Wavelet (DWT) and Dual Tree Complex Wavelet Transforms (DTCWT) and a DTCWT based ScatterNet
Language:Python1k 13 54149
k2-fsa/icefall
Language:Python974 48 688310
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Language:C++583 30 208110
modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Language:Python498 13 7184
chenkui164/FastASR
这是一个用C++实现ASR推理的项目，它依赖很少，安装也很简单，推理速度很快，在树莓派4B等ARM平台也可以流畅的运行。支持的模型是由Google的Transformer模型中优化而来，数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时)，所以识别效果也很好，可以媲美许多商用的ASR软件。
Language:C492 23 7077
SpeechColab/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
Language:Python458 18 2364
zhusleep/pytorch_chinese_lm_pretrain
pytorch中文语言模型预训练
Language:Python389 8 1078
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Language:Python322 17 2844
lingjzhu/charsiu
Charsiu: A neural phonetic aligner.
Language:Jupyter Notebook285 9 1735
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
Language:Python248 4 4719
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Language:C++192 8 3836
sequitur-g2p/sequitur-g2p
This is a github repository of the abandonware Sequitur G2P by Bisani & Ney
Language:Python157 11 4555
mlyg/unified-focal-loss
Language:Python153 2 1924
Daisyqk/Automatic-Prosody-Annotation
Language:Python111 3 551
JoungheeKim/Non-Attentive-Tacotron
This is Pytorch Implementation of Google's Non-attentive Tacotron.
Language:Jupyter Notebook57 5 112
google-research-datasets/WikipediaHomographData
Labeled data for homograph disambiguation
53 7 214
choiHkk/pitch-control-vits
Language:Jupyter Notebook31 5 37
Nathan-Roll1/PSST
Prosodic Speech Segmentation with Transformers
Language:Jupyter Notebook23 4 25
ishine/PnG-BERT
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Language:Python21 2 01