liuyanfeier's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
chinese-poetry/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
karpathy/llm.c
LLM training in simple, raw C/CUDA
iDvel/rime-ice
Rime 配置:雾凇拼音 | 长期维护的简体词库
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
espnet/espnet
End-to-End Speech Processing Toolkit
UFund-Me/Qbot
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
niderhoff/nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
daquexian/onnx-simplifier
Simplify your onnx model
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
pndurette/gTTS
Python library and CLI tool to interface with Google Translate's text-to-speech API
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
ssnhd/googlevoice
注册 Google Voice 号码详细步骤
wilicc/gpu-burn
Multi-GPU CUDA stress test
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
k2-fsa/icefall
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
savoirfairelinux/num2words
Modules to convert numbers to words. 42 --> forty-two
huggingface/community-events
Place where folks can contribute to 🤗 community events
thu-spmi/CAT
A CRF-based ASR Toolkit
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
shjwudp/shu
中文书籍收录整理, Collection of Chinese Books
wenet-e2e/west
We Speech Transcript based on LLM, in 300 lines of code.
MozillaItalia/DeepSpeech-Italian-Model
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
sweekarsud/Goodness-of-Pronunciation
idiap/pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
lqfeng/ChineseCharacters
中文繁体和简体字符对照表