ycchuang

ycchuang's Stars

suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 35.5k 328 4374.2k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python 30.3k 428 4.2k 6.4k
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
18k 369 241.5k
ExistentialAudio/BlackHole
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
Language: C 15k 124 399581
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
Language: Python 9.9k 114 707883
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
Language: Python 9.9k 108 2k 895
brightmart/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
9.4k 286 451.5k
lancopku/pkuseg-python
The pkuseg toolkit for multi-domain Chinese word segmentation
Language: Python 6.5k 208 165986
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It applies models such as KenLM, T5, MacBERT, ChatGLM3, and LLaMA to correction tasks and works out of the box.
Language: Python 5.5k 84 4701.1k
pytorch/serve
Serve, optimize and scale PyTorch models in production
Language: Java 4.2k 57 1.6k 850
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language: Python 4.1k 90 1k 1.1k
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
Language: C++ 2.5k 71 369512
mckinsey/causalnex
A Python library that helps data scientists to infer causation rather than observing correlation.
Language: Python 2.2k 49 139257
Music-and-Culture-Technology-Lab/omnizart
Omniscient Mozart, able to transcribe everything in music, including vocals, drums, chords, beats, instruments, and more.
Language: Python 1.6k 25 7699
moses-smt/mosesdecoder
Moses, the machine translation system
Language: Roff 1.6k 152 79775
travistangvh/ChatGPT-Data-Science-Prompts
A repository of 60 useful data science prompts for ChatGPT
1.4k 35 0252
microsoft/NeuralSpeech
Language: Python 1.4k 33 124185
bastibe/SoundCard
A Pure-Python Real-Time Audio Library
Language: Python 681 20 13469
Ailln/cn2an
📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)
Language: Python 660 6 6981
bojone/word-discovery
Faster and more accurate new-word discovery for Chinese
Language: Python 508 11 12103
kakaobrain/g2pm
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Language: Python 336 15 1872
awslabs/mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020)
Language: Python 333 15 2159
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin (Bopomofo) or Pinyin (INTERSPEECH 2022)
Language: Python 278 5 1738
belambert/asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Language: Python 268 15 978
intxcc/pyaudio_portaudio
A fork to record speaker output with python. PyAudio with PortAudio for Windows | Extended | Loopback | WASAPI | Latest precompiled Version
Language: C 245 15 4361
aparrish/phonetic-similarity-vectors
Source code to accompany my paper "Poetic sound similarity vectors using phonetic features"
Language: Jupyter Notebook 166 4 116
kmario23/KenLM-training
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
112 6 721
gpodder/gpodder.github.io
Collaboratively-maintained gPodder website
Language: HTML 35 9 1039
esun-ai/phonetic_mlm
Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition
Language: Python 175
samemon/Voice-to-Age
This program determines the age range of a person from their voice. It uses a simple log-Mel spectrogram approach with a multi-layer perceptron that has ReLU activations and a softmax final layer.
Language: Python 15 4 45