ycchuang's Stars
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
ExistentialAudio/BlackHole
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
lancopku/pkuseg-python
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
shibing624/pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。
pytorch/serve
Serve, optimize and scale PyTorch models in production
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
mckinsey/causalnex
A Python library that helps data scientists to infer causation rather than observing correlation.
Music-and-Culture-Technology-Lab/omnizart
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
moses-smt/mosesdecoder
Moses, the machine translation system
travistangvh/ChatGPT-Data-Science-Prompts
A repository of 60 useful data science prompts for ChatGPT
microsoft/NeuralSpeech
bastibe/SoundCard
A Pure-Python Real-Time Audio Library
Ailln/cn2an
📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
bojone/word-discovery
速度更快、效果更好的中文新词发现
kakaobrain/g2pm
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
awslabs/mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020)
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
belambert/asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
intxcc/pyaudio_portaudio
A fork to record speaker output with python. PyAudio with PortAudio for Windows | Extended | Loopback | WASAPI | Latest precompiled Version
aparrish/phonetic-similarity-vectors
Source code to accompany my paper "Poetic sound similarity vectors using phonetic features"
kmario23/KenLM-training
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
gpodder/gpodder.github.io
Collaboratively-maintained gPodder website
esun-ai/phonetic_mlm
Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition
samemon/Voice-to-Age
This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perceptron model with relu as an activation and softmax in the final layer.