JinnyYin's Stars
ddlBoJack/Awesome-Speech-Language-Model
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
nuochenpku/LLaMA_Analysis
The official project for our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
FunAudioLLM/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capability.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
timsainb/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
mosaicml/streaming
A Data Streaming Library for Efficient Neural Network Training
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
zeroQiaoba/MERTools
Toolkits for Multimodal Emotion Recognition
EIHW/MuSe-2023
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
github/copilot-docs
Documentation for GitHub Copilot
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
songhwanjun/Awesome-Noisy-Labels
A Survey
wangyanckxx/FERV39k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
apachecn/hands-on-ml-zh
:book: [Translation] Hands-On Machine Learning with Scikit-Learn and TensorFlow [taken offline due to copyright issues]
ageron/handson-ml3
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
HarisIqbal88/PlotNeuralNet
LaTeX code for drawing neural network diagrams
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
doocs/leetcode
🔥 LeetCode solutions in any programming language | Solutions in multiple programming languages to LeetCode, Coding Interviews (剑指 Offer, 2nd edition), and Cracking the Coding Interview (6th edition)
aishell-foundation/DaCiDian
DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)
theblackcat102/edgedict
Working online speech recognition based on RNN Transducer. (Trained model available under Releases.)
lucidrains/conformer
Implementation of the convolutional module from the Conformer paper, for use in Transformers