FreeLo0op's Stars
XYiliang/CDVAE
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
meta-llama/llama-stack-apps
Agentic components of the Llama Stack APIs
mli/paper-reading
深度学习经典、新论文逐段精读
cs230-stanford/cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
huggingface/huggingface_hub
The official Python client for the Huggingface Hub.
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
datawhalechina/self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程
chrisstaite/lameenc
Python bindings around the LAME encoder
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
2noise/ChatTTS
A generative speech model for daily dialogue.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
KindXiaoming/pykan
Kolmogorov Arnold Networks
mapull/chinese-dictionary
中文汉语拼音辞典,汉字拼音字典,词典,成语词典,常用字、多音字字典数据库
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
syhw/wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.