FreeLo0op

FreeLo0op's Stars

XYiliang/CDVAE
Language:Python1
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python7.8k1.1k
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Jupyter Notebook13.3k3.2k
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9.1k838
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language:Python3.6k312
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Language:PostScript17.8k2.2k
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python6.7k1.2k
meta-llama/llama-stack-apps
Agentic components of the Llama Stack APIs
Language:Python3.6k390
mli/paper-reading
深度学习经典、新论文逐段精读
26.5k2.4k
cs230-stanford/cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
Language:Python3.8k986
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.6k4.1k
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python10.4k668
huggingface/huggingface_hub
The official Python client for the Huggingface Hub.
Language:Python2k530
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python11k1.8k
datawhalechina/self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型，更适合**宝宝的部署教程
Language:Jupyter Notebook8.2k980
chrisstaite/lameenc
Python bindings around the LAME encoder
Language:C518
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.1k402
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python58742
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language:Python2.7k424
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python31.1k3.4k
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6k757
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language:Python685116
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.1k656
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook14.7k1.3k
mapull/chinese-dictionary
中文汉语拼音辞典，汉字拼音字典，词典，成语词典，常用字、多音字字典数据库
450109
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
Language:Python26886
wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
Language:Python45567
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python1.3k243
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.6k1.4k
syhw/wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
1.9k226