imxtx

My interests are in computer vision and systems.

China

imxtx's Stars

mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language: Jupyter Notebook · 40.6k 418 694.3k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python · 36.6k 297 1.1k 4.5k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python · 33.4k 191 5833.6k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language: Python · 20k 135 1.2k 1.4k
fishaudio/fish-speech
SOTA Open Source TTS
Language: Python · 18.1k 111 4791.4k
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language: Jupyter Notebook · 15.8k 203 4002.3k
mamoe/mirai
High-efficiency QQ bot support library
Language: Kotlin · 14.6k 136 2k 2.5k
karpathy/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Language: Jupyter Notebook · 10.8k 151 311.6k
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language: Python · 9.1k 82 665882
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
Language: Python · 9.1k 136 5851.1k
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language: Python · 7.7k 82 154769
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python · 7.6k 64 158645
google/latexify_py
A library to generate LaTeX expression from Python code.
Language: Python · 7.4k 57 84392
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python · 7k 55 2071.3k
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language: Python · 4.9k 54 124503
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Language: MATLAB · 4.3k 39 0 561
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language: Python · 3.9k 44 162349
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language: Python · 3k 88 98418
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language: Python · 2.7k 30 52185
r9y9/wavenet_vocoder
WaveNet vocoder
Language: Python · 2.3k 97 193500
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
2.2k 36 1188
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language: Python · 2.1k 49 127323
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python · 2k 31 165511
microsoft/NeuralSpeech
Language: Python · 1.4k 33 126181
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
Language: Python · 1.3k 46 5587
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python · 1.3k 54 31103
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python · 933 22 5755
fighting41love/zhvoice
Chinese voice corpus with clearer, more natural speech. Includes 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.
608 9 0 115
jianchang512/gptsovits-api
API calling interface for GPT-SoVITS
Language: Python · 220 6 730
glory20h/VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
Language: Python · 166 7 58
