shaun95's Stars
2noise/ChatTTS
A generative speech model for daily dialogue.
meta-llama/llama3
The official Meta Llama 3 GitHub site
karpathy/llm.c
LLM training in simple, raw C/CUDA
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
dnhkng/GlaDOS
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
microsoft/Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
NX-AI/xlstm
Official repository of the xLSTM.
google-ai-edge/model-explorer
A modern model graph visualizer and debugger
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
JinhuaLiang/WavCraft
Official repo for WavCraft, an AI agent for audio creation and editing
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
tairov/llama2.py
Inference Llama 2 in one file of pure Python
AI-Guru/xlstm-resources
Resources about xLSTM by Sepp Hochreiter
liutaocode/TTS-arxiv-daily
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
Plachtaa/FAcodec
Training code for FAcodec presented in NaturalSpeech3
bytedance/Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
myscience/x-lstm
Pytorch implementation of the xLSTM model by Beck et al. (2024)
karpathy/calorie
nice and effective super simple calorie counter web app
sony/soundctm
Pytorch implementation of SoundCTM
XiangLi2022/CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
resemble-ai/arabic-text-diacritization
Benchmark Arabic text diacritization dataset
Cerber2ol8/GPT-SoVITS-Notebook
GPT-SoVITS的Notebook工作流
MaxMax2016/StreamSpeech
实时流式,StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
shaun95/OpenVoice
Instant voice cloning by MyShell