shaun95

Enthusiast of neural synthesizers and vocoders, always curious.

shaun95's Stars

2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python31.1k 179 5153.4k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.3k 215 2423k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda23.6k 231 1342.6k
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
Language:Python14.9k 139 2.1k2.4k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.2k 92 161.1k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
11.9k 270 109769
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language:Python11.9k 171 230806
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9.1k 83 36838
dnhkng/GlaDOS
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
Language:Python2.9k 41 49276
microsoft/Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
Language:Jupyter Notebook2.3k 15 67228
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python1.8k 21 179168
NX-AI/xlstm
Official repository of the xLSTM.
Language:Python1.3k 13 4392
google-ai-edge/model-explorer
A modern model graph visualizer and debugger
Language:JavaScript994 34 5075
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
Language:HTML914 18 226105
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python883 12 1366
JinhuaLiang/WavCraft
Official repo for WavCraft, an AI agent for audio creation and editing
Language:Python648 71 396
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Language:Python509 18 3642
tairov/llama2.py
Inference Llama 2 in one file of pure Python
Language:Python405 4 132
AI-Guru/xlstm-resources
Resources about xLSTM by Sepp Hochreiter
276 26 016
liutaocode/TTS-arxiv-daily
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
Language:Python224 23 019
Plachtaa/FAcodec
Training code for FAcodec presented in NaturalSpeech3
Language:Python160 9 2217
bytedance/Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
Language:Python119 4 514
myscience/x-lstm
Pytorch implementation of the xLSTM model by Beck et al. (2024)
Language:Python117 8 513
karpathy/calorie
nice and effective super simple calorie counter web app
Language:HTML91 1 28
sony/soundctm
Pytorch implementation of SoundCTM
Language:Python68 3 16
XiangLi2022/CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Language:Python61 3 23
resemble-ai/arabic-text-diacritization
Benchmark Arabic text diacritization dataset
Language:Python4 2 01
Cerber2ol8/GPT-SoVITS-Notebook
GPT-SoVITS的Notebook工作流
Language:Python3 1 00
MaxMax2016/StreamSpeech
实时流式，StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python1 0 0
shaun95/OpenVoice
Instant voice cloning by MyShell
Language:Python1 0 0

shaun95

shaun95's Stars

2noise/ChatTTS

meta-llama/llama3

karpathy/llm.c

UKPLab/sentence-transformers

naklecha/llama3-from-scratch

BradyFU/Awesome-Multimodal-Large-Language-Models

openai/tiktoken

karpathy/minbpe

dnhkng/GlaDOS

microsoft/Phi-3CookBook

OpenLLMAI/OpenRLHF

NX-AI/xlstm

google-ai-edge/model-explorer

erew123/alltalk_tts

ictnlp/StreamSpeech

JinhuaLiang/WavCraft

X-LANCE/SLAM-LLM

tairov/llama2.py

AI-Guru/xlstm-resources

liutaocode/TTS-arxiv-daily

Plachtaa/FAcodec

bytedance/Make-An-Audio-2

myscience/x-lstm

karpathy/calorie

sony/soundctm

XiangLi2022/CM-TTS

resemble-ai/arabic-text-diacritization

Cerber2ol8/GPT-SoVITS-Notebook

MaxMax2016/StreamSpeech

shaun95/OpenVoice