Stonesjtu

Machine Learning from Huge to Tiny Deep Learning from Python to RTL

NIOShanghai, China

Stonesjtu's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language:C++68.2k 547 4k9.8k
xai-org/grok-1
Grok open release
Language:Python49.6k 574 2108.3k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python35.9k 213 1.3k4.1k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python32.5k 188 5583.5k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python29.9k 216 2522.9k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.2k 226 2643.1k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python21k 209 3852.2k
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook15.1k 113 4121.4k
state-spaces/mamba
Mamba SSM architecture
Language:Python13.3k 99 5501.1k
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python10.7k 125 217785
vosen/ZLUDA
CUDA on non-NVIDIA GPUs
Language:Rust9.8k 134 178639
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.7k 94 2k996
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
Language:C++6k 39 88509
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Language:Python5.3k 39 40508
pytorch/torchtitan
A native PyTorch Library for large model training
Language:Python2.6k 42 178205
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Language:Python2.5k 24 187204
Tele-AI/Telechat
Language:Python1.8k 21 6199
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
Language:Cuda1.7k 29 2970
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Language:Jupyter Notebook1.5k 65 40133
NVIDIA/cccl
CUDA Core Compute Libraries
Language:C++1.3k 31 1.5k164
tspeterkim/flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
Language:Cuda630 4 654
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Language:Python588 21 4552
Jokeren/Awesome-GPU
Awesome resources for GPUs
495 25 050
HazyResearch/aisys-building-blocks
Building blocks for foundation models.
397 30 016
efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Language:Cuda278 10 1924
proger/accelerated-scan
Accelerated First Order Parallel Associative Scan
Language:Python164 8 78
NVlabs/cub
THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
Language:Cuda83 5 050
UofT-EcoSystem/Minuet
[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs
Language:Cuda73 2 33
X-LANCE/PaperReading
整理各研究方向经典论文
10 3 0
Stonesjtu/Awesome-GPU
Awesome resources for GPUs
2 0 00

Stonesjtu

Stonesjtu's Stars

ggerganov/llama.cpp

xai-org/grok-1

RVC-Boss/GPT-SoVITS

2noise/ChatTTS

myshell-ai/OpenVoice

meta-llama/llama3

facebookresearch/audiocraft

KindXiaoming/pykan

state-spaces/mamba

InstantID/InstantID

vosen/ZLUDA

NVIDIA/TensorRT-LLM

google/gemma.cpp

google/gemma_pytorch

pytorch/torchtitan

mit-han-lab/llm-awq

Tele-AI/Telechat

HazyResearch/ThunderKittens

X-LANCE/AniTalker

NVIDIA/cccl

tspeterkim/flash-attention-minimal

X-LANCE/SLAM-LLM

Jokeren/Awesome-GPU

HazyResearch/aisys-building-blocks

efeslab/Atom

proger/accelerated-scan

NVlabs/cub

UofT-EcoSystem/Minuet

X-LANCE/PaperReading

Stonesjtu/Awesome-GPU