hnyls2002

@acm-21, RA @ucbrise, member @lm-sys @sgl-project Talk is cheap, show show way...

SJTU, UC Berkeley

hnyls2002's Stars

cli/cli
GitHub’s official command line tool
Language: Go 37.8k 913 4.5k 6k
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Language: Python 25k 318 1k 3.2k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 20.9k 158 1.6k 2.3k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python 20.5k 306 1.4k 2.6k
guidance-ai/guidance
A guidance language for controlling large language models.
Language: Jupyter Notebook 19.3k 119 5491.1k
HqWu-HITCS/Awesome-Chinese-LLM
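A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.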
17.1k 216 271.6k
abetlen/llama-cpp-python
Python bindings for llama.cpp
Language: Python 8.3k 73 1.2k 1k
outlines-dev/outlines
Structured Text Generation
Language: Python 8.2k 47 553414
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language: C++ 8k 78 172418
mamba-org/mamba
The Fast Cross-Platform Package Manager
Language: C++ 7k 46 1.8k 368
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language: Python 6.8k 63 805622
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Language: Python 5.7k 60 106520
ollama-webui/ollama-webui
ChatGPT-Style Web UI Client for Ollama 🦙
Language: Svelte 5.2k 44 309494
lark-parser/lark
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
Language: Python 5k 57 926420
openxla/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Language: C++ 2.8k 41 390455
yhzhang0128/egos-2000
Envision a future where every student can read all the code of a teaching operating system.
Language: C 2.2k 32 13156
rustcc/writing-an-os-in-rust
Writing an Operating System in Rust
Language: Rust 2.2k 63 7207
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
2k 51 10108
Niek/chatgpt-web
ChatGPT web interface using the OpenAI API
Language: Svelte 1.9k 21 170476
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Language: Python 1.8k 24 39100
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Language: C++ 1.7k 30 664234
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language: Cuda 1.6k 21 153163
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
Language: Python 884 13 15391
Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language: Cuda 871 13 15138
skyzh/write-you-a-vector-db
A Vector Database Tutorial (over CMU-DB's BusTub system)
Language: C++ 645 9 018
efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Language: Cuda 281 10 2025
lambda7xx/awesome-AI-system
paper and its code for AI System
236 6 314
mkuchnik/relm
ReLM is a Regular Expression engine for Language Models
Language: Python 104 4 111
yichuan520030910320/MLsys_reading_list
A record of reading list on some MLsys popular topic
6 1 00
wennitao/Advanced-Compiler
Advanced Compiler Assignment of ACM Class
Language: C 2 1 00
