hnyls2002

@acm-21, RA @ucbrise, member @lm-sys @sgl-project Talk is cheap, show show way...

SJTU, UCBBerkeley

hnyls2002's Stars

xai-org/grok-1
Grok open release
Language:Python49.7k 591 2148.3k
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
Language:Python41.4k 331 3.7k5.4k
astral-sh/ruff
An extremely fast Python linter and code formatter, written in Rust.
Language:Rust34.2k 84 5.8k1.2k
MonitorControl/MonitorControl
🖥 Control your display's brightness & volume on your Mac as if it was a native Apple Display. Use Apple Keyboard keys or custom shortcuts. Shows the native macOS OSDs.
Language:Swift28.5k 158 950827
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.9k 158 1.6k2.3k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python20.5k 307 1.4k2.6k
guidance-ai/guidance
A guidance language for controlling large language models.
Language:Jupyter Notebook19.4k 119 5501.1k
dottxt-ai/outlines
Structured Text Generation
Language:Python10.2k 48 665532
abetlen/llama-cpp-python
Python bindings for llama.cpp
Language:Python8.4k 74 1.2k1k
mamba-org/mamba
The Fast Cross-Platform Package Manager
Language:C++7k 46 1.8k369
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python6.9k 63 805635
lark-parser/lark
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
Language:Python5k 56 926420
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Language:Python5k 25 90158
openxla/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Language:C++2.8k 42 394456
rustcc/writing-an-os-in-rust
《使用Rust编写操作系统》
Language:Rust2.2k 63 7206
Niek/chatgpt-web
ChatGPT web interface using the OpenAI API
Language:Svelte1.9k 21 170475
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Language:Python1.8k 24 39100
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Language:C++1.7k 30 664234
Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language:Cuda873 13 15138
skyzh/write-you-a-vector-db
A Vector Database Tutorial (over CMU-DB's BusTub system)
Language:C++647 9 018
skyzh/chicv
A minimal and fully-customizable CV template for Typst.
Language:Typst618 4 143
efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Language:Cuda282 10 2025
IlyaGrebnov/libsais
libsais is a library for linear time suffix array, longest common prefix array and burrows wheeler transform construction based on induced sorting algorithm.
Language:C190 15 1824
FasterDecoding/BitDelta
Language:Jupyter Notebook188 4 514
FasterDecoding/REST
REST: Retrieval-Based Speculative Decoding, NAACL 2024
Language:C185 6 2111
matchy233/typst-chi-cv-template
😍 Rip-off of rip-off of skyzh's CV, using typst
Language:Typst125 2 410
mkuchnik/relm
ReLM is a Regular Expression engine for Language Models
Language:Python103 4 111
Intsights/PySubstringSearch
Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm
Language:C41 3 55
yichuan520030910320/MLsys_reading_list
A record of reading list on some MLsys popular topic
6 1 00
ModelTC/general-sam-py
Python bindings for general-sam and some utilities
Language:Python3 8 00

hnyls2002

hnyls2002's Stars

xai-org/grok-1

oobabooga/text-generation-webui

astral-sh/ruff

MonitorControl/MonitorControl

haotian-liu/LLaVA

microsoft/unilm

guidance-ai/guidance

dottxt-ai/outlines

abetlen/llama-cpp-python

mamba-org/mamba

sgl-project/sglang

lark-parser/lark

XuehaiPan/nvitop

openxla/xla

rustcc/writing-an-os-in-rust

Niek/chatgpt-web

S-LoRA/S-LoRA

flexflow/FlexFlow

Liu-xiandong/How_to_optimize_in_GPU

skyzh/write-you-a-vector-db

skyzh/chicv

efeslab/Atom

IlyaGrebnov/libsais

FasterDecoding/BitDelta

FasterDecoding/REST

matchy233/typst-chi-cv-template

mkuchnik/relm

Intsights/PySubstringSearch

yichuan520030910320/MLsys_reading_list

ModelTC/general-sam-py