ChuanhongLi

Pinned Repositories

bcc
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Language:C0 0 00
cgroup-icmp-drop
This is a simple ebpf cgroup program, just used for eBPF learing
Language:C0 1 00
eBPF-learning
I'm a new beginner for eBPF, and this project is used to record the way to it
0 1 00
exl2-for-all
EXL2 quantization generalized to other models.
Language:Python0 0 00
GenZ-LLM-Analyzer
LLM Inference analyzer for different hardware platforms
Language:Jupyter Notebook0 0 00
gobpf
Go bindings for creating BPF programs.
Language:C0 0 00
QuIP-for-all
QuIP quantization
Language:Python0 0 00
quip-sharp
Language:Python0 0 00
vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0 00
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python37.5k 220 5.6k4.6k

ChuanhongLi's Repositories

ChuanhongLi/bcc
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Language:C0 0 00
ChuanhongLi/cgroup-icmp-drop
This is a simple ebpf cgroup program, just used for eBPF learing
Language:C0 1 00
ChuanhongLi/eBPF-learning
I'm a new beginner for eBPF, and this project is used to record the way to it
0 1 00
ChuanhongLi/exl2-for-all
EXL2 quantization generalized to other models.
Language:Python0 0 00
ChuanhongLi/GenZ-LLM-Analyzer
LLM Inference analyzer for different hardware platforms
Language:Jupyter Notebook0 0 00
ChuanhongLi/gobpf
Go bindings for creating BPF programs.
Language:C0 0 00
ChuanhongLi/QuIP-for-all
QuIP quantization
Language:Python0 0 00
ChuanhongLi/quip-sharp
Language:Python0 0 00
ChuanhongLi/vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0 00
ChuanhongLi/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
ChuanhongLi/CacheBlend
ChuanhongLi/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.