A high-throughput and memory-efficient inference and serving engine for LLMs
A simple and effective LLM pruning approach.