Pinned Repositories
ANT-Quantization
llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Quest
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
BoardgameAI_UnderWorld
SJTU ACM Class Machine Learning 2022 Assignment
Bookstore_Sakits
SJTU ACM Class Data Structure 2020 Assignment
Compiler_Violet
SJTU ACM Class Compiler Design and Implementation 2022 Assignment
CPU_Shieru
SJTU ACM Class Architecture 2021 Assignment
flashinfer-dev
FlashInfer: Kernel Library for LLM Serving
llm-awq-dev
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Stupid-Template-Library
SJTU ACM Class Data Structure 2020 Assignment
Sakits's Repositories
Sakits/CPU_Shieru
SJTU ACM Class Architecture 2021 Assignment
Sakits/Bookstore_Sakits
SJTU ACM Class Data Structure 2020 Assignment
Sakits/Compiler_Violet
SJTU ACM Class Compiler Design and Implementation 2022 Assignment
Sakits/BoardgameAI_UnderWorld
SJTU ACM Class Machine Learning 2022 Assignment
Sakits/flashinfer-dev
FlashInfer: Kernel Library for LLM Serving
Sakits/llm-awq-dev
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Sakits/Stupid-Template-Library
SJTU ACM Class Data Structure 2020 Assignment
Sakits/TicketSystem_Enoshima-Dentetsu
SJTU ACM Class Data Structure 2020 Assignment
Sakits/zhiyuan-salon
A website that lets Zhiyuan College students and faculty look up how many Zhiyuan Salon sessions they have attended