Pinned Repositories
deeplearningbook-chinese
Deep Learning Book Chinese Translation
DeepLearningBook-ReadingNotes
DeepLearningBook 读书会笔记及讲义
LeetCode
:pencil: Python / C++ 11 Solutions of All 468 LeetCode Questions
Machine-Learning
OmniQuant
OmniQuant is a simple and powerful quantization technique for LLMs.
triton-index
Cataloging released Triton kernels.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
OmniQuant
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
brisker's Repositories
brisker/deeplearningbook-chinese
Deep Learning Book Chinese Translation
brisker/DeepLearningBook-ReadingNotes
DeepLearningBook 读书会笔记及讲义
brisker/LeetCode
:pencil: Python / C++ 11 Solutions of All 468 LeetCode Questions
brisker/Machine-Learning
brisker/OmniQuant
OmniQuant is a simple and powerful quantization technique for LLMs.
brisker/triton-index
Cataloging released Triton kernels.