Pinned Repositories
MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
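The quantization repositories above (AutoGPTQ, AutoAWQ, llm-awq) all compress LLM weights to low-bit integers. As a rough illustration of the core idea, here is a minimal sketch of symmetric 4-bit weight quantization in plain Python; this is not code from any of these projects (AWQ in particular additionally rescales salient weight channels using activation statistics before quantizing):

```python
# Minimal sketch of symmetric per-tensor 4-bit quantization (illustrative only;
# real libraries use per-group scales, packed int4 storage, and fused kernels).

def quantize_4bit(weights):
    """Map floats to signed 4-bit integers in [-8, 7] with one shared scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize_4bit(q, scale):
    """Recover approximate float weights from 4-bit codes."""
    return [x * scale for x in q]

w = [0.12, -0.5, 0.33, 0.7]
q, s = quantize_4bit(w)
w_hat = dequantize_4bit(q, s)
```

The reconstruction error is bounded by half the scale step, which is why techniques like AWQ focus on keeping the scale small for the channels that matter most.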
cutlass
CUDA Templates for Linear Algebra Subroutines
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
archbase
Open-source edition of the textbook Foundations of Computer Architecture (Hu Weiwu et al., 3rd edition)
MNN
MNN is a lightweight deep neural network inference engine.
reinforcement-learning-an-introduction
Python code for Reinforcement Learning: An Introduction
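The reinforcement-learning-an-introduction repo implements the exercises from Sutton & Barto. As a flavor of what that code covers, here is a minimal, self-contained sketch of the Chapter 2 epsilon-greedy multi-armed bandit with sample-average value estimates (my own illustrative version, not code taken from the repository):

```python
import random

def epsilon_greedy_bandit(true_means, steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy on a Gaussian k-armed bandit with incremental
    sample-average estimates (Sutton & Barto, Chapter 2)."""
    rng = random.Random(seed)
    k = len(true_means)
    estimates = [0.0] * k   # running value estimate per arm
    counts = [0] * k        # number of pulls per arm
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(k)                          # explore
        else:
            arm = max(range(k), key=lambda a: estimates[a])  # exploit
        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        # Incremental mean update: Q += (R - Q) / n
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return estimates, counts

est, cnt = epsilon_greedy_bandit([0.2, 0.8, 0.5])
```

With enough steps, the arm with the highest true mean ends up pulled far more often than the others, which is the basic explore/exploit trade-off the book's second chapter studies.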
yyfcc17's Repositories
yyfcc17/archbase
Open-source edition of the textbook Foundations of Computer Architecture (Hu Weiwu et al., 3rd edition)
yyfcc17/MNN
MNN is a lightweight deep neural network inference engine.
yyfcc17/reinforcement-learning-an-introduction
Python code for Reinforcement Learning: An Introduction