Aaronhuang-778
PhD @ HKU (CVMI Lab); B.Eng @ BUAA. #Deep Learning #Model Compression #Quantization/Binarization #LLM
HKU - CVMI Lab (https://xjqi.github.io/cvmi.html)Hong Kong
Pinned Repositories
AaronHuang-778.github.io
Wei's Academic Personal Homepage
BiLLM
(ICML 2024) BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Mulit-precision-Quantization
This is a repository for practicing of mulit-precision NN models. This is not for any paper or project.
PPPair_Programming
使用拓扑排序和DFS算法计算复杂单词环信息,并设计UI界面
Pytorch-NN-Model_Extraction
Extract the computer map from pytorch and it`s onnx
SliM-LLM
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
Sys-Compile
一个Sysy语言编译器(C语言的子集),其中包括前端和后端的生成。前端采用语法树分析,构建自定义IR中间代码,后端生成目标代码为MIPS
V-MIND
This is the dataset of our paper which enhanced the MIND with all News Pics
LLaMA3-Quantization
A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..
SNNGX_qSNN_encryption
Aaronhuang-778's Repositories
Aaronhuang-778/BiLLM
(ICML 2024) BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Aaronhuang-778/SliM-LLM
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
Aaronhuang-778/V-MIND
This is the dataset of our paper which enhanced the MIND with all News Pics
Aaronhuang-778/Mulit-precision-Quantization
This is a repository for practicing of mulit-precision NN models. This is not for any paper or project.
Aaronhuang-778/PPPair_Programming
使用拓扑排序和DFS算法计算复杂单词环信息,并设计UI界面
Aaronhuang-778/Pytorch-NN-Model_Extraction
Extract the computer map from pytorch and it`s onnx
Aaronhuang-778/Sys-Compile
一个Sysy语言编译器(C语言的子集),其中包括前端和后端的生成。前端采用语法树分析,构建自定义IR中间代码,后端生成目标代码为MIPS
Aaronhuang-778/AaronHuang-778.github.io
Wei's Academic Personal Homepage
Aaronhuang-778/awesome-efficient-aigc
A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including language and vision, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
Aaronhuang-778/Git_LFS_for_LargeModel
Aaronhuang-778/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
Aaronhuang-778/Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
Aaronhuang-778/InferenceModelFactory
Aaronhuang-778/Miniscope-DAQ-Cypress-firmware
DAQ firmware for V3 and V4 Miniscope platforms
Aaronhuang-778/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Aaronhuang-778/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Aaronhuang-778/spike_nn_practise
Aaronhuang-778/Transformer-Time-Series-Forecasting