jiajunsun68's Stars
binary-husky/gpt_academic
Provides a practical interactive interface for GPT/GLM and other large language models, with special attention to paper reading/polishing/writing workflows. Modular design with support for custom shortcut buttons & function plugins; project analysis & self-translation for Python, C++, and other codebases; PDF/LaTeX paper translation & summarization; parallel queries to multiple LLMs; local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and more.
microsoft/BitNet
Official inference framework for 1-bit LLMs
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
microsoft/LLMLingua
[EMNLP'23, ACL'24] Speeds up LLM inference and enhances LLMs' perception of key information by compressing the prompt and KV-cache, achieving up to 20x compression with minimal performance loss.
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
Vahe1994/AQLM
Official PyTorch repository for "Extreme Compression of Large Language Models via Additive Quantization" (https://arxiv.org/abs/2401.06118) and "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression" (https://arxiv.org/abs/2405.14852)
luanshiyinyang/FacialExpressionRecognition
Source code for a facial expression recognition project (a subtask of face recognition)
locuslab/wanda
A simple and effective LLM pruning approach.
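For context on what "simple and effective" means here, a minimal NumPy sketch of Wanda's pruning metric, |weight| times the L2 norm of the corresponding input activations, with scores compared within each output row; the function name and tensor shapes are illustrative, not taken from the repository:

```python
import numpy as np

def wanda_prune_mask(weight, activations, sparsity=0.5):
    """Sketch of the Wanda score: |W_ij| * ||X_j||_2.

    weight:      (out_features, in_features) layer weight
    activations: (n_samples, in_features) calibration inputs
    Returns a boolean mask (True = keep), ranking scores per output row.
    """
    # Per-input-channel L2 norm over the calibration set
    act_norm = np.linalg.norm(activations, axis=0)   # (in_features,)
    score = np.abs(weight) * act_norm                # broadcast over rows

    # Keep the top (1 - sparsity) fraction of weights in each row
    n_keep = int(weight.shape[1] * (1 - sparsity))
    keep_idx = np.argsort(-score, axis=1)[:, :n_keep]
    mask = np.zeros_like(weight, dtype=bool)
    np.put_along_axis(mask, keep_idx, True, axis=1)
    return mask
```

No retraining or weight update is involved; the mask is applied directly (`weight * mask`), which is what makes the method attractive at LLM scale.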
microsoft/TransformerCompression
For releasing code related to compression methods for transformers, accompanying our publications
NVlabs/Taylor_pruning
Pruning Neural Networks with Taylor criterion in Pytorch
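A rough sketch of the idea, under the assumption that filter importance is estimated from the squared first-order Taylor term (weight times gradient) aggregated per filter; the helper below is hypothetical and written in NumPy rather than PyTorch for self-containment:

```python
import numpy as np

def taylor_filter_importance(weight, grad):
    """Approximate loss change from removing each output filter.

    First-order Taylor estimate: sum over a filter's weights of
    (w * dL/dw)^2. Filters with the smallest scores are pruned first.
    weight, grad: arrays of shape (n_filters, ...) from one layer.
    """
    n_filters = weight.shape[0]
    return ((weight * grad) ** 2).reshape(n_filters, -1).sum(axis=1)
```

The appeal of the criterion is that it needs only quantities already available during backpropagation, so importance can be accumulated during normal training steps.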
Aaronhuang-778/BiLLM
(ICML 2024) BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
hahnyuan/PB-LLM
PB-LLM: Partially Binarized Large Language Models
AIoT-MLSys-Lab/SVD-LLM
Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"
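For reference, a hedged sketch of the plain truncated-SVD baseline that low-rank compression methods like SVD-LLM start from (the paper's contribution, truncation-aware data whitening, is not reproduced here); names and shapes are illustrative:

```python
import numpy as np

def svd_low_rank(W, rank):
    """Factor W (out x in) into A (out x rank) @ B (rank x in)."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # fold singular values into the left factor
    B = Vt[:rank, :]
    return A, B

W = np.random.randn(64, 128)
A, B = svd_low_rank(W, rank=16)
# Parameters drop from 64*128 to 16*(64+128); the layer becomes two matmuls.
```

Compression pays off whenever rank < (out * in) / (out + in), at the cost of the truncated singular values' reconstruction error.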
VILA-Lab/GBLM-Pruner
Is gradient information useful for pruning LLMs?
cjyaras/deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
Dereck0602/Bolaco
saintslab/bmrs-structured-pruning
Code release for the paper "BMRS: Bayesian Model Reduction for Structured Pruning"