Pinned Repositories
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
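For context, a minimal sketch of what a training step wrapped with accelerate typically looks like; the model, optimizer, and data below are toy placeholders, not taken from this repository:

```python
# Minimal sketch of a training loop wrapped with Hugging Face accelerate.
import torch
from accelerate import Accelerator

accelerator = Accelerator()  # device placement / DDP / mixed precision come from the launch config

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
dataset = torch.utils.data.TensorDataset(torch.randn(64, 10), torch.randint(0, 2, (64,)))
dataloader = torch.utils.data.DataLoader(dataset, batch_size=8)

# prepare() moves everything to the right device(s) and wraps objects for distributed training
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for inputs, targets in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), targets)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```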
ADMM-NN
admm-pruning
Prune DNN using Alternating Direction Method of Multipliers (ADMM)
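A hedged sketch of the ADMM pruning idea (variable names and hyperparameters are illustrative, not this repository's code): the weights get an extra quadratic penalty pulling them toward an auxiliary variable Z, Z is the projection of W + U onto the sparsity constraint, and U is the scaled dual variable.

```python
# Illustrative ADMM-pruning step (not the repository's implementation).
import torch

def project_topk(w, k):
    """Euclidean projection onto the set of tensors with at most k non-zeros."""
    flat = w.flatten()
    keep = flat.abs().topk(k).indices
    mask = torch.zeros_like(flat)
    mask[keep] = 1.0
    return (flat * mask).view_as(w)

def admm_prune_step(w, z, u, grad, lr=1e-2, rho=1e-3, k=100):
    # W-update: loss gradient plus the ADMM penalty rho * (W - Z + U)
    w = w - lr * (grad + rho * (w - z + u))
    # Z-update: project W + U onto the k-sparse constraint set
    z = project_topk(w + u, k)
    # dual update
    u = u + w - z
    return w, z, u
```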
AI-Job-Notes
A job-hunting guide for AI algorithm roles (covering preparation strategy, coding-problem practice guides, internal referrals, a list of AI companies, and more)
algorithm
My LeetCode Solutions with Explanation and Time Complexity Analysis
Awesome-Pruning
A curated list of neural network pruning resources.
Interview-Notebook
:books: A summary of the fundamentals you need to master for technical interviews
ml-road
Machine Learning Resources, Practice and Research
Model_Compression_Paper
Tools-NetworkModeViewer-Netron
Visualizer for neural network, deep learning and machine learning models
YongHuaZhang-BUAA's Repositories
YongHuaZhang-BUAA/bitsandbytes
LLM: 8-bit CUDA functions for PyTorch
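A minimal sketch of the kind of usage this fork targets, assuming the standard bitsandbytes 8-bit optimizer API (the model is a toy placeholder and a CUDA device is required):

```python
# Swap a full-precision Adam for the 8-bit Adam from bitsandbytes (illustrative).
import torch
import bitsandbytes as bnb

model = torch.nn.Linear(1024, 1024).cuda()
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-4)  # 8-bit optimizer states

x = torch.randn(16, 1024, device="cuda")
loss = model(x).pow(2).mean()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```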
YongHuaZhang-BUAA/cuda_hgemm
Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores with the WMMA API and MMA PTX instructions.
YongHuaZhang-BUAA/DeepSpeed
LLM: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
YongHuaZhang-BUAA/ExplanationIntervention
LLM: PyTorch code for "Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind"
YongHuaZhang-BUAA/FisherPruning
Finetuning: Group Fisher Pruning for Practical Network Compression (ICML 2021)
YongHuaZhang-BUAA/FlanT5-CoT-Specialization
LLM: Implementation of the ICML 2023 paper "Specializing Smaller Language Models towards Multi-Step Reasoning".
YongHuaZhang-BUAA/FlexGen
LLM: FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
YongHuaZhang-BUAA/FPGA-BDF
Avnet Board Definition Files
YongHuaZhang-BUAA/gptq
LLM: Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
YongHuaZhang-BUAA/hls4ml
Machine learning on FPGAs using HLS
YongHuaZhang-BUAA/hls4ml-tutorial
Tutorial notebooks for hls4ml
YongHuaZhang-BUAA/iree
👻
YongHuaZhang-BUAA/LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
YongHuaZhang-BUAA/Lion
Lion: Adversarial Distillation of Closed-Source Large Language Model
YongHuaZhang-BUAA/llm-awq
LLM: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
YongHuaZhang-BUAA/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
YongHuaZhang-BUAA/lm-evaluation-harness
LLM: A framework for few-shot evaluation of autoregressive language models.
YongHuaZhang-BUAA/LMOps
General technology for enabling AI capabilities with LLMs and MLLMs
YongHuaZhang-BUAA/neural-compressor
LLM: Provides unified APIs for SOTA model compression techniques such as low-precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.
YongHuaZhang-BUAA/owq
LLM: Code for the paper "OWQ: Lessons Learned from Activation Outliers for Weight Quantization in Large Language Models".
YongHuaZhang-BUAA/pytorch-cifar
95.47% on CIFAR10 with PyTorch
YongHuaZhang-BUAA/qlora
LLM: QLoRA: Efficient Finetuning of Quantized LLMs
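A hedged sketch of the typical QLoRA-style setup with transformers + peft; the model id and LoRA hyperparameters below are assumptions for illustration, not values from this repository:

```python
# Load a causal LM in 4-bit NF4 and attach LoRA adapters (illustrative QLoRA-style setup).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",           # NormalFloat4 quantization
    bnb_4bit_use_double_quant=True,      # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b",               # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)  # only the LoRA adapters are trainable
```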
YongHuaZhang-BUAA/QuIP
Code for the paper "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
YongHuaZhang-BUAA/RPTQ4LLM
LLM: Reorder-based post-training quantization for large language models
YongHuaZhang-BUAA/smoothquant
LLM: [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
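A hedged sketch of the core SmoothQuant transform, the per-channel smoothing that migrates activation outliers into the weights before quantization; alpha and tensor shapes are illustrative:

```python
# Illustrative SmoothQuant-style smoothing: (X / s) @ (W * s)^T == X @ W^T,
# but X / s has much milder per-channel outliers and quantizes better.
import torch

def smooth(activations, weight, alpha=0.5):
    # activations: (tokens, in_features); weight: (out_features, in_features)
    act_scale = activations.abs().amax(dim=0)   # per input channel
    w_scale = weight.abs().amax(dim=0)          # per input channel
    s = (act_scale ** alpha) / (w_scale ** (1 - alpha))
    s = s.clamp(min=1e-5)
    return activations / s, weight * s
```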
YongHuaZhang-BUAA/Sparse-storage-formats
Sparse storage format implementations for Vitis HLS
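For context, a minimal Python illustration of the CSR (compressed sparse row) layout that such formats are built on; the repository itself targets C++ for Vitis HLS:

```python
# Convert a dense matrix to CSR: non-zero values, their column indices, and row pointers.
def dense_to_csr(matrix):
    values, col_idx, row_ptr = [], [], [0]
    for row in matrix:
        for j, v in enumerate(row):
            if v != 0:
                values.append(v)
                col_idx.append(j)
        row_ptr.append(len(values))
    return values, col_idx, row_ptr

# Example: vals = [5, 1, 3, 2], cols = [0, 3, 1, 3], ptr = [0, 2, 2, 4]
vals, cols, ptr = dense_to_csr([[5, 0, 0, 1],
                                [0, 0, 0, 0],
                                [0, 3, 0, 2]])
```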
YongHuaZhang-BUAA/SpQR
LLM: SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
YongHuaZhang-BUAA/SqueezeLLM
LLM: SqueezeLLM: Dense-and-Sparse Quantization
YongHuaZhang-BUAA/wanda
LLM Pruning: A simple and effective LLM pruning approach.
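A hedged sketch of the Wanda importance score for one linear layer (|weight| times the L2 norm of the matching input activations, pruned per output row); shapes and the sparsity level are illustrative, not the repository's code:

```python
# Illustrative Wanda-style pruning of one linear layer.
import torch

def wanda_prune(weight, activations, sparsity=0.5):
    # weight: (out_features, in_features); activations: (tokens, in_features)
    act_norm = activations.norm(p=2, dim=0)      # per-input-channel L2 norm
    score = weight.abs() * act_norm              # importance of each weight
    k = int(weight.shape[1] * sparsity)
    # zero out the k lowest-scoring weights in every output row
    prune_idx = score.topk(k, dim=1, largest=False).indices
    mask = torch.ones_like(weight)
    mask.scatter_(1, prune_idx, 0.0)
    return weight * mask
```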
YongHuaZhang-BUAA/XilinxBoardStore