Pinned Repositories
Arthur-Ling
Config files for my GitHub profile.
solve_sycamore
Reproduce the random circuit sampling experiments of Sycamore quantum circuit
ANT-Quantization
torch-int
This repository contains integer operators on GPUs for PyTorch.
I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats
Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
cutlass
CUDA Templates for Linear Algebra Subroutines
OmniQuant
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
Arthur-Ling's Repositories
Arthur-Ling/Arthur-Ling
Config files for my GitHub profile.
Arthur-Ling/solve_sycamore
Reproduce the random circuit sampling experiments of Sycamore quantum circuit