Pinned Repositories
AdaBM
[CVPR2024] Official Code for the "AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution"
CADyQ
[ECCV2022] Official Code for the "CADyQ: Content-Aware Dynamic Quantization for Image Super Resolution"
QuantSR
[NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.
Efficient-Computing
Efficient computing methods developed by Huawei Noah's Ark Lab
CABM-pytorch
CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input
auto-round
SOTA Weight-only Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"
Quest
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
DFSQ
super-resolution; post-training quantization; model compression
vokkko's Repositories
vokkko/auto-round
SOTA Weight-only Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
vokkko/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
vokkko/EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"
vokkko/Quest
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference