vokkko's Stars
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
chaiNNer-org/chaiNNer
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and powerful programmatic image processing application.
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
JDAI-CV/dabnn
dabnn is an accelerated binary neural networks inference framework for mobile platform
MyNiuuu/MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Cornell-RelaxML/quip-sharp
mit-han-lab/duo-attention
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
intel/auto-round
Advanced Quantization Algorithm for LLMs/VLMs.
efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
facebookresearch/SpinQuant
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
Guangxuan-Xiao/torch-int
This repository contains integer operators on GPUs for PyTorch.
snap-research/BitsFusion
ChenMnZ/PrefixQuant
An algorithm for static activation quantization of LLMs
Linwei-Chen/LIS
IJCV2023 Instance Segmentation in the Dark
OswaldHe/HMT-pytorch
Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"
cauyxy/bilivideos
ThisisBillhe/EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"
thu-nics/ViDiT-Q
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
thu-nics/MixDQ
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
Kai-Liu001/2DQuant
PyTorch code for our paper "2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution"
1hunters/retraining-free-quantization
RFQuant: Retraining-free Model Quantization via One-Shot Weight-Coupling Learning, CVPR (2024)
zysxmu/ERQ