/Quantization-in-Depth

Dive into advanced quantization techniques. Learn to implement and customize linear quantization functions, measure quantization error, and compress model weights using PyTorch for efficient and accessible AI models.

Primary LanguageJupyter Notebook

Watchers