Study of model compression techniques: knowledge distillation, quantization, and pruning
Primary Language: Jupyter Notebook