kentaroy47/quantize-huggingface

Quantize Huggingface transformers like BERT :hugs:


Quantize huggingface models

See huggingface-tweet.ipynb for the implementation.

We quantize DistilBERT as a test case, but you can swap in any other model.
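
For readers who want a quick idea of what swapping models could look like, here is a minimal sketch. It assumes PyTorch dynamic quantization (`torch.quantization.quantize_dynamic`) applied to the `nn.Linear` layers; the notebook may use a different method or settings. The helper names `quantize_linear_layers` and `load_and_quantize`, the checkpoint name `distilbert-base-uncased`, and the choice of `AutoModelForSequenceClassification` are illustrative assumptions, not taken from this repository.

```python
import torch


def quantize_linear_layers(model: torch.nn.Module) -> torch.nn.Module:
    """Return an int8 dynamically quantized copy of `model`.

    Assumption: only nn.Linear layers are quantized, which covers most of
    the weights in transformer encoders. The original model is not modified.
    """
    return torch.quantization.quantize_dynamic(
        model,
        {torch.nn.Linear},   # layer types to replace with quantized versions
        dtype=torch.qint8,   # store weights as 8-bit integers
    )


def load_and_quantize(model_name: str = "distilbert-base-uncased") -> torch.nn.Module:
    """Hypothetical helper: load a Hugging Face model and quantize it.

    Requires the `transformers` package and network access (or a local cache).
    Pass a different `model_name` to try another model.
    """
    from transformers import AutoModelForSequenceClassification

    model = AutoModelForSequenceClassification.from_pretrained(model_name)
    model.eval()  # quantize for inference, not training
    return quantize_linear_layers(model)
```

Calling `load_and_quantize("bert-base-uncased")` would, under the same assumptions, quantize BERT instead of DistilBERT. Dynamic quantization runs on CPU and needs no calibration data, which is why it is used for this sketch.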