Basis Embedding: a product-quantization-based model compression method for language models.
Primary Language: Python
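
To make the description concrete, here is a minimal sketch of generic product quantization applied to an embedding matrix. It is not taken from this repository and may differ from its actual method: the function names (`kmeans`, `pq_compress`, `pq_reconstruct`), the parameter values, and the random stand-in embedding table are all assumptions made for illustration. The idea is to split each embedding vector into sub-vectors, learn a small codebook per sub-space with k-means, and store only the codebook indices.

```python
import numpy as np


def kmeans(x, k, iters=10, seed=0):
    """Plain Lloyd's k-means; returns (centroids, assignments)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct rows of x.
    centroids = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distance from every row to every centroid: shape (n, k).
        dist = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
        assign = dist.argmin(1)
        for j in range(k):
            members = x[assign == j]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(0)
    # Final assignment against the updated centroids.
    dist = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
    return centroids, dist.argmin(1)


def pq_compress(emb, n_subspaces=4, n_centroids=16):
    """Hypothetical helper: quantize each sub-space of `emb` separately."""
    vocab, dim = emb.shape
    assert dim % n_subspaces == 0, "dim must be divisible by n_subspaces"
    assert n_centroids <= 256, "codes are stored as uint8"
    sub = dim // n_subspaces
    codebooks = np.empty((n_subspaces, n_centroids, sub), dtype=emb.dtype)
    codes = np.empty((vocab, n_subspaces), dtype=np.uint8)
    for s in range(n_subspaces):
        block = emb[:, s * sub:(s + 1) * sub]
        codebooks[s], codes[:, s] = kmeans(block, n_centroids, seed=s)
    return codebooks, codes


def pq_reconstruct(codebooks, codes):
    """Look up each code in its codebook and concatenate the sub-vectors."""
    parts = [codebooks[s][codes[:, s]] for s in range(codebooks.shape[0])]
    return np.concatenate(parts, axis=1)


# Stand-in for a language model's embedding table (random, for illustration).
rng = np.random.default_rng(0)
emb = rng.standard_normal((1000, 32)).astype(np.float32)

codebooks, codes = pq_compress(emb)
approx = pq_reconstruct(codebooks, codes)

compressed_bytes = codes.nbytes + codebooks.nbytes
print(f"original: {emb.nbytes} bytes, compressed: {compressed_bytes} bytes")
```

In this sketch the compressed form keeps one byte per sub-space per token plus the shared codebooks, so the savings grow with vocabulary size. Reconstruction is lossy; the number of sub-spaces and centroids trades accuracy against size.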