FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Primary LanguagePython