/bitllama

Initial implementation of 1.58-bit Llama Model

Primary LanguagePythonApache License 2.0Apache-2.0

Issues