/picollm

On-device LLM Inference Powered by X-Bit Quantization

Primary LanguagePythonApache License 2.0Apache-2.0

Issues