A clean C implementation for quantizing a Llama 2 model and running the quantized model.
Primary language: C. License: Apache License 2.0 (Apache-2.0).