/Optimized-CPU-Implementation-of-Llama2

Optimized CPU Implementation of Llama2-LLM

Primary LanguagePythonMIT LicenseMIT

Optimized-CPU-Implementation-of-Llama2

Optimized CPU Implementation of Llama2

Implimented :-

"TheBloke/Llama-2-7B-Chat-GGML" 4-bit Model from Huggingface Hub Model Link

Simple UI on local

alt text