/llamacpp_py

Running LLaMA models with int4 quantization in Python using llama.cpp

Primary Language: C++
