/llmrocky

Rocky, AI assistant

Primary language: Python. License: BSD 3-Clause "New" or "Revised" (BSD-3-Clause)

LLaMA-Powered Rocky AI

An AI chatbot assistant named Rocky powered by LLaMA2


Supported Platforms

  • Windows
  • (optional) GPU

Pre-requisites

How to Use

  • Install Python for Windows
  • Clone this repository
git clone https://github.com/by-park/llmrocky.git
  • Install pre-requisites
pip install ctransformers

If a GPU is available, install with CUDA support instead:

pip install ctransformers[cuda]
  • Download LLaMA2 (llama-2-7b-chat.ggmlv3.q2_K.bin) and place the model file under the 'model' folder.
  • If no GPU is available, remove the 'gpu_layers' parameter from the model-loading call in main.py

Change this line (with GPU):

llm = AutoModelForCausalLM.from_pretrained("model\\llama-2-7b-chat.ggmlv3.q2_K.bin", model_type="llama", gpu_layers=32)

to this (without GPU):

llm = AutoModelForCausalLM.from_pretrained("model\\llama-2-7b-chat.ggmlv3.q2_K.bin", model_type="llama")
  • Run main.py (press F5 if you opened it in IDLE, Python's default editor), or from a command prompt:
python main.py
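
Rather than editing main.py by hand for each machine, the GPU and CPU variants above could be selected with a flag. The following is only a rough sketch, not the repository's actual code: the helpers `build_load_kwargs`, `load_model`, and `main`, and the sample prompt, are hypothetical names introduced here for illustration. The model path, `model_type="llama"`, and `gpu_layers=32` are taken from the lines above.

```python
from pathlib import Path

# Model location as described in the steps above (the 'model' folder).
MODEL_PATH = Path("model") / "llama-2-7b-chat.ggmlv3.q2_K.bin"


def build_load_kwargs(use_gpu: bool, gpu_layers: int = 32) -> dict:
    """Hypothetical helper: build keyword arguments for from_pretrained.

    'gpu_layers' is included only when a GPU should be used, which mirrors
    the manual edit described above.
    """
    kwargs = {"model_type": "llama"}
    if use_gpu:
        kwargs["gpu_layers"] = gpu_layers
    return kwargs


def load_model(use_gpu: bool):
    """Hypothetical helper: load the LLaMA2 model with ctransformers."""
    # Imported here so the helper above can be used without ctransformers.
    from ctransformers import AutoModelForCausalLM

    if not MODEL_PATH.is_file():
        raise FileNotFoundError(f"Model file not found: {MODEL_PATH}")
    return AutoModelForCausalLM.from_pretrained(
        str(MODEL_PATH), **build_load_kwargs(use_gpu)
    )


def main(use_gpu: bool = False) -> None:
    """Hypothetical entry point: load the model and answer one prompt."""
    llm = load_model(use_gpu)
    # Calling the model object with a prompt returns the generated text.
    print(llm("Hello, Rocky!"))
```

Calling `main(use_gpu=True)` would correspond to the "with GPU" line and `main()` to the "without GPU" line. The check on `MODEL_PATH` is there so that a missing download is reported with the expected file location.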

Work in Progress

Asset Source