/LlamaWebServer

Web server implementation of Llama

Primary LanguageJavaScriptGNU General Public License v3.0GPL-3.0

Llama Web Server

This is a web server implementation of Llama that lets you run a GGUF model file locally. It includes a user interface that's similar to WhatsApp. You can use the default GGUF model file that is downloaded during the installation process or you can download a GGUF model file from HuggingFace.co and place it in the model folder.

How to run the server

  • Run npm install
  • Run npm run download:q8 or npm run download:q3 (for systems with limited RAM)
  • Run npm run start
  • Browse to http://localhost

How to run the server using a different port

  • Run npm run start 8080
  • Browse to http://localhost:8080

How to run the server in the background

  • Run npm install -g forever
  • Run npm run forever
  • Browse to http://localhost
  • To stop the server, run npm run stop

Forcing the light and dark modes

  • Browse to http://localhost/?lightmode
  • Browse to http://localhost/?darkmode

System prompt

The system prompt is defined in the strings.js file.

Disclaimer

You are legally responsible for any damage that you could cause with this software.