
Ollami 🖐️

"Oh l'ami" - French for "Hi friend!"

[Screenshot: the Ollami chat interface]

Ollami is a frontend application designed to interact with local Ollama models for text generation, reasoning, chat and more.

Why Use Ollami? 💡

  • Save time and resources by running your favorite models directly on your machine.
  • Quickly access and interact with a wide range of models, available directly in the interface.
  • Seamlessly test and evaluate local model performance in a real-world application context.

How to install Ollama 🤝

Get up and running with large language models locally: Ollama Website.

macOS 🍎

Download Ollama for macOS

Windows 🪟 (Preview)

Download Ollama for Windows

Linux 🐧

curl -fsSL https://ollama.com/install.sh | sh
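
If the script completes without errors, you can sanity-check the install before moving on (ollama --version is part of the standard CLI):

# Confirm the CLI is installed and on your PATH
ollama --version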

Install your first model (CLI) ⚡

Open your favorite terminal and run the following command:

ollama run llama3:latest

That's it! Your first model is up and running!
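
Running a model this way also starts the local Ollama server, so the same model is reachable over HTTP. Here is a minimal sketch; the prompt text is just an example:

# List the models installed locally
ollama list

# Send a one-off prompt to the local HTTP API
curl http://127.0.0.1:11434/api/generate -d '{"model": "llama3:latest", "prompt": "Say hello in French"}'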

Install Ollami 🔧

With Docker 🐳

Note

This guide assumes that you have Docker Desktop installed locally. If not, please install Docker.

Clone the repository with git into your local development folder using the following commands:

git clone https://github.com/aetaix/ollami.git ollami
cd ollami

Make sure Docker Desktop is open, then run the following command:

docker compose up -d

Go to localhost:5050 to access Ollami!
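
A few standard Docker Compose commands are useful for managing the stack once it is running:

# Follow the container logs
docker compose logs -f

# Stop and remove the containers when you are done
docker compose down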

With NPM (Developers only) 🧰

Note

This guide assumes that you have installed the latest version of Node.js and npm. If not, download Node.js (which includes npm).

Clone the repository into your local development folder using the following commands:

git clone https://github.com/aetaix/ollami.git ollami
cd ollami

Install the dependencies:

npm install

Launch the app:

npm run dev

Tip

No need to add a .env variable; the app will use the default Ollama server started locally by the ollama run command. By default, the server runs on http://127.0.0.1:11434
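
If Ollami cannot reach your models, a quick way to confirm the Ollama server is up is to hit its tags endpoint, which returns the models installed locally:

curl http://127.0.0.1:11434/api/tags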

Explore Available Models

Ollami has a built-in library of models that can be downloaded and run locally.

[Screenshot: the Ollami model library]

Take the time to explore the different models available and choose the one that best suits your needs.

Here are some example models that can be downloaded:

Model                Parameters  Size    Download
Llama 3              8B          4.7GB   ollama run llama3
Mistral              7B          4.1GB   ollama run mistral
Phi-3                3.8B        2.3GB   ollama run phi3
Neural Chat          7B          4.1GB   ollama run neural-chat
Starling             7B          4.1GB   ollama run starling-lm
Code Llama           7B          3.8GB   ollama run codellama
Llama 2 Uncensored   7B          3.8GB   ollama run llama2-uncensored
Llama 2 13B          13B         7.3GB   ollama run llama2:13b
Llama 2 70B          70B         39GB    ollama run llama2:70b
Orca Mini            3B          1.9GB   ollama run orca-mini
Vicuna               7B          3.8GB   ollama run vicuna
LLaVA                7B          4.5GB   ollama run llava
Gemma                2B          1.4GB   ollama run gemma:2b
Gemma                7B          4.8GB   ollama run gemma:7b

Tip

You should have at least 8 GB of RAM available to run the 7B models, 16 GB to run the 13B models, and 32 GB to run the 33B models.
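
If you prefer to download a model ahead of time without opening a chat session, ollama pull fetches it so it appears in Ollami's library right away; gemma:2b below is just an example of a small model that fits comfortably within 8 GB of RAM:

# Download a model without starting a chat
ollama pull gemma:2b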