Description

Converts GGUF LLMs to llamafile format and uploads them to a public HF repo. You can run it from your local machine or from GitHub Actions.

Status

cp .env.example .env

docker compose up -d olmo

docker cp OLMo-1.7-7B-hf.Q8_0.gguf llfiler-olmo-1:/app/

docker exec -it llfiler-olmo-1 /bin/bash

llamafile-0.8.6/bin/llamafile-convert OLMo-1.7-7B-hf.Q8_0.gguf

huggingface-cli upload "$HF_REPO" "$HF_REPO_FILE"
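
Before uploading, it can help to confirm that the variables used in the command above are actually set inside the container. The following is a minimal sketch, assuming HF_REPO and HF_REPO_FILE come from your .env; the helper name require_vars is hypothetical and not part of this project.

```shell
# Hypothetical helper (not part of this repo): fail early if any of the
# named variables is unset or empty.
require_vars() {
  missing=0
  for name in "$@"; do
    # Look up the value of the variable whose name is stored in $name.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "missing: $name" >&2
      missing=1
    fi
  done
  return "$missing"
}

# Usage: upload only when both variables are present.
# require_vars HF_REPO HF_REPO_FILE && huggingface-cli upload "$HF_REPO" "$HF_REPO_FILE"
```

The helper only checks that the values are non-empty; it does not verify that the repo exists or that the file was produced by the conversion step.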

Copy the .github/workflows/main.yml workflow to your repo
Add the HF_TOKEN secret to your repo secrets
Enter HF_REPO, HF_REPO_FILE, REMOTE_GGUF_MODEL, and LLAMAFILE_RELEASE when you start the workflow
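
The workflow can probably also be started from a terminal instead of the GitHub web UI. The sketch below assumes the GitHub CLI (gh) is installed and authenticated, and that main.yml accepts the four names above as manual dispatch inputs. All values are placeholders, including the guessed formats of REMOTE_GGUF_MODEL and LLAMAFILE_RELEASE, and dispatch_cmd is a hypothetical helper that only prints the command so you can review it first.

```shell
# Placeholder values (assumptions); replace them with your own.
HF_REPO="your-name/your-repo"
HF_REPO_FILE="model.llamafile"
REMOTE_GGUF_MODEL="https://example.com/model.gguf"
LLAMAFILE_RELEASE="0.8.6"

# The secret can be stored once with the GitHub CLI (prompts for the value):
#   gh secret set HF_TOKEN

# Hypothetical helper: print the dispatch command instead of running it.
dispatch_cmd() {
  printf 'gh workflow run main.yml'
  for name in HF_REPO HF_REPO_FILE REMOTE_GGUF_MODEL LLAMAFILE_RELEASE; do
    # Look up the value of the variable whose name is stored in $name.
    eval "value=\$$name"
    printf ' -f %s=%s' "$name" "$value"
  done
  printf '\n'
}

dispatch_cmd
```

Run the printed command from inside your clone of the repo once the values look right.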

cp Olmo.Modelfile.example Olmo.Modelfile

Create an ollama directory in the root of the project (your secrets and models will be saved there):

mkdir ollama

docker compose up -d ollama

docker cp OLMo-1.7-7B-hf.Q8_0.gguf llfiler-ollama-1:/

In your Olmo.Modelfile, replace <path_to_model> with the filename of the GGUF model
Create a repo for the model and add the public key from ollama/id_ed25519.pub to your account
Create the model with Ollama (the tag defaults to latest):

ollama create <your_name>/<model_name>:optional_tag -f Modelfile

ollama push <your_name>/<model_name>:optional_tag
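
The Modelfile edit described in the steps above can also be scripted rather than done by hand. This is a minimal sketch, assuming the example file contains the literal placeholder <path_to_model>; the helper name fill_modelfile is hypothetical, and the real Olmo.Modelfile.example may contain other directives that are left untouched.

```shell
# Hypothetical helper (not part of this repo): write a copy of a Modelfile
# template with the <path_to_model> placeholder replaced by a GGUF filename.
fill_modelfile() {
  template=$1
  model=$2
  output=$3
  # "|" is the sed delimiter so that "/" in a path needs no escaping.
  # Filenames containing "|" or "&" would need escaping first.
  sed "s|<path_to_model>|$model|" "$template" > "$output"
}

# Usage: combines the copy and replace steps in one call.
# fill_modelfile Olmo.Modelfile.example OLMo-1.7-7B-hf.Q8_0.gguf Olmo.Modelfile
```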