mostlygeek/llama-swap

Support the v1/embeddings endpoint

Closed this issue · 0 comments

llama.cpp supports the OpenAI v1/embeddings endpoint. Requests to this endpoint also specify a model key in the POSTed JSON body.

It should be trivial to use that model key to swap in a llama.cpp instance configured to provide embeddings.
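For illustration, here is a rough sketch of what such a request body looks like and how a proxy could read the model key from it to decide which backend to swap in. This is not llama-swap's actual implementation; the function names and the model name "my-embedding-model" are hypothetical, and only the "model" and "input" fields of the OpenAI embeddings request are assumed.

```python
import json
from typing import Optional


def build_embeddings_request(model: str, text: str) -> bytes:
    """Encode an OpenAI-style v1/embeddings request body (hypothetical helper)."""
    return json.dumps({"model": model, "input": text}).encode("utf-8")


def extract_model(body: bytes) -> Optional[str]:
    """Return the top-level "model" value of a JSON body, or None if absent."""
    try:
        payload = json.loads(body)
    except ValueError:
        # Covers both malformed JSON and undecodable bytes.
        return None
    if not isinstance(payload, dict):
        return None
    model = payload.get("model")
    return model if isinstance(model, str) else None


# "my-embedding-model" is a placeholder name, not a real configuration entry.
body = build_embeddings_request("my-embedding-model", "hello world")
requested_model = extract_model(body)
print(requested_model)
```

Because the model key sits at the top level of the body, the same lookup that works for other OpenAI-style requests carrying a model key should apply to embeddings requests as well.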