Support the v1/embeddings endpoint

Question

Closed this issue 21 days ago · 0 comments

llama.cpp's server supports the OpenAI-style v1/embeddings endpoint. As in the OpenAI API, the POSTed JSON body for this endpoint includes a model key.

Since the request format is the same, it should be trivial to swap in a llama.cpp server configured to provide embeddings.
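
For illustration, here is a minimal sketch of what a client call against such an endpoint might look like, using only the Python standard library. The base URL `http://localhost:8080`, the model name `my-embedding-model`, and the helper names are assumptions made for this example, not anything taken from this project or from llama.cpp. The request and response shapes follow the OpenAI embeddings format (`model` and `input` in the request, a `data` list of objects with an `embedding` field in the response).

```python
import json
import urllib.request


def build_embeddings_request(base_url, model, text):
    """Build a POST request for an OpenAI-style /v1/embeddings endpoint.

    base_url and model are placeholders chosen by the caller (assumptions
    for this sketch); text may be a string or a list of strings.
    """
    payload = {"model": model, "input": text}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract_embeddings(response_body):
    """Pull the embedding vectors out of an OpenAI-style response dict."""
    return [item["embedding"] for item in response_body["data"]]


def embed(base_url, model, text):
    """Send the request and return a list of embedding vectors."""
    request = build_embeddings_request(base_url, model, text)
    with urllib.request.urlopen(request) as response:
        return extract_embeddings(json.load(response))


if __name__ == "__main__":
    # Assumed local server address and model name; adjust to your setup.
    vectors = embed("http://localhost:8080", "my-embedding-model", "hello world")
    print(len(vectors), "embedding(s) returned")
```

If the backend is meant to be swappable, only `base_url` (and possibly `model`) would need to change between an OpenAI-hosted endpoint and a local llama.cpp server; a hosted endpoint would additionally need an authorization header, which this sketch omits.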