c0sogi/LLMChat

text-embedding-ada-002 for embeddings

Closed this issue · 2 comments

openai says to use "text-embedding-ada-002" for all text embeddings. It's very cheap. gpt3.5/4 are 1000x more expensive tokenizer_model: str = "text-embedding-ada-002"

c0sogi commented

no. We always use the text-embedding-ada-002 model for embedding. See the OpenAIEembeddings class. You can see that model: str = "text-embedding-ada-002".

        embeddings: Embeddings = OpenAIEmbeddings(
            client=openai.Embedding,
            openai_api_key=openai_api_key,
        )

Okay. Don't want to use ada-002 as tokenizer?