brianpetro/obsidian-smart-connections

Best embedding models?

Opened this issue · 2 comments

I have a lot of psychology and neuroscience content in my vault I like to use to help write articles and things of that nature. What embeddings model should I use? I'm worried about trying different ones and spending too many tokens on it.

In terms of online vs local, currently I'm using a laptop with an i7 and 16gb RAM but no GPU. Should I opt for something online or will the lack of GPU make a difference in the quality of embeddings? I'm not too worried about uploading my data. I just want whatever will work best. Only have $5 of OpenAI credits right now so trying not to go too overboard on the cost...

The default local model would be a good start to get a feel for things. The OpenAI models should perform better but it would make sense to get familiar with things using the BGE-micro-v2 🌴

Interesting. Maybe something else is going on then. Maybe I should clarify why I asked then. I tried getting it to query my notes by asking "based on my notes" but it doesn't seem to be pulling up much relevant stuff, or will even just reference only the existing note. I remember in V1 it would be able to do what I'm asking very well... I must have something set up wrong here.