I’m trying to test more embedding models and I’m wondering what does this community use…

I know that it “may vary depending on use case”, so in that case please share model and related use case.

Currently I’m using mostly bge-large-v1.5 or instructor-xl…

(intrested in both bi encoder and cross encoder)

Thanks im advance!!!

  • r_s_s_i_u@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    As you said, it depends but my to go has been Sentence transformersSBert due to its effectiveness. But if you have access to sufficient compute or it’s for offline use case (i.e get embeddings once and just keep refusing them), embeddings from LLMs works well on most use cases