Non-OpenAI embedding models for better performance #604
dougkeiller
started this conversation in
Ideas
Replies: 1 comment
-
A local model helps narrow the scope of training to your authoritative notes. If I'm working on research or proprietary notes, I wouldn't want to risk bringing in unvetted outside data or leaking my own data.
-
While many new local embedding models have been added, I find local embeddings slow on my laptop. I suggest adding more cloud embedding models in addition to OpenAI.
The Hugging Face ranking of embedding model performance linked below could help select the next service to add. We should look at the Retrieval column as the key metric, per https://www.linkedin.com/pulse/enhance-your-rag-llm-performance-discovering-perfect-embedding-eezfc/
https://huggingface.co/spaces/mteb/leaderboard
(Screenshot: the embedding models ranked by retrieval.)
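As a rough illustration of why sorting on the Retrieval column can give a different shortlist than sorting on the overall average, here is a minimal Python sketch. The model names and scores are made-up placeholders, not real leaderboard values, and `LeaderboardRow` and `rank_by_retrieval` are hypothetical names used only for this example.

```python
from typing import NamedTuple


class LeaderboardRow(NamedTuple):
    """One row of a leaderboard export (hypothetical schema)."""
    model: str
    average: float    # overall average score across all task types
    retrieval: float  # score on retrieval tasks only


def rank_by_retrieval(rows, top_n=3):
    """Return the top_n rows ordered by retrieval score, highest first."""
    return sorted(rows, key=lambda r: r.retrieval, reverse=True)[:top_n]


# Placeholder data: these are NOT real models or real scores.
rows = [
    LeaderboardRow("placeholder-model-a", average=64.0, retrieval=52.0),
    LeaderboardRow("placeholder-model-b", average=66.0, retrieval=49.0),
    LeaderboardRow("placeholder-model-c", average=63.0, retrieval=55.0),
]

shortlist = rank_by_retrieval(rows, top_n=2)
for row in shortlist:
    print(f"{row.model}: retrieval={row.retrieval}, average={row.average}")
```

In this placeholder data, the model with the best overall average has the lowest retrieval score, so it drops out of the shortlist once retrieval is the sort key.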