Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to reproduces openai v3 embedding mteb benchmark ? #6

Open
akashAD98 opened this issue Jan 28, 2024 · 5 comments
Open

how to reproduces openai v3 embedding mteb benchmark ? #6

akashAD98 opened this issue Jan 28, 2024 · 5 comments

Comments

@akashAD98
Copy link

can you please give us some guide for this?

@akashAD98 akashAD98 changed the title how to reproduces openai v3 embedding ? how to reproduces openai v3 embedding mteb benchmark ? Jan 28, 2024
@Muennighoff
Copy link
Collaborator

Maybe the OAI team is interested in providing a guide for reproducing / their script; I didn't run the OAI v3 benchmarking unfortunately cc @Rabrg

@akashAD98
Copy link
Author

@Muennighoff bcz i wanted to test the v3-large model with reducing the dimensinality & its impact on mteb

image

@Muennighoff
Copy link
Collaborator

Note that the v3 model is on the leaderboard both with 3072 & with 256 dimensions https://huggingface.co/spaces/mteb/leaderboard

We will also add its other dimensionalities if the OAI team wants to

@akashAD98
Copy link
Author

@Muennighoff thanks .looking for OpenAI v3 embedding notebook reproduce script

@akashAD98
Copy link
Author

Note that the v3 model is on the leaderboard both with 3072 & with 256 dimensions https://huggingface.co/spaces/mteb/leaderboard

We will also add its other dimensionalities if the OAI team wants to

I'm currently experiencing difficulties in reproducing results with an older version of the OpenAI model. I was wondering if switching to a newer model version could potentially solve these issues. Unfortunately, I haven't had the chance to run the OAI v3 benchmarking script myself. Given this, I am considering trying out the latest model version, but I am uncertain if this would yield successful results

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants