What metrics are used to reproduce the evaluation in the leaderboards? #9

Open
zhimin-z opened this issue Nov 30, 2023 · 2 comments

@zhimin-z

zhimin-z commented Nov 30, 2023

image
Can you list the metric corresponding to each benchmark, one by one? @matflores @wjessup @bradybd @DarthNerdus Are all of them accuracy?

@acebot712

The usual metrics for each dataset are used. However, it would be good to update the table to state them, along with whether CoT or zero-shot prompting was used.

@zhimin-z

zhimin-z commented Dec 10, 2023

> The usual metrics for each dataset are used. However, it would be good to update the table to state them, along with whether CoT or zero-shot prompting was used.

Thanks for your quick reply. What are the usual metrics corresponding to each dataset? I cannot find any related information in this repository.
