Skip to content

Commit

Permalink
Automated leaderboard update
Browse files Browse the repository at this point in the history
  • Loading branch information
actions-user committed Dec 27, 2024
1 parent 2898342 commit 474386e
Show file tree
Hide file tree
Showing 2 changed files with 64,359 additions and 64,358 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@ gemma-2-9b-it-WPO-HB,76.72506842726064,77.82503168985093,2285,https://huggingfac
SelfMoA + gemma-2-9b-it-SimPO,75.04950944068965,71.9958856144492,1930,https://github.com/wenzhe-li/Self-MoA/,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/SelfMoA_gemma-2-9b-it-SimPO/model_outputs.json,community
Blendax.AI-gm-l3-v35,73.37270365010379,73.41035740244067,2186,https://www.blendax.ai/post/blendaxai-gm-l3-v35,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/blendaxai-gm-l3-v35/model_outputs.json,community
gemma-2-9b-it-SimPO,72.3508446939842,65.86422561532919,1833,https://huggingface.co/princeton-nlp/gemma-2-9b-it-SimPO,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/gemma-2-9b-it-SimPO/model_outputs.json,community
TOA,72.154700064664,69.0451116814115,1999,https://github.com/oceanypt/TOA,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/TOA/model_outputs.json,community
FuseChat-Gemma-2-9B-Instruct,70.18106263911686,70.49713534560247,2155,https://huggingface.co/FuseAI/FuseChat-Gemma-2-9B-Instruct,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/FuseChat-Gemma-2-9B-Instruct/model_outputs.json,community
OpenPipe MoA GPT-4 Turbo,68.37866250336802,63.15493451236265,1856,https://openpipe.ai/blog/mixture-of-agents,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/openpipe-moa-gpt-4-turbo-v1/model_outputs.json,community
gemma-2-9b-it-DPO,67.6620382198043,65.35922380122982,2016,https://huggingface.co/princeton-nlp/gemma-2-9b-it-DPO,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/gemma-2-9b-it-DPO/model_outputs.json,community
Expand Down
Loading

0 comments on commit 474386e

Please sign in to comment.