Compute average performance of models over all tasks #71

imenelydiaker · 2024-02-08T14:51:50Z

All in a CSV file:

Average performance over each task
Overall average performance
Models ranking

Note: the model_short column holds the model name without its prefix.

MathieuCiancone · 2024-02-19T17:41:23Z

script_mteb_french/results_analysis/compute_avergae_performance.py

@@ -0,0 +1,73 @@
+import os


Please rename the file to compute_average_performance.py; "avergae" is a typo.

MathieuCiancone · 2024-02-19T17:42:33Z

script_mteb_french/results_analysis/compute_avergae_performance.py

+    format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
+)
+
+


A more elegant way: os.path.basename(model_name) returns the last component of a path, which here is the model name.
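A minimal sketch of that suggestion follows. The model identifier and the helper name short_model_name are hypothetical, chosen only for illustration; the PR's actual values are not shown in this thread.

```python
import os


def short_model_name(model_name: str) -> str:
    """Return the last path component of a model identifier.

    Hypothetical helper: assumes identifiers look like "prefix/model".
    """
    return os.path.basename(model_name)


# Hypothetical identifier, not taken from the PR.
model_short = short_model_name("some-org/some-model")
print(model_short)
```

One caveat: os.path.basename returns an empty string when the path ends with a separator, so a trailing "/" would need to be stripped first.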

MathieuCiancone · 2024-02-19T18:11:37Z

script_mteb_french/results_analysis/compute_avergae_performance.py

+    new_df.insert(0, "overall_avg", overall_avg)
+
+    new_df.reset_index(inplace=True)
+    models_short_name = new_df.model.apply(lambda x: models_name_to_index[x])


I think you can simplify the code: there is no need to store the index mapping, compute on a new dataframe, and then reset the index. You can compute the averages directly on level 0 and rename the columns by prefixing them with "avg_".

# Compute average on each task type
averaged_df = result_df.T.groupby(level=0).mean().T
# Change column names by prefixing with "avg_"
averaged_df.columns = [f"avg_{col.lower()}" for col in averaged_df.columns]
# Compute overall average
averaged_df.insert(0, "avg_overall", averaged_df.mean(axis=1))
# Only keep model name in path
averaged_df.index = averaged_df.index.map(os.path.basename)
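The suggestion above can be tried on a small toy table. The sketch below assumes that result_df has model paths as its index and two-level columns of (task type, task name); that layout, the model names, the scores, and the "rank" column are all hypothetical, since the PR's actual dataframe is not shown here.

```python
import os

import pandas as pd

# Hypothetical toy results: rows are model paths, columns are (task type, task name).
columns = pd.MultiIndex.from_tuples(
    [
        ("Classification", "task_a"),
        ("Classification", "task_b"),
        ("Retrieval", "task_c"),
    ],
    names=["task_type", "task_name"],
)
result_df = pd.DataFrame(
    [[0.5, 0.7, 0.3], [0.8, 0.6, 0.4]],
    index=["org_one/model-small", "org_two/model-large"],
    columns=columns,
)

# Average over tasks within each task type (level 0 of the column index).
averaged_df = result_df.T.groupby(level=0).mean().T
# Prefix the per-type columns with "avg_".
averaged_df.columns = [f"avg_{col.lower()}" for col in averaged_df.columns]
# Overall average: the mean of the per-type averages, not of all individual tasks.
averaged_df.insert(0, "avg_overall", averaged_df.mean(axis=1))
# Keep only the last path component as the model name.
averaged_df.index = averaged_df.index.map(os.path.basename)
# Assumed ranking step: 1 is the best overall average.
averaged_df["rank"] = averaged_df["avg_overall"].rank(ascending=False).astype(int)

print(averaged_df)
```

Note that averaging the per-type averages weights every task type equally, so the result can differ from a plain mean over all tasks when the types contain different numbers of tasks. Which of the two the CSV should report is a choice for the PR author.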

imenelydiaker added 4 commits February 8, 2024 13:56

get mteb results --incomplete

5d2bfe8

get mteb results --incomplete

473125a

Merge branch '69-get-models-performance-on-other-languages' of https:…

35cd2e1

…//github.com/Lyon-NLP/mtebscripts into 69-get-models-performance-on-other-languages

compute average performance of models

2e6f083

imenelydiaker requested review from wissam-sib, MathieuCiancone and schmarion February 8, 2024 14:51

imenelydiaker linked an issue Feb 8, 2024 that may be closed by this pull request

Get average performance over all tasks #69

Open

Average performance CSV clean

fd061cf

MathieuCiancone reviewed Feb 19, 2024

imenelydiaker added 2 commits February 20, 2024 13:10

performance of English vs French

a12e66e

Merge branch '69-get-models-performance-on-other-languages' of https:…

7018f1c

…//github.com/Lyon-NLP/mtebscripts into 69-get-models-performance-on-other-languages
