Add script to merge results of mteb-fr with those of mteb-original #79

imenelydiaker · 2024-02-19T21:42:44Z

Copy model folder with all tasks from mteb-fr when model ha snot been evaluated in mteb
Copy task file from mteb-fr to existing model folder in mteb
Add "fr" results from mteb-fr to existing model evaluation on a task in mteb.
[NOT] Replace existing "fr" results. (will not do because revision number is different)
Handle model name case (e.g. LASER2 in mteb == laser2 in mteb-fr)
Better handling of mteb/results repo (bash script to install git lfs and clone mteb/results repository from HF)

…bscripts into 6-put-results-on-hf

MathieuCiancone · 2024-02-26T13:48:48Z

script_mteb_french/src/UniversalSentenceEncoderEmbeddingFunction.py

@@ -34,7 +34,8 @@ def model_name(self):
        return self._model_name

    def encode_documents(self, input: Documents) -> Embeddings:
-        return self.model(input).numpy().tolist()
+        truncated_documents = [" ".join(x.split(' ')[:self.max_token_length]) for x in input]


Why splitting on words instead of tokens ?

AbstractEmbeddingFunction already has a truncation method implemented, which is called before the encode_documents method so no need to implement truncation here :)

I think @wissam-sib added this because we couldn't run MUSE-large. But it was on another PR and it was already merged into main I think (I rebased this branch). Should I remove this ?

MathieuCiancone · 2024-02-26T13:49:52Z

script_mteb_french/upload_results_to_hf.py

+    'devtest'
+]
+
+def split_model_name(model_name: str):


more elegant to use os.path.basename(mypath) to get the last folder

Add script to copy mteb-fr results to mteb-orig

3c41d44

imenelydiaker requested a review from MathieuCiancone February 19, 2024 21:42

imenelydiaker linked an issue Feb 19, 2024 that may be closed by this pull request

Put Results on HF #6

Open

imenelydiaker added 8 commits February 20, 2024 13:39

refactoring

4b3b54e

Add script to copy mteb-fr results to mteb-orig

d04ed94

refactoring

d4755e8

Merge branch '6-put-results-on-hf' of https://github.com/Lyon-NLP/mte…

0b08ba2

…bscripts into 6-put-results-on-hf

Add script to copy mteb-fr results to mteb-orig

e06bc36

refactoring

a5d6d5c

Add script to copy mteb-fr results to mteb-orig

30f9fb6

Merge branch '6-put-results-on-hf' of https://github.com/Lyon-NLP/mte…

c62bb22

…bscripts into 6-put-results-on-hf

MathieuCiancone approved these changes Feb 26, 2024

View reviewed changes

MathieuCiancone self-requested a review February 26, 2024 13:53

Merge branch 'main' into 6-put-results-on-hf

1c3767b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add script to merge results of mteb-fr with those of mteb-original #79

Add script to merge results of mteb-fr with those of mteb-original #79

imenelydiaker commented Feb 19, 2024 •

edited

Loading

MathieuCiancone Feb 26, 2024

imenelydiaker Feb 26, 2024 •

edited

Loading

MathieuCiancone Feb 26, 2024

Add script to merge results of mteb-fr with those of mteb-original #79

Are you sure you want to change the base?

Add script to merge results of mteb-fr with those of mteb-original #79

Conversation

imenelydiaker commented Feb 19, 2024 • edited Loading

MathieuCiancone Feb 26, 2024

Choose a reason for hiding this comment

imenelydiaker Feb 26, 2024 • edited Loading

Choose a reason for hiding this comment

MathieuCiancone Feb 26, 2024

Choose a reason for hiding this comment

imenelydiaker commented Feb 19, 2024 •

edited

Loading

imenelydiaker Feb 26, 2024 •

edited

Loading