Benchmark GitHub Actions workflow #31163

ydshieh · 2024-05-31T12:22:46Z

What does this PR do?

Benchmark GitHub Actions workflow.

(Not done yet)

Maybe change decoding steps from 128 to 1024
Run against Llama2 too

We are mostly only interested in summary.json file.

However, I am uploading the whole directory of an experiment in each run. The reason is to keep the benchmark config files available if we ever need to access this information. However, this seems a bit too much (too many files for which most of time they remain the same across different dates)

HuggingFaceDocBuilderDev · 2024-05-31T12:44:39Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker

Looks great already!

ArthurZucker · 2024-05-31T12:45:39Z

.github/workflows/benchmark.yml

+        working-directory: /transformers
+        run: |
+          python3 -m pip install optimum-benchmark>=0.2.0
+          HF_TOKEN=${{ secrets.TRANSFORMERS_HUB_BOT_HF_TOKEN }} python3 benchmark/benchmark.py --repo_id hf-internal-testing/benchmark_results --path_in_repo $(date +'%Y-%m-%d') --config-dir benchmark/config --config-name generation --commit=${{ github.sha }} backend.model=google/gemma-2b backend.cache_implementation=null,static backend.torch_compile=false,true --multirun


can we post it somewhere / run a python script to compare the results from previous commit / from average of previous commits?

I am considering doing this with a Space app which will fetch the results from this dataset and show some graph.
But if you think we should also do such comparison within the same workflow run, I can add something.
(so far the dataset is kinda empty, so maybe better I add that part one day in the next week?)

ok sounds good

ArthurZucker · 2024-05-31T12:46:13Z

.github/workflows/benchmark.yml

+        working-directory: /transformers
+        run: |
+          python3 -m pip install optimum-benchmark>=0.2.0
+          HF_TOKEN=${{ secrets.TRANSFORMERS_HUB_BOT_HF_TOKEN }} python3 benchmark/benchmark.py --repo_id hf-internal-testing/benchmark_results --path_in_repo $(date +'%Y-%m-%d') --config-dir benchmark/config --config-name generation --commit=${{ github.sha }} backend.model=google/gemma-2b backend.cache_implementation=null,static backend.torch_compile=false,true --multirun


let's be super careful and have a finegrained token for that!

I will regenerate new tokens for this and also for .github/workflows/check_tiny_models.yml which also uses the same token.

Update to a new secret TRANSFORMERS_BENCHMARK_TOKEN (finegrained token)

ArthurZucker

Almost good to go, let's run it when important model slow is triggered as well

ArthurZucker · 2024-06-03T12:09:17Z

.github/workflows/benchmark.yml

+on:
+  schedule:
+    - cron: "17 2 * * *"


IMO we should run it when pushed on main for important models!

Yeah, I agree. I would instead push the results to another dataset (one for daily CI, one for push to main event)

update to add the following new block:

for .github/workflows/push-important-models.yml being triggered

upload to hf-internal-testing/benchmark_results_merge_event

- name: Benchmark (merged to main event) if: github.event_name == 'push' && github.ref_name == 'main' working-directory: /transformers run: | python3 -m pip install optimum-benchmark>=0.2.0 HF_TOKEN=${{ secrets.TRANSFORMERS_BENCHMARK_TOKEN }} python3 benchmark/benchmark.py --repo_id hf-internal-testing/benchmark_results_merge_event --path_in_repo $(date +'%Y-%m-%d') --config-dir benchmark/config --config-name generation --commit=${{ github.sha }} backend.model=google/gemma-2b backend.cache_implementation=null,static backend.torch_compile=false,true --multirun

ArthurZucker

Thanks! 🚀

ydshieh added 14 commits May 30, 2024 16:12

benchmark workflow

058cecf

benchmark workflow

55e826e

benchmark workflow

edfc006

benchmark workflow

ef197e6

build

8c2a653

build

100e0dc

build

231aed2

build

57bf799

build

01f8532

build

1c76378

build

e3ed8d9

build

0918d50

build

4ef4539

build

65bce8f

ydshieh requested a review from ArthurZucker May 31, 2024 12:42

ArthurZucker reviewed May 31, 2024

View reviewed changes

build

4da667e

ydshieh requested a review from ArthurZucker May 31, 2024 13:36

ArthurZucker reviewed Jun 3, 2024

View reviewed changes

ydshieh added 3 commits June 3, 2024 15:35

build

2ba28be

build

9965a37

build

a169bc8

ydshieh requested a review from ArthurZucker June 3, 2024 13:55

ArthurZucker approved these changes Jun 5, 2024

View reviewed changes

ydshieh merged commit 03ea160 into main Jun 5, 2024
8 checks passed

ydshieh deleted the benchmark_on_github branch June 5, 2024 08:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark GitHub Actions workflow #31163

Benchmark GitHub Actions workflow #31163

ydshieh commented May 31, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented May 31, 2024

ArthurZucker left a comment

ArthurZucker May 31, 2024

ydshieh May 31, 2024

ArthurZucker Jun 3, 2024

ArthurZucker May 31, 2024

ydshieh May 31, 2024

ydshieh May 31, 2024

ArthurZucker left a comment

ArthurZucker Jun 3, 2024

ydshieh Jun 3, 2024

ydshieh Jun 3, 2024

ArthurZucker left a comment

Benchmark GitHub Actions workflow #31163

Benchmark GitHub Actions workflow #31163

Conversation

ydshieh commented May 31, 2024 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented May 31, 2024

ArthurZucker left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ArthurZucker left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ArthurZucker left a comment

Choose a reason for hiding this comment

ydshieh commented May 31, 2024 •

edited

Loading