Group logs online evals #708

vrigal · 2024-06-27T15:28:37Z

Based on #660

Closes #697

This was more complex than I thought, as it is not possible to increment an existing table from the Python client.
I ran the eval.py script for every evaluation task of group N1O85rIASLmCwKfUpCvTlw. Results are displayed on https://wandb.ai/teklia/da-en/runs/5ljd4a4n

eu9ene · 2024-07-02T19:10:52Z

pipeline/eval/eval.py

+        group_logs_client.open(resume=True)
+
+        # Restore existing metrics data
+        data = list_existing_group_logs_metrics(group_logs_client.wandb)


We run many evals tasks at the same time. So there's a chance of race condition here, right? I'm thinking maybe we can pursue an approach with building an automatic report based on the metrics we already write to the runs.

This is indeed possible, and would mean that the second eval metric overrides the first one. However, the chances of this happening are small: listing takes ~1.5s and publishing takes ~2s (which is small compared to the runtime of eval tasks). Relying on runs would probably be a better approach here.

I agree chances are low but still, as we discussed, let's explore the approach with reporting not to republish the evals again.

eu9ene reviewed Jul 2, 2024

View reviewed changes

vrigal added 7 commits July 8, 2024 16:38

Upgrade group_logs metrics from online evaluation tasks

e42e942

Support incrementing group_logs metrics table

916112d

Use real run name

b30323e

Remove useless indent & fixes

a8da475

Nits

590308f

Support disabled publication through WANDB_PUBLICATION

600101a

Fix linting

e0cee4c

vrigal force-pushed the group-logs-online-evals branch from 82e7c53 to e0cee4c Compare July 8, 2024 14:38

Merge branch 'main' into group-logs-online-evals

8dae6c3

eu9ene approved these changes Aug 8, 2024

View reviewed changes

eu9ene merged commit d5b94fe into mozilla:main Aug 8, 2024
35 checks passed

vrigal mentioned this pull request Aug 30, 2024

Add group ID suffix to group_logs metrics published from online evaluation tasks #820

Merged

vrigal mentioned this pull request Oct 11, 2024

Models are missing in group logs #874

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Group logs online evals #708

Group logs online evals #708

vrigal commented Jun 27, 2024

eu9ene Jul 2, 2024

vrigal Jul 3, 2024

eu9ene Jul 8, 2024

Group logs online evals #708

Group logs online evals #708

Conversation

vrigal commented Jun 27, 2024

eu9ene Jul 2, 2024

Choose a reason for hiding this comment

vrigal Jul 3, 2024

Choose a reason for hiding this comment

eu9ene Jul 8, 2024

Choose a reason for hiding this comment