Describe the Issue
Metrics calculated by the evaluation hook are sometimes logged as `train/{metric_name}` and sometimes as `val/{metric_name}`.

More precisely, imagine that the evaluation interval is 250 iterations and the logging interval is 20 iterations. On the 250th iteration your evaluation results are logged as `val/{metric_name}`. After that, on the 500th iteration both train-loss logging and evaluation occur, and your evaluation results are logged as `train/{metric_name}`.
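For concreteness, a setup like the one above would typically come from config values along these lines (an illustrative snippet; the hook names are the usual mmcv logger hooks, but the surrounding config layout is project-specific):

```python
# Illustrative config: evaluate every 250 iterations,
# log training metrics every 20 iterations.
evaluation = dict(interval=250)
log_config = dict(
    interval=20,
    hooks=[
        dict(type='TextLoggerHook'),
        dict(type='TensorboardLoggerHook'),
    ])
```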
It is extremely frustrating, especially if you use the TensorBoard logger, which builds two different charts for `train/{metric_name}` and `val/{metric_name}`.

Bug fix
This issue is caused by this line of code: https://github.com/open-mmlab/mmcv/blob/master/mmcv/runner/hooks/logger/base.py#L61
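For context, the logic at that line is roughly the following (paraphrased from memory, so treat it as an approximation of the actual source rather than a verbatim copy):

```python
# Paraphrase of the mode-detection logic in LoggerHook.get_mode: while the
# runner is in 'train' mode, the tag is guessed from whether 'time' is present
# in the log buffer, and 'time' is only written when a training iteration is
# being logged.
def get_mode(runner):
    if runner.mode == 'train':
        if 'time' in runner.log_buffer.output:
            return 'train'   # a train-loss logging step: tag metrics as train/
        return 'val'         # no timing info in the buffer: assume evaluation
    return 'val'             # runner is already in validation mode
```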
When a training iteration is logged, `time` is in the log buffer; otherwise it is not. That line comes from this PR, which applied the `get_mode` method everywhere, even though it was previously used only in the text logger, which logs the mode for the whole iteration rather than for each data point separately. Some of the issues caused by that PR have already been fixed.

Before that pull request was merged, evaluation metrics were always logged with the `train` tag. If such behavior is acceptable, I am willing to create a PR.

However, if you think we need to always log evaluation results with the `val` tag, it will require a lot of redesigning, because we will either need to create separate hook methods for evaluation or make `EvalHook` explicitly set `val` mode and flush the logger in the same manner as the runner's `val(...)` method currently does.
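To make the second option more concrete, here is a rough sketch of what forcing `val` mode could look like. This is only an outline of the idea, not the actual mmcv `EvalHook`: `eval_fn` is an assumed user-supplied callable standing in for the real evaluation logic, and the flush step simply reuses the existing `after_val_epoch` hook point.

```python
from mmcv.runner import Hook


class ValModeEvalHook(Hook):
    """Hypothetical eval hook that logs its metrics under the val/ tag.

    `eval_fn` is an assumed callable taking the runner and returning a dict of
    metrics; the real EvalHook instead runs the test pipeline over a dataloader.
    """

    def __init__(self, eval_fn, interval=250):
        self.eval_fn = eval_fn
        self.interval = interval

    def after_train_iter(self, runner):
        if not self.every_n_iters(runner, self.interval):
            return
        metrics = self.eval_fn(runner)          # e.g. {'mAP': 0.42}
        prev_mode = runner.mode
        runner.mode = 'val'                     # logger hooks now pick the val/ tag
        runner.log_buffer.update(metrics)       # push metrics into the log buffer
        runner.log_buffer.ready = True          # mark them ready to be written out
        runner.call_hook('after_val_epoch')     # flush logger hooks, as runner.val(...) does
        runner.mode = prev_mode                 # hand control back to training
```

The design choice here is the same one described above: the hook temporarily puts the runner into `val` mode and flushes the loggers immediately, instead of leaving the evaluation metrics in the buffer to be tagged by whatever mode the next logging step happens to detect.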