add exception handling and retry on loading train batch #194

AnnaKwa · 2020-03-19T23:58:41Z

This allows the model training to proceed by skipping a batch if it runs into data quality or read issues. It will also save the actual number of batches used in the final config copy.

oliverwm1

Looks good, thanks for adding the logging statements!

I found the exception handling in _create_training_batch_with_retries a bit hard to follow. What do you think about just doing the retries no matter what the exact ValueError is?

fv3net/regression/sklearn/train.py

fv3net/regression/dataset_handler.py

AnnaKwa · 2020-03-20T23:10:48Z

Thanks for the review @oliverwm1 - I left a comment about the retry exception, let me know what you think. Ready for re-review

oliverwm1

LGTM!

Pre commit config, flake8, black

Anna Kwa added 7 commits March 19, 2020 22:58

add exception handling and retry on loading train batch

8461835

adjust number of batches in config record if bad batch skipped

243e43a

fix errors

bdef986

fix errors

a5b3c29

rm unused import

921fec4

add backoff dependency to setup.py

963fbf5

rm backoff from setup.py and add envoriment.yml

431dd77

oliverwm1 requested changes Mar 20, 2020

View reviewed changes

fv3net/regression/sklearn/train.py Outdated Show resolved Hide resolved

fv3net/regression/dataset_handler.py Show resolved Hide resolved

Anna Kwa added 2 commits March 20, 2020 23:08

use custom error for retries

090ad3d

fix typos and lint

3890ecb

AnnaKwa requested a review from oliverwm1 March 20, 2020 23:10

oliverwm1 approved these changes Mar 23, 2020

View reviewed changes

AnnaKwa merged commit db72139 into master Mar 23, 2020

AnnaKwa deleted the fix/ml_training_read_errors branch March 23, 2020 18:18

spencerkclark pushed a commit that referenced this pull request May 7, 2021

Merge pull request #194 from TomAugspurger/pre-commit-config

67b85fd

Pre commit config, flake8, black

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add exception handling and retry on loading train batch #194

add exception handling and retry on loading train batch #194

AnnaKwa commented Mar 19, 2020

oliverwm1 left a comment

AnnaKwa commented Mar 20, 2020

oliverwm1 left a comment

add exception handling and retry on loading train batch #194

add exception handling and retry on loading train batch #194

Conversation

AnnaKwa commented Mar 19, 2020

oliverwm1 left a comment

Choose a reason for hiding this comment

AnnaKwa commented Mar 20, 2020

oliverwm1 left a comment

Choose a reason for hiding this comment