Load ckpt_path when given to test/validate/predict #8347

SeanNaren · 2021-07-09T08:59:54Z

🚀 Feature

When ckpt_path is passed to the Trainer's test/validate/predict functions, the checkpoint weights should be loaded even if a model is provided.

Motivation

I noticed that one of our DeepSpeed tests was incorrect (see here). resume_from_checkpoint does not reload the weights for test/validate/predict, which is probably the right thing to do. However, when I modified the test to pass ckpt_path to the test function, I noticed that the weights were still not loaded: ignoring ckpt_path when a model is provided is the current default behaviour.

As described by @carmocca, I suggest we change the behaviour as follows:

BEFORE

trainer.test(model, ckpt_path=None) # use provided model
trainer.test(model, ckpt_path='best') # use provided model, ignore ckpt_path
trainer.test(model, ckpt_path='my_path') # use provided model, ignore ckpt_path

trainer.fit(model)
# then
trainer.test(ckpt_path=None) # use latest model
trainer.test(ckpt_path='my_path') # load path

AFTER

trainer.test(model, ckpt_path=None) # use provided model
trainer.test(model, ckpt_path='best') # load best model
trainer.test(model, ckpt_path='my_path') # load path

trainer.fit(model)
# then
trainer.test(ckpt_path=None) # load best model
trainer.test(ckpt_path='my_path') # load path

This imo brings the behaviour in line with what's expected, and allows DeepSpeed to be used as an engine in cases where inference cannot happen without the Trainer (when there is sharding orchestration etc).
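The AFTER rules can be sketched as a small decision function. This is only an illustration of the proposed table, not the Trainer's actual implementation: `resolve_ckpt_path`, `model_provided` and `best_model_path` are hypothetical names introduced here, and the error raised when no best checkpoint exists is an assumption.

```python
from typing import Optional


def resolve_ckpt_path(
    ckpt_path: Optional[str],
    model_provided: bool,
    best_model_path: Optional[str],
) -> Optional[str]:
    """Return the checkpoint path to load, or None to keep the in-memory weights.

    Hypothetical helper mirroring the AFTER table in this issue.
    """
    if ckpt_path is None:
        if model_provided:
            # trainer.test(model, ckpt_path=None): use provided model
            return None
        # trainer.test(ckpt_path=None) after fit: load best model
        ckpt_path = "best"

    if ckpt_path == "best":
        if not best_model_path:
            # Assumption: without a tracked best checkpoint there is nothing to load.
            raise ValueError("ckpt_path='best' was requested but no best checkpoint is known")
        return best_model_path

    # Any other string is treated as an explicit path, model provided or not.
    return ckpt_path
```

The point of the change is that `model_provided` only matters when `ckpt_path` is `None`; an explicit `'best'` or path always results in a load.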

tchaton · 2021-07-09T11:26:10Z

Sounds good to me!

SeanNaren · 2021-07-12T09:16:22Z

The behaviour for test, validate and predict is now:

trainer.test(model, ckpt_path=None) # use provided model
trainer.test(model, ckpt_path='best') # load best model
trainer.test(model, ckpt_path='my_path') # load path

trainer.fit(model)
# then
trainer.test(ckpt_path=None) # load best model
trainer.test(ckpt_path='my_path') # load path

SeanNaren added the feature (Is an improvement or enhancement) and help wanted (Open to be worked on) labels Jul 9, 2021

SeanNaren self-assigned this Jul 9, 2021

SeanNaren mentioned this issue Jul 9, 2021

Load ckpt path when model provided in validate/test/predict #8352

Merged

SeanNaren added this to the v1.5 milestone Jul 12, 2021

tchaton closed this as completed in #8352 Jul 28, 2021

SeanNaren mentioned this issue Jul 29, 2021

[Fix] Add delay property for checkpointing, refactor loading checkpoint (DeepSpeed Checkpointing Fix 1/n) #8627

Merged
