Dev checkpt load #63

Merged
merged 8 commits into from
Mar 18, 2021

ASpeiser
Collaborator

No description provided.

@Haydnspass Haydnspass linked an issue Mar 1, 2021 that may be closed by this pull request
@Haydnspass
Collaborator

@ASpeiser Have you checked what happens with tensorboard?

@ASpeiser
Collaborator Author

ASpeiser commented Mar 2, 2021

I'm having difficulties with TensorBoard in general and often need to restart it before it picks up new data. But when I tried checkpointing, it worked fine, so I don't think this change makes it any worse.

if debug:
    experiment_path.mkdir(exist_ok=True)
else:
    experiment_path.mkdir(exist_ok=False)

model_out = experiment_path / Path('model.pt')
ckpt_path = model_out.parent / Path('ckpt.pt')
Collaborator

ckpt_path should not be hard-coded to ckpt.pt next to the model file.
Instead, ckpt_path should be inferred from param.InOut.checkpoint_init (this will probably require a bit of logic here).
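
To make the suggestion concrete, here is a minimal sketch of what that logic might look like. Only `param.InOut.checkpoint_init`, `model_out`, and `ckpt.pt` come from this thread; the helper name `infer_ckpt_path`, the `default_name` argument, and the fallback behaviour when no checkpoint is configured are assumptions for illustration, not the code that was merged.

```python
from pathlib import Path
from types import SimpleNamespace
from typing import Optional, Union


def infer_ckpt_path(model_out: Path,
                    checkpoint_init: Optional[Union[str, Path]] = None,
                    default_name: str = "ckpt.pt") -> Path:
    """Pick the checkpoint path (hypothetical helper, not part of the codebase).

    If checkpoint_init is set, use it as given; otherwise fall back to a
    file called default_name in the same directory as the model output.
    """
    if checkpoint_init is None or str(checkpoint_init) == "":
        # Assumed fallback: keep the previous behaviour as the default.
        return model_out.parent / default_name
    return Path(checkpoint_init)


# Stand-in for the real parameter object; only the attribute path
# param.InOut.checkpoint_init is taken from the review comment.
param = SimpleNamespace(InOut=SimpleNamespace(checkpoint_init=None))

experiment_path = Path("experiment")
model_out = experiment_path / "model.pt"
ckpt_path = infer_ckpt_path(model_out, param.InOut.checkpoint_init)
```

With this shape the default stays where it was, while a user-supplied `checkpoint_init` takes precedence. Whether relative paths should be resolved against the experiment directory is left open here.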

@pep8speaks

pep8speaks commented Mar 12, 2021

Hello @ASpeiser! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

Line 104:1: W293 blank line contains whitespace
Line 182:15: F541 f-string is missing placeholders

Comment last updated at 2021-03-18 20:45:25 UTC

@Haydnspass Haydnspass merged commit 68418d7 into master Mar 18, 2021
@Haydnspass Haydnspass deleted the dev_checkpt_load branch March 18, 2021 20:54
Development

Successfully merging this pull request may close these issues.

Implement checkpoint loading in codebase
3 participants