run/repro: add option to not remove outputs before reproduction #1214

efiop · 2018-10-12T01:24:56Z

https://discordapp.com/channels/485586884165107732/485596304961962003/540232443769258014
https://discordapp.com/channels/485586884165107732/485586884165107734/547430257821483008
https://discordapp.com/channels/485586884165107732/485596304961962003/557823591379369994

ghost · 2018-11-20T03:34:22Z

What about corrupting the cache, @efiop ? Is this for reflink only?

efiop · 2018-11-20T08:13:04Z

@MrOutis Great point! We should dvc unprotect the outputs when this option is specified, so that user has safe copies(or reflinks).

AlJohri · 2019-01-30T18:54:37Z

Adding my +1 here. This would be helpful for doing a warm start during parameter search. Some of our stages are very long running and starting from the previous results would help it complete much faster.

guysmoilov · 2019-02-19T15:28:13Z

@AlJohri WDYT of this (hacky) possible solution?
Add a stage before the training stage, that takes the latest cached checkpoint from the output of the training stage, moves or copies it to a different path, and then that resume checkpoint is a cached dependency of the training stage?

This should allow each git commit to describe an accurate snapshot of a training run - the checkpoint you start from, and the checkpoint that resulted.

efiop · 2019-03-20T10:22:46Z

Probably should look something like:

$ dvc run --outs-no-remove ckpt ...

which would add something like

remove: False

to the dvc file for ckpt output, so that dvc repro knows about it.

pared · 2019-03-25T10:57:54Z

Should be closed by #1759

pared · 2019-03-25T11:00:25Z

@AlJohri there is new option in run for this case, Ill add it to docs soon, but you can find it in closing issue name.

piojanu · 2019-03-26T08:34:59Z

What should I do if from time to time I want to start fresh? Use dvc remove on .dvc file or rm the output?
EDIT: dvc remove doesn't remove persistent outputs (though some progress bars appear the first time it is called, but if executed again the command does nothing). Is it expected behaviour?

pared · 2019-03-26T09:41:23Z

@piojanu actually what this option is doing is just setting persist flag inside stage file. So if you want to edit behaviour after some time, just change this flag in .dvc file. As to dvc remove, I am looking into that.

pared · 2019-03-26T09:56:00Z

@piojanu thank you for pointing that out! I created issue for that one. #1784

pared · 2019-03-26T11:49:00Z

@piojanu It is also worth noting, that this behaviour is only possible, if user actually appends data to output. If, for example, our run command somehow destroys file, persist flag will not be able to prevent that. For example, reproducing given command:
dvc run --outs-persist something "echo something > something"
will always result in overwriting the file, beause that is how > works. However, if we use ">>",
then we can utilize persist functionality.

piojanu · 2019-03-26T17:12:16Z

Yea, I understand that :) But maybe it is worth including it in docs. Also, should dvc remove (when fixed) work or it is not recommended path and it can break something?

pared · 2019-03-26T17:27:18Z

@piojanu it should work, even now, since patch entered master, it was my mistake, didn't test whether remove works for persistent outputs.

guysmoilov · 2019-04-02T12:29:51Z

Very good guys! This is a very important feature, I already started using it to great effect instead of my above workaround: https://dagshub.com/Guy/fairseq/src/dvc/dvc-example/train.dvc

I vote for having a short flag for it, I think it will be commonly used for anyone that wants to train with checkpoints, or for scenarios like this: https://discuss.dvc.org/t/version-checking-with-dvc/168/5

Maybe -p -P ?

efiop · 2019-04-02T14:56:17Z

@guysmoilov Thanks a lot for your input on discuss forum! 🙂

Mind creating a feature request for it? 🙂

shcheklein · 2019-04-11T00:01:03Z

@AlJohri quick question, but the "warm start" and "reusing previous results" - are you referring to reusing the previous model (trained with a different set of params) to continue training it with a new set? Or is it something different in your case? We would really appreciate your input here! Thanks!

AlJohri · 2019-04-11T03:39:40Z

hey there, sorry I'm actually not 100% sure what I meant before. it was related to more efficient hyperparameter searching but I can't remember the particular use case. I had a job where the parameter search took a very long time but I don't recall how I thought a warm start might solve that in this scenario

"warm start machine learning" has a lot of hits on google so perhaps you can read more about the more general use case

ghost · 2019-04-22T22:56:28Z

Hello, @guysmoilov , @AlJohri, @pared , @efiop !
I was thinking about the usefulness of having a persist flag and the reason behind it, after reading the whole convo the use case is still not clear for me.

The idea to resume a process from a certain point could be handled by the script itself (writing to a temporary file not specified on the run command as output and then, when the process has finished, move the temporary file to the wanted location).

It would be great if you can come up with an example of how you are using this feature today 😃

efiop added the enhancement Enhances DVC label Oct 12, 2018

efiop added this to the Queue milestone Oct 12, 2018

efiop self-assigned this Oct 12, 2018

efiop removed their assignment Nov 1, 2018

efiop mentioned this issue Nov 23, 2018

dvc: consider introducing build matrix #1018

Closed

efiop mentioned this issue Dec 4, 2018

Stating dependencies between scripts/modules #1401

Closed

prihoda mentioned this issue Jan 2, 2019

Reconfigurable pipelines #1462

Closed

efiop added the p1-important Important, aka current backlog of things to do label Mar 20, 2019

efiop assigned pared Mar 20, 2019

This was referenced Mar 21, 2019

run: add --outs-persist and --outs-persist-no-cache options #1759

Merged

RFC data cloud: declutter test #1761

Merged

efiop added the complexity/c3 (small fix) label Mar 21, 2019

pared closed this as completed Mar 25, 2019

pared mentioned this issue Mar 25, 2019

run: add --outs-persist and --outs-persist-no-cache description iterative/dvc.org#217

Closed

pared mentioned this issue Mar 26, 2019

dvc remove does not remove persistent outputs #1784

Closed

guysmoilov mentioned this issue Apr 14, 2019

Short flag for persisted outputs #1884

Closed

ghost mentioned this issue Jul 30, 2019

dvc: deprecate persistent outputs #2340

Closed

jorgeorpinel mentioned this issue Dec 17, 2020

guide: doc persist field in dvc.yaml, maybe improve explanation in run --outs-persist iterative/dvc.org#2027

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

run/repro: add option to not remove outputs before reproduction #1214

run/repro: add option to not remove outputs before reproduction #1214

efiop commented Oct 12, 2018 •

edited by pared

Loading

ghost commented Nov 20, 2018

efiop commented Nov 20, 2018

AlJohri commented Jan 30, 2019

guysmoilov commented Feb 19, 2019

efiop commented Mar 20, 2019

pared commented Mar 25, 2019 •

edited

Loading

pared commented Mar 25, 2019

piojanu commented Mar 26, 2019 •

edited

Loading

pared commented Mar 26, 2019

pared commented Mar 26, 2019 •

edited

Loading

pared commented Mar 26, 2019

piojanu commented Mar 26, 2019

pared commented Mar 26, 2019

guysmoilov commented Apr 2, 2019

efiop commented Apr 2, 2019

shcheklein commented Apr 11, 2019

AlJohri commented Apr 11, 2019

ghost commented Apr 22, 2019

run/repro: add option to not remove outputs before reproduction #1214

run/repro: add option to not remove outputs before reproduction #1214

Comments

efiop commented Oct 12, 2018 • edited by pared Loading

ghost commented Nov 20, 2018

efiop commented Nov 20, 2018

AlJohri commented Jan 30, 2019

guysmoilov commented Feb 19, 2019

efiop commented Mar 20, 2019

pared commented Mar 25, 2019 • edited Loading

pared commented Mar 25, 2019

piojanu commented Mar 26, 2019 • edited Loading

pared commented Mar 26, 2019

pared commented Mar 26, 2019 • edited Loading

pared commented Mar 26, 2019

piojanu commented Mar 26, 2019

pared commented Mar 26, 2019

guysmoilov commented Apr 2, 2019

efiop commented Apr 2, 2019

shcheklein commented Apr 11, 2019

AlJohri commented Apr 11, 2019

ghost commented Apr 22, 2019

efiop commented Oct 12, 2018 •

edited by pared

Loading

pared commented Mar 25, 2019 •

edited

Loading

piojanu commented Mar 26, 2019 •

edited

Loading

pared commented Mar 26, 2019 •

edited

Loading