-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
1.0 (not only) testing - comments, finding, issues, etc #3777
Comments
The default is
I don't like this, because that specific stage might be generating lots of other stuffs than just
Good point. That's a bad UX.
Broken by 18e8f07
You mean, not supporting old
Oops.
The problem is what to show for bare pipeline show. One idea is to show all possible DAGs |
not a problem I believe, I think most people will understand that stage will be running (means generating outputs). Also in majority of cases there will be one - two outputs, one of the metrics - it's totally fine. On the other hand - it's very convenient - I don't care about some stage names, I care about my model.
yes, but it's a matter of defining rules (first check stage name, second output or something like this). I would also be surprised (and it's bad design) if stage and output that are not related to each other have the same name. |
@shcheklein thank you for the great summary! Also, I'd add that repro is too verbose. I do repro for two unchanged stages without lock:
All this information is overwhelming - it takes time to read and understand that basically nothing happened. What is actually important - two stages hit the run-cache and lock file was generates (this info is not presented directly). I'd expect something like:
|
Maybe a message like "Additional output generated: someplot.png, metrics.json" would be nice in that case?
Yes. As long as there is a clear hierarchy of rules. It should be fine. I would say look for a stage name first, then look for a filename. At least I would love |
* tests: unit: move stage tests into a separate dir * stage: fix --dry-run Part of #3777
The
Can you please elaborate on this? We are keeping backward compatibility, so what do you mean by deprecate?
We need to decide what should we do for
/cc @shcheklein |
could you please elaborate a bit? not sure I completely understood your point.
If we can detect new/old project - don't allow to run this if it's a new one? I guess what I want to simplify the logic eventually in terms of what do we accept into But would love to start removing stuff that does not make much sense from the new version perpespective.
we have a separate ticket(s) for changing semantics of the I would at say as a minimum for the DVC 1.0 release we can just keep the same semantics and accept exactly the same stuff the |
@shcheklein, I have extracted two of the issues from here. And, Closing this one... |
Some notes on the playground, how to reproduce, and what I'm testing for.
Repo to reproduce: here.
I'm running testing on a new version of the
example-get-started
that I built with this updated gen script.The repo itself (that script from above generates) can be downloaded here. Should be enough to reproduce issues below described below. And can be useful for everyone as an easy entry point to try DVC 1.0 on a bit more sophisticated project.
This list below might include things not related 100% to the latest changes we introduced in 1.0, but all of them are potentially relevant. E.g., we change pipelines interface and should try scenarios around.
dvc repro
behavior and UI/UXI think we should assume:dvc.yaml
prefix by default. Instead of running a stage like thisdvc repro dvc.yaml:prepare
, I should be able to run it like this (ifdvc.yaml
is found in the current dir? or if it is a single one in the project?)#3842
Repro excessive warning on the default target file name. Either make it info (in general I would say. default target: reduce loglevel to debug #3822WARNING
is a rare thing that we need to show), or don't show again. Instead make a message about the stage that we execute more meaningful (see below)Import stage reusing lock mechanism leads to ugly warnings on every repro(PR: repro: do not log when stage is locked and is import #3823):When I runlog: reduce log, make it readable/easier #3834dvc repro
I don't really see the stage name being reproduced - we can probably include it w/o within the "Running command" message. It can solve the problem of some WARNING (see above).When you change something and it hits cache it feels that we need to explain a bit on what's going on? show summary of changes to the workspace?(Partially fixed by log: reduce log, make it readable/easier #3834)dvc repro
options--dry
does not work at all.-p
,.--pipeline
- deprecate? It does not make sense moving forward to accept.dvc
files.Related to the previous one - we still accept
.dvc
files intodvc repro
- we should not be doing this anymore - semantics is not clear anymore. Deprecate or remove.repro: fix help message for default target #3831--help
review and fix it. e.g. we don't expectDvcfile
by default anymore.We probably need "ignore-run-cache" option?run/repro: rename "build cache" -> "run cache" #3788--ignore-build-cache
- name can be confusing now. Do we keep the same semantics at all? (Same question regarding thedvc run
.)Other commands
dvc pipeline show
- expectsDvcfile
, should now do the same asdvc repro
+ fix help.dvc pipeline show
output is broken from a lot of different perspectives - see this comment for suggestions and details.We broke the progress bar for the
get/import
. It is nowDownloading 0/1 files
vsDownloading 0/36.2M
before. Mean that we effectively don't show a progress bar for the case like:This is not fixed by reverting b312895?w=1
Extracted to #3874
The text was updated successfully, but these errors were encountered: