Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cmd ref: add list command #1021

Merged
merged 4 commits into from
Mar 14, 2020
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
138 changes: 138 additions & 0 deletions public/static/docs/command-reference/list.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,138 @@
# list

List <abbr>repository</abbr> contents, including files and directories tracked
jorgeorpinel marked this conversation as resolved.
Show resolved Hide resolved
by DVC (<abbr>data artifacts</abbr>) and by Git.

## Synopsis

```usage
usage: dvc list [-h] [-q | -v] [-R] [--outs-only] [--rev [REV]] url [path]

positional arguments:
url Location of DVC repository to list.
path Path to a file or directory within the repository.
jorgeorpinel marked this conversation as resolved.
Show resolved Hide resolved
```

## Description

List files, dirs and <abbr>data artifacts</abbr> for the pointed URL. The output
is sorted lexicographically.
gurobokum marked this conversation as resolved.
Show resolved Hide resolved
jorgeorpinel marked this conversation as resolved.
Show resolved Hide resolved

With the command you may list all <abbr>data artifacts</abbr> the repo contains.
Also it works with remote repos and you can list files before trying to get it
(with `dvc get` or `dvc import`).

The `url` argument specifies the address of the DVC or Git repository containing
the data source. Both HTTP and SSH protocols are supported for online repos
(e.g. `[user@]server:project.git`). `url` can also be a local file system path.
When the url is remote Git URL the content is checkout into temporary directory.

`--outs-only` option allows to filter <abbr>data artifacts</abbr> into the repo,
so only <abbr>data artifacts</abbr> will be printed.

`path` argument is used for pointing relative path into the repo. So you may use
it when need to list files for some specific path. With recursive option `-R` it
allows to filter output by prefix.

Also with `path` argument you may check existense of some file - if the file
doesn't exist the error would be thrown.

## Options

- `--outs-only` - show only <abbr>data artifacts</abbr>.

- `-R`, `--recursive` - recursively prints the directory. When `path` is not
specified the directory is the root of the repo.

- `--rev` - commit hash, branch or tag name, etc. (any
[Git revision](https://git-scm.com/docs/revisions)) of the repository to list
content for. The latest commit in `master` (tip of the default branch) is used
by default when this option is not specified.

- `-h`, `--help` - prints the usage/help message, and exit.

- `-q`, `--quiet` - do not write anything to standard output. Exit with 0 if no
problems arise, otherwise 1.

- `-v`, `--verbose` - displays detailed tracing information. when this option is
not specified.

## Example: List remote git repo

We can use the command for getting information about remote repository with all
files, dirs and <abbr>data artifacts</abbr>.

```dvc
$ dvc list -R https://github.com/iterative/dataset-registry

.gitignore
README.md
get-started/.gitignore
get-started/data.xml
get-started/data.xml.dvc
images/.gitignore
images/dvc-logo-outlines.png
images/dvc-logo-outlines.png.dvc
images/owl_sticker.png
images/owl_sticker.png.dvc
images/owl_sticker.svg
...
```

Or

```dvc
$ dvc list https://github.com/iterative/dataset-registry

.gitignore
README.md
get-started
images
tutorial
use-cases
```

for getting flat information about the repo

## Example: List the repo with the rev

Another useful case is checking the files for the **specific revision**

```dvc
$ dvc list -R --rev 7476a858f6200864b5755863c729bff41d0fb045 \
https://github.com/iterative/dataset-registry

.gitignore
README.md
get-started/.gitignore
get-started/data.xml
get-started/data.xml.dvc
tutorial/nlp/.gitignore
tutorial/nlp/Posts.xml.zip
tutorial/nlp/Posts.xml.zip.dvc
tutorial/nlp/pipeline.zip
tutorial/nlp/pipeline.zip.dvc
tutorial/ver/.gitignore
tutorial/ver/data.zip
tutorial/ver/data.zip.dvc
tutorial/ver/new-labels.zip
```

## Example: Check the path

Before trying to get or import some <abbr>data artifacts</abbr> with `dvc get`
or `dvc import` we can check their existence with

```dvc
$ dvc list --outs-only \
https://github.com/iterative/dataset-registry \
tutorial/nlp/pipeline.zip
```

Or everything under the prefix

```dvc
$ dvc list -R --outs-only \
https://github.com/iterative/dataset-registry \
tutorial
```
4 changes: 4 additions & 0 deletions public/static/docs/sidebar.json
Original file line number Diff line number Diff line change
Expand Up @@ -241,6 +241,10 @@
"label": "install",
"slug": "install"
},
{
"label": "list",
"slug": "list"
},
{
"label": "lock",
"slug": "lock"
Expand Down