Misc. updates #2182

Merged
merged 10 commits into from
Mar 3, 2021
6 changes: 3 additions & 3 deletions content/docs/command-reference/cache/dir.md
@@ -29,10 +29,10 @@ cache directory.

## Options

- `--global` - modify a global config file (e.g. `~/.config/dvc/config`) instead
of the project's `.dvc/config`.
- `--global` - modify the global config file (e.g. `~/.config/dvc/config`)
instead of the project's `.dvc/config`.

- `--system` - modify a system config file (e.g. `/etc/dvc/config`) instead of
- `--system` - modify the system config file (e.g. `/etc/dvc/config`) instead of
`.dvc/config`.

- `--local` - modify a local [config file](/doc/command-reference/config)
8 changes: 4 additions & 4 deletions content/docs/command-reference/config.md
@@ -64,11 +64,11 @@ multiple projects and users, respectively:
need to specify private config option values that you don't want to track and
share with Git (credentials, private locations, etc).

- `--global` - modify a global config file (e.g. `~/.config/dvc/config`) instead
of the project's `.dvc/config`. Useful to apply config options to all your
projects.
- `--global` - modify the global config file (e.g. `~/.config/dvc/config`)
instead of the project's `.dvc/config`. Useful to apply config options to all
your projects.

- `--system` - modify a system config file (e.g. `/etc/dvc/config`) instead of
- `--system` - modify the system config file (e.g. `/etc/dvc/config`) instead of
`.dvc/config`. Useful to apply config options to all the projects (all users)
in the machine. May require superuser access e.g.
`sudo dvc config --system ...` (Linux).
4 changes: 2 additions & 2 deletions content/docs/command-reference/pull.md
@@ -233,9 +233,9 @@ already set up and you can use `dvc remote list` to check them. To remember how
it's done, and set a context for the example, let's define a default SSH remote:

```dvc
$ dvc remote add -d r1 ssh://_username_@_host_/path/to/dvc/remote/storage
$ dvc remote add -d r1 ssh://[email protected]/path/to/dvc/remote/storage
$ dvc remote list
r1 ssh://_username_@_host_/path/to/dvc/remote/storage
r1 ssh://[email protected]/path/to/dvc/remote/storage
```

> DVC supports several
4 changes: 2 additions & 2 deletions content/docs/command-reference/push.md
@@ -116,15 +116,15 @@ To use `dvc push` (without options), a default

```dvc
$ dvc remote add --default r1 \
ssh://_username_@_host_/path/to/dvc/cache/directory
ssh://[email protected]/path/to/dvc/cache/directory
```

> For existing <abbr>projects</abbr>, remotes are usually already set up. You
> can use `dvc remote list` to check them:
>
> ```dvc
> $ dvc remote list
> r1 ssh://_username_@_host_/path/to/dvc/cache/directory
> r1 ssh://[email protected]/path/to/dvc/cache/directory
> ```

Push entire data <abbr>cache</abbr> from the current <abbr>workspace</abbr> to
28 changes: 18 additions & 10 deletions content/docs/command-reference/remote/modify.md
@@ -134,14 +134,19 @@ these parameters, you could use the following options.
$ dvc remote modify myremote credentialpath /path/to/creds
```

- `configpath` - path to the AWS config file. The location defaults to
`~/.aws/config`. It supports S3-specific
[configuration values](https://docs.aws.amazon.com/cli/latest/topic/s3-config.html#configuration-values):
- `configpath` - path to the
[AWS CLI config file](https://docs.aws.amazon.com/cli/latest/userguide/cli-configure-files.html).
The default AWS CLI config file path (e.g. `~/.aws/config`) is used if this
parameter isn't set.

```dvc
$ dvc remote modify myremote --local configpath /path/to/config
```

> Note that only the S3-specific
> [configuration values](https://docs.aws.amazon.com/cli/latest/topic/s3-config.html#configuration-values)
> are used.

- `endpointurl` - endpoint URL to access S3:

```dvc
@@ -500,7 +505,8 @@ more information.
### Click for SSH

- `url` - remote location, in a regular
[SSH format](https://tools.ietf.org/id/draft-salowey-secsh-uri-00.html#sshsyntax):
[SSH format](https://tools.ietf.org/id/draft-salowey-secsh-uri-00.html#sshsyntax).
Note that this can already the `user` parameter, embedded into the URL:
Member:
this can already the - something is missing here?

Contributor Author:
Ah yes, "included". Thanks

```dvc
$ dvc remote modify myremote url \
@@ -513,7 +519,7 @@ more information.

> Note that your server's SFTP root might differ from its physical root (`/`).

- `user` - username to access the remote.
- `user` - username to access the remote:

```dvc
$ dvc remote modify --local myremote user myuser
@@ -523,8 +529,9 @@ more information.

1. `user` parameter set with this command (found in `.dvc/config`);
2. User defined in the URL (e.g. `ssh://[email protected]/path`);
3. User defined in `~/.ssh/config` for this host (URL);
4. Current user
3. User defined in the SSH config file (e.g. `~/.ssh/config`) for this host
(URL);
4. Current system user

- `port` - port to access the remote.

@@ -536,7 +543,8 @@ more information.

1. `port` parameter set with this command (found in `.dvc/config`);
2. Port defined in the URL (e.g. `ssh://example.com:1234/path`);
3. Port defined in `~/.ssh/config` for this host (URL);
3. Port defined in the SSH config file (e.g. `~/.ssh/config`) for this host
(URL);
4. Default SSH port 22

- `keyfile` - path to private key to access the remote.
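The `user` and `port` priority lists above both follow the same "first value that is set wins" pattern. The Python sketch below only illustrates that ordering; it is not DVC's implementation, and the names `first_set`, `resolve_user`, and `resolve_port` are hypothetical helpers made up for this example.

```python
from typing import Optional, TypeVar

T = TypeVar("T")

# Default SSH port, the last fallback in the `port` priority list.
DEFAULT_SSH_PORT = 22


def first_set(*candidates: Optional[T]) -> Optional[T]:
    """Return the first candidate that is not None (None if all are unset)."""
    for value in candidates:
        if value is not None:
            return value
    return None


def resolve_user(
    param_user: Optional[str],       # 1. `user` parameter from the DVC config
    url_user: Optional[str],         # 2. user embedded in the remote URL
    ssh_config_user: Optional[str],  # 3. user from the SSH config file for this host
    system_user: str,                # 4. current system user
) -> str:
    """Pick the SSH user following the documented priority order."""
    return first_set(param_user, url_user, ssh_config_user, system_user)


def resolve_port(
    param_port: Optional[int],       # 1. `port` parameter from the DVC config
    url_port: Optional[int],         # 2. port embedded in the remote URL
    ssh_config_port: Optional[int],  # 3. port from the SSH config file for this host
) -> int:
    """Pick the SSH port, falling back to the default SSH port."""
    return first_set(param_port, url_port, ssh_config_port, DEFAULT_SSH_PORT)
```

For instance, under these assumptions `resolve_user(None, "alice", "bob", "carol")` picks the URL user `"alice"`, because no `user` parameter is set, and `resolve_port(None, None, None)` falls back to 22.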
@@ -641,8 +649,8 @@ by HDFS. Read more about by expanding the WebHDFS section in

- `hdfscli_config` - path to a `HdfsCLI` cfg file. WebHDFS access depends on
`HdfsCLI`, which allows the usage of a configuration file by default located
in `~/.hdfscli.cfg`. In the file, multiple aliases can be set with their own
connection parameters, like `url` or `user`. If using a cfg file,
in `~/.hdfscli.cfg` (Linux). In the file, multiple aliases can be set with
their own connection parameters, like `url` or `user`. If using a cfg file,
`webhdfs_alias` can be set to specify which alias to use.

```dvc
2 changes: 1 addition & 1 deletion content/docs/start/data-pipelines.md
@@ -222,7 +222,7 @@ This should be a good point to commit the changes with Git. These include

## Reproduce

The whole point of creating this `dvc.yaml` pipeline file is an ability to
The whole point of creating this `dvc.yaml` pipelines file is an ability to
reproduce the pipeline:

```dvc
19 changes: 10 additions & 9 deletions content/docs/user-guide/external-dependencies.md
@@ -37,6 +37,9 @@ certain `dvc remote` types. Currently, the following protocols are supported:
Let's take a look at defining and running a `download_file` stage that simply
downloads a file from an external location, on all the supported location types.

> See the [Remote alias example](#example-using-dvc-remote-aliases) for info. on
> using remote locations that require manual authentication setup.

<details>

### Click for Amazon S3
@@ -88,7 +91,7 @@ $ dvc run -n download_file \
$ dvc run -n download_file \
-d ssh://[email protected]/path/to/data.txt \
-o data.txt \
scp [email protected]:/path/to/data.txt data.txt
scp ssh://[email protected]:/path/to/data.txt data.txt
Member:
hmm, I don't usually specify the protocol like this

Contributor Author:
Right. Reverting this change.

```

⚠️ DVC requires both SSH and SFTP access to work with remote SSH locations.
@@ -144,10 +147,9 @@ $ dvc run -n download_file \
## Example: Using DVC remote aliases

You may want to encapsulate external locations as configurable entities that can
be managed independently. This is useful if multiple dependencies (or stages)
reuse the same location, or if its likely to change in the future. And if the
location requires authentication, you need a way to configure it in order to
connect.
be managed independently. This is useful if the connection requires
authentication, if multiple dependencies (or stages) reuse the same location, or
if the URL is likely to change in the future.

[DVC remotes](/doc/command-reference/remote) can do just this. You may use
`dvc remote add` to define them, and then use a special URL with format
@@ -157,12 +159,11 @@ dependency.
Let's see an example using SSH. First, register and configure the remote:

```dvc
$ dvc remote add myssh ssh://myserver.com
$ dvc remote modify --local myssh user myuser
$ dvc remote modify --local myssh password mypassword
$ dvc remote add myssh ssh://[email protected]
$ dvc remote modify --local myssh password 'mypassword'
```

> Please refer to `dvc remote add` for more details like setting up access
> Please refer to `dvc remote modify` for more details like setting up access
> credentials for the different remote types.

Now, use an alias to this remote when defining the stage: