Set the default PATH to include the RAPIDS conda env #449

jacobtomlinson · 2022-04-06T13:05:10Z

As a workaround for issues raised in #438 and #439 this PR sets the PATH explicitly to include the RAPIDS conda environment.

Many container platforms, including Kubernetes, override the default entrypoint, so this change ensures binaries like jupyter are still on the PATH even when the entrypoint is not run. Likewise, docker exec does not copy the environment from the container's main process, so without the entrypoint the PATH is wrong there too.

Signed-off-by: Jacob Tomlinson <[email protected]>
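
As an illustration of the failure mode described above (not part of the PR itself), the following minimal sketch simulates command lookup with and without the environment's bin directory on PATH. The directory names `usr-bin` and `rapids-env-bin` and the helper names are hypothetical stand-ins created in a temporary directory; the sketch assumes a POSIX system.

```python
import os
import shutil
import tempfile


def make_fake_executable(directory, name):
    """Create a small executable file called `name` inside `directory`."""
    path = os.path.join(directory, name)
    with open(path, "w") as f:
        f.write("#!/bin/sh\n")
    os.chmod(path, 0o755)
    return path


def lookup_with_and_without_env(command="jupyter"):
    """Return (result without env dir, name of dir the command resolves to with it)."""
    with tempfile.TemporaryDirectory() as root:
        # Hypothetical stand-ins for /usr/bin and /opt/conda/envs/rapids/bin.
        system_bin = os.path.join(root, "usr-bin")
        env_bin = os.path.join(root, "rapids-env-bin")
        os.makedirs(system_bin)
        os.makedirs(env_bin)
        make_fake_executable(env_bin, command)

        # PATH when nothing has added the env directory: the command is not found.
        without_env = shutil.which(command, path=system_bin)

        # PATH with the env directory listed first, as the ENV instruction does.
        with_env = shutil.which(
            command, path=os.pathsep.join([env_bin, system_bin])
        )
        return without_env, os.path.basename(os.path.dirname(with_env))
```

Calling `lookup_with_and_without_env()` returns `None` for the first lookup and the env directory name for the second, which mirrors why jupyter is missing when the entrypoint is skipped.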

mmccarty

Thanks!

beckernick · 2022-04-06T13:43:53Z

Since this puts the RAPIDS environment at the front of PATH, I believe this is technically a breaking change in some situations, because PATH ordering could result in different executables being run.

Are breaking changes in containers handled similarly to libraries?
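
To make the ordering concern concrete, here is a small sketch (an illustration only, assuming a POSIX system) in which the same command name exists in two directories and the order of PATH entries decides which one is resolved. The directory names `env-bin` and `system-bin` and the function names are hypothetical.

```python
import os
import shutil
import tempfile


def make_fake_executable(directory, name):
    """Create a small executable file called `name` inside `directory`."""
    path = os.path.join(directory, name)
    with open(path, "w") as f:
        f.write("#!/bin/sh\n")
    os.chmod(path, 0o755)
    return path


def resolve_by_order(command="python"):
    """Return the directory names chosen for env-first and system-first PATHs."""
    with tempfile.TemporaryDirectory() as root:
        env_bin = os.path.join(root, "env-bin")
        system_bin = os.path.join(root, "system-bin")
        for directory in (env_bin, system_bin):
            os.makedirs(directory)
            make_fake_executable(directory, command)

        # The first PATH entry containing the command wins.
        env_first = shutil.which(
            command, path=os.pathsep.join([env_bin, system_bin])
        )
        system_first = shutil.which(
            command, path=os.pathsep.join([system_bin, env_bin])
        )
        return (
            os.path.basename(os.path.dirname(env_first)),
            os.path.basename(os.path.dirname(system_first)),
        )
```

With the env directory first the env copy is selected, and with the order reversed the system copy is selected, which is the behaviour change being asked about.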

jacobtomlinson · 2022-04-06T13:56:35Z

The PATH here is exactly what gets set by the entry point (I just copy/pasted it). And if the entry point is called then this will be overwritten anyway. So I don't think this is breaking. It just sets it for situations where the entry point isn't called.

beckernick · 2022-04-06T13:57:37Z

Makes sense, thanks for the additional context 👍

ajschmidt8

Editing the PATH variable is never ideal, but from some offline discussions with @jacobtomlinson, it seems that this is necessary to unblock some folks at Google. So we can go ahead and merge this workaround until we're able to get a proper fix in place.

jacobtomlinson · 2022-04-06T15:02:51Z

Thanks @ajschmidt8. Really appreciate the compromise here.

mmccarty · 2022-04-06T15:02:55Z

Thanks @ajschmidt8

* Set the default PATH to include the RAPIDS conda env
  Signed-off-by: Jacob Tomlinson <[email protected]>
* Generate dockerfiles
  Signed-off-by: Jacob Tomlinson <[email protected]>

Co-authored-by: Jacob Tomlinson <[email protected]>

jakirkham · 2022-04-06T16:21:24Z

generated-dockerfiles/rapidsai-core_centos7-base.amd64.Dockerfile

@@ -58,6 +58,9 @@ WORKDIR ${RAPIDS_DIR}

 COPY NVIDIA_Deep_Learning_Container_License.pdf . 
 COPY entrypoint.sh /opt/docker/bin/entrypoint
+
+ENV "PATH=/opt/conda/envs/rapids/bin:/opt/conda/condabin:/opt/conda/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin"


We do something very similar in conda-forge. Though we do this in the entrypoint script

jacobtomlinson · 2022-04-07

Yeah the challenge that we are trying to deal with is that many container platforms do not call the entrypoint script.

jakirkham · 2022-04-07

Right had a similar discussion with @mmccarty offline

IIUC some call docker run, but may also docker exec into the container. With docker exec, it is possible to prefix any command with the entrypoint script, which would handle this case
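
A rough sketch of that suggestion: the helper below builds a docker exec argument list with the entrypoint script placed in front of the user's command. The entrypoint path is the COPY destination shown in the diff above; the container name `my-rapids`, the helper name, and the assumption that the entrypoint runs whatever arguments follow it after activating the environment are illustrative and not confirmed in this thread.

```python
import shlex

# Destination of `COPY entrypoint.sh /opt/docker/bin/entrypoint` in the diff.
ENTRYPOINT = "/opt/docker/bin/entrypoint"


def docker_exec_argv(container, command, use_entrypoint=True):
    """Build the argv for `docker exec`, optionally prefixing the entrypoint."""
    argv = ["docker", "exec", container]
    if use_entrypoint:
        # Assumption: the entrypoint activates the env, then runs its arguments.
        argv.append(ENTRYPOINT)
    argv.extend(command)
    return argv


if __name__ == "__main__":
    # "my-rapids" is a hypothetical container name.
    print(shlex.join(docker_exec_argv("my-rapids", ["jupyter", "lab", "--version"])))
```

The resulting list can be passed to `subprocess.run` on a host where Docker is available.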

Manipulating the PATH will work in most cases. Where it falls short is that conda activate injects things into the current shell, which are lost without activation. Also, any activation scripts that packages ship won't be run. Usually this isn't a big deal, but sometimes it can cause issues
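
One way to see what a PATH-only setup skips is to list the shell activation scripts an environment ships. Conda environments keep these under `etc/conda/activate.d` inside the environment prefix, and `conda activate` sources them. The helper name below is hypothetical and the sketch only inspects the directory; it does not run anything.

```python
import os


def pending_activation_scripts(env_prefix):
    """List the .sh scripts under etc/conda/activate.d in `env_prefix`.

    These are the scripts that would not run if only PATH is set.
    """
    activate_dir = os.path.join(env_prefix, "etc", "conda", "activate.d")
    if not os.path.isdir(activate_dir):
        return []
    return sorted(
        name for name in os.listdir(activate_dir) if name.endswith(".sh")
    )
```

Inside one of these images, `pending_activation_scripts("/opt/conda/envs/rapids")` would show which packages rely on activation, assuming the environment lives at the prefix implied by the PATH in the diff.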

While I agree adding PATH was the right thing to do in the short term, in the medium term we may want to improve on this further to ensure activation is used in these other cases. Solutions could range from having folks run the entrypoint and including that in docs to other fixes in the container that aid with activation. Not entirely sure of all the constraints, but maybe we can discuss those offline

jacobtomlinson · 2022-04-08

Sure, I totally agree. I guess the issue is that the entrypoint is specifically a Docker thing, and there are many other container runtimes and platforms that folks are trying to use our images on (containerd, cri-o, kubernetes, kubeflow, sagemaker, vertex, azureml, etc). Most do not call the entrypoint at all if a custom command is set, and many set custom commands that cannot be overridden. So we should try not to rely on the entrypoint as a way to correctly set up our environment.

I think this would mean we need to set things up in the base environment, or manually set the environment up to be in a "post conda activate" state when the container is started. Setting environment variables should be straightforward, but I hadn't considered activation scripts.
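
As a sketch of what a manually constructed "post conda activate" state might involve, the snippet below computes a few of the variables that conda activate sets (PATH, CONDA_PREFIX, CONDA_DEFAULT_ENV) and renders them as Dockerfile ENV lines. This is an illustration under stated assumptions, not what the project adopted: the function names are hypothetical, the variable list is only a subset of what activation sets, and it does nothing about activation scripts.

```python
def post_activate_env(conda_root, env_name, base_path):
    """Return a subset of the variables `conda activate` would set."""
    prefix = f"{conda_root}/envs/{env_name}"
    return {
        "CONDA_PREFIX": prefix,
        "CONDA_DEFAULT_ENV": env_name,
        # The environment's bin directory goes first so its binaries win.
        "PATH": f"{prefix}/bin:{base_path}",
    }


def render_env_lines(env):
    """Render the variables as Dockerfile ENV instructions, sorted by name."""
    return [f'ENV {key}="{value}"' for key, value in sorted(env.items())]


if __name__ == "__main__":
    env = post_activate_env("/opt/conda", "rapids", "/usr/local/bin:/usr/bin:/bin")
    print("\n".join(render_env_lines(env)))
```

Anything set by package activation scripts would still be missing with this approach, which is the gap raised in the review above.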

Let's chat in the team meeting today and we can feed back here or in #438/#439.

Set the default PATH to include the RAPIDS conda env

907f40b

Signed-off-by: Jacob Tomlinson <[email protected]>

jacobtomlinson requested a review from a team as a code owner April 6, 2022 13:05

Generate dockerfiles

4e59c5a

Signed-off-by: Jacob Tomlinson <[email protected]>

mmccarty approved these changes Apr 6, 2022

ajschmidt8 approved these changes Apr 6, 2022

ajschmidt8 merged commit 814b719 into rapidsai:branch-22.04 Apr 6, 2022

jacobtomlinson deleted the env-path branch April 6, 2022 15:02

ajschmidt8 mentioned this pull request Apr 6, 2022

[Backport] Set the default PATH to include the RAPIDS conda env #450

Merged

jacobtomlinson mentioned this pull request Apr 6, 2022

Fix Path quote placement #452

Merged

jakirkham reviewed Apr 6, 2022

ajschmidt8 mentioned this pull request Apr 8, 2022

Fix PATH variable #460

Merged

jameslamb mentioned this pull request May 1, 2024

[FEA] add tests on image characteristics #667

Open
