docs
joelberkeley committed Jan 4, 2025
1 parent 2b90515 commit 1b29e57
Showing 2 changed files with 23 additions and 4 deletions.
24 changes: 22 additions & 2 deletions pjrt-plugins/xla-cuda/README.md
@@ -4,9 +4,29 @@ This is the PJRT plugin for CUDA-enabled GPUs. It uses the XLA compiler.

## Install

-This plugin requires Linux, a CUDA-enabled GPU, and a number of Nvidia packages. We have tested this only on Ubuntu 24.04.
+This plugin requires Ubuntu Linux 24.04 (or NVIDIA Docker), a CUDA-enabled GPU, and a number of NVIDIA packages.

-First, install NVIDIA GPU drivers. Next, install CUDA toolkit 12.6, as well as cuDNN and NCCL, making sure the versions are compatible with CUDA 12.6 and your OS. Finally, install the plugin with
+First, install NVIDIA GPU drivers. The remaining packages can be installed either on your host machine or in Docker.

### Install without Docker

To install without Docker, refer to the commands in the [Dockerfile](./Dockerfile). Next, install the plugin with
```
pack install pjrt-plugin-xla-cuda
```
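Before installing on the host, it can help to confirm the driver is already visible. This is a hedged preflight sketch, not part of the plugin; the `gpu_status` helper is illustrative only:

```shell
# Illustrative preflight check: report GPU name and driver version if
# nvidia-smi is on PATH, otherwise flag that the driver is still missing.
gpu_status() {
  if command -v nvidia-smi >/dev/null 2>&1; then
    nvidia-smi --query-gpu=name,driver_version --format=csv,noheader
  else
    echo "driver-missing"
  fi
}
status=$(gpu_status)
echo "$status"
```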

### Install with Docker

To install with NVIDIA Docker, first install [Docker](https://www.docker.com/), then the [NVIDIA Container Toolkit](https://github.com/NVIDIA/nvidia-container-toolkit). Next, clone this repository, and build the runtime Docker container with
```
docker build -t spidr -f pjrt-plugins/xla-cuda/Dockerfile .
```
Next, run the container with
```
docker run -it --gpus all --name spidr -v <absolute_path>:/spidr -w /spidr spidr bash
```
where `<absolute_path>` is a directory on the host containing any executables or source code you wish to access from the container. Next, install `pack` and its prerequisites in the container. Finally, install the plugin with
```
pack install pjrt-plugin-xla-cuda
```
Note: if your host machine also runs Ubuntu 24.04, you can instead install `pjrt-plugin-xla-cuda` and build your code on the host, then mount and run the executable in the container.
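Pulling the Docker steps above together, here is a dry-run sketch. The `~/spidr-work` mount path is hypothetical, and the commands are echoed rather than executed so you can review them before running them for real:

```shell
# Dry run of the Docker workflow: build the image, then start a
# GPU-enabled container with a host directory mounted at /spidr.
# host_dir is hypothetical -- substitute your own absolute path.
host_dir="$HOME/spidr-work"
build_cmd="docker build -t spidr -f pjrt-plugins/xla-cuda/Dockerfile ."
run_cmd="docker run -it --gpus all --name spidr -v ${host_dir}:/spidr -w /spidr spidr bash"
echo "$build_cmd"
echo "$run_cmd"
# Inside the container, install pack and its prerequisites, then:
#   pack install pjrt-plugin-xla-cuda
```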
3 changes: 1 addition & 2 deletions pjrt-plugins/xla-cuda/build.sh
@@ -2,10 +2,9 @@

script_dir=$(CDPATH="" cd -- "$(dirname -- "$0")" && pwd)
cd "$script_dir"
-# how to pin nccl?
cuda_version=$(cat CUDA_VERSION)
cudnn_version=$(cat CUDNN_VERSION)
-cd "$script_dir/../.."
+cd ../..
. ./dev.sh
rev=$(cat XLA_VERSION)

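As an aside, the directory-resolution idiom retained at the top of `build.sh` is worth a note. A minimal runnable sketch, with `$0` standing in for the script path:

```shell
# Resolve the directory containing this script, as build.sh does.
# CDPATH="" disables CDPATH search so `cd` produces no stray output,
# `--` guards against paths beginning with `-`, and `pwd` yields an
# absolute path regardless of how the script was invoked.
script_dir=$(CDPATH="" cd -- "$(dirname -- "$0")" && pwd)
echo "$script_dir"
```

This makes later relative `cd` commands in the script independent of the caller's working directory.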
