Generalize GHA select statements (to avoid hard-coding versions) #25

Closed
jakirkham opened this issue Dec 9, 2023 · 8 comments

@jakirkham
Member

As only one Dask-CUDA build is needed, we filter down to a single architecture, Python version, and CUDA version in GHA so that the package is built only once:

https://github.com/rapidsai/dask-cuda/blob/1eecb1b2ac79ae9aaff9c26d0a3c93dd57f859f3/.github/workflows/build.yaml#L69-L70

However, the selection logic currently hard-codes each of these versions, which means it can go stale as new versions are added and old ones are dropped, potentially resulting in the build being lost altogether (maybe even silently).
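For illustration, the hard-coded selection being described looks roughly like the jq filter below (the exact versions and wording in the linked workflow may differ; these values are placeholders):

```jq
# Illustrative only: select a single hard-coded (arch, Python, CUDA) combination.
# If "3.10" or "12.0.1" ever drop out of the support matrix, this silently matches nothing.
map(select(.ARCH == "amd64" and .PY_VER == "3.10" and .CUDA_VER == "12.0.1"))
```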

To avoid pinning to a specific version, @ajschmidt8 made several suggestions in this thread: rapidsai/dask-cuda#1294 (comment)

Filing this to track for follow-up

@jakirkham
Member Author

Also some relevant discussion in this thread: rapidsai/cudf#15174 (comment)

@bdice
Contributor

bdice commented Feb 28, 2024

I worked through a bit of jq and I think I have a decent solution.


Case 1: Pure Python, for each CUDA major version

For pure Python packages that need a build for each CUDA major version, like dask-cudf-cu11 / dask-cudf-cu12, we select the highest (Python, CUDA) pair from each CUDA major version group.

map(select(.ARCH == "amd64")) | group_by(.CUDA_VER|split(".")|map(tonumber)|.[0]) | map(max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]))

This was originally posted here: rapidsai/cudf#15174 (comment)
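As a quick local sanity check, the expression can be run over a toy matrix with jq (the entries below are made up for illustration; real matrix entries also carry fields like LINUX_VER, gpu, and driver):

```bash
# Made-up matrix spanning two CUDA major versions and two architectures.
echo '[
  {"ARCH": "amd64", "PY_VER": "3.9",  "CUDA_VER": "11.8.0"},
  {"ARCH": "amd64", "PY_VER": "3.10", "CUDA_VER": "11.8.0"},
  {"ARCH": "amd64", "PY_VER": "3.10", "CUDA_VER": "12.0.1"},
  {"ARCH": "arm64", "PY_VER": "3.10", "CUDA_VER": "12.0.1"}
]' | jq 'map(select(.ARCH == "amd64"))
  | group_by(.CUDA_VER|split(".")|map(tonumber)|.[0])
  | map(max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]))'
# Keeps one entry per CUDA major version:
# Python 3.10 + CUDA 11.8.0, and Python 3.10 + CUDA 12.0.1 (both amd64).
```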

Case 2: Pure Python, no CUDA dependency

For jobs that only need a single matrix entry (no dependence on a specific Python or CUDA version), we should use the highest (Python, CUDA) pair.

map(select(.ARCH == "amd64")) | max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]) | [.]

Optionally, you can drop the map(select(.ARCH == "amd64")) if you want jobs for both amd64 and arm64.

These expressions assume that we do not care what OS value (LINUX_VER), GPU hardware (gpu), or driver (driver) is used in the resulting jobs.
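The single-entry case can be checked the same way; with a similar made-up matrix, only the highest (Python, CUDA) combination survives, and the trailing [.] wraps the winner back into a one-element array:

```bash
# Made-up matrix; Case 2 reduces it to exactly one entry.
echo '[
  {"ARCH": "amd64", "PY_VER": "3.9",  "CUDA_VER": "11.8.0"},
  {"ARCH": "amd64", "PY_VER": "3.10", "CUDA_VER": "12.0.1"},
  {"ARCH": "arm64", "PY_VER": "3.10", "CUDA_VER": "12.0.1"}
]' | jq 'map(select(.ARCH == "amd64"))
  | max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))])
  | [.]'
# -> [{"ARCH": "amd64", "PY_VER": "3.10", "CUDA_VER": "12.0.1"}]
```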

@jameslamb
Member

@bdice I tried the "Pure Python, no CUDA dependency" example in rapidsai/rapids-dask-dependency#28

Worked perfectly, thank you 😊

vyasr pushed a commit to rapidsai/rapids-dask-dependency that referenced this issue Feb 29, 2024
Contributes to rapidsai/build-planning#25.
Contributes to rapidsai/build-planning#3.

This project uses a `jq` filter in its GitHub Actions configuration to
select exactly 1 combination of `(architecture, Python version, CUDA
version)` for each of its CI jobs.

This PR proposes removing string literals referencing specific versions,
so that the configuration won't have to be updated in the future as
RAPIDS changes its supported matrix of Python and CUDA versions.

Credit for this to @bdice / @ajschmidt8 :
rapidsai/build-planning#25 (comment).
I'm just clicking the buttons 😁
@jakirkham
Member Author

Also needed in cuxfilter

https://github.com/rapidsai/cuxfilter/blob/cfab998ffc4f348f649cd839bea5697fd8aeef02/.github/workflows/pr.yaml#L67

@bdice
Contributor

bdice commented Feb 29, 2024

I did cudf as well. rapidsai/cudf#15191

There are some hard-coded references remaining, but they're not used in quite the same way as the cases we've addressed above.

Docker needs something like an "if latest CUDA" condition, but there's no matrix filter there, so I'm not sure there's a good solution.
https://github.com/rapidsai/docker/blob/32a60e107c3ff777c7f2ddc2ac4d11a3669c3f83/.github/workflows/build-image.yml#L145

cuGraph has some PyG tests that only run on CUDA 11. I don't think these are worth modifying, since they intentionally pin to an older version. There is some open work needed to upgrade to PyTorch 2, which I think may open up the ability to use CUDA 12 here.
https://github.com/rapidsai/cugraph/blob/ac65b17ee3e9b85368f266da1a6a3b8e5717e292/.github/workflows/pr.yaml#L165
https://github.com/rapidsai/cugraph/blob/ac65b17ee3e9b85368f266da1a6a3b8e5717e292/.github/workflows/test.yaml#L79

rapids-bot bot pushed a commit to rapidsai/cuxfilter that referenced this issue Feb 29, 2024
To eliminate hard-coding, generalize the GHA workflow logic to select one build for testing. This should simplify future cuxfilter updates.

xref: rapidsai/build-planning#25

Authors:
  - https://github.com/jakirkham
  - Ajay Thorve (https://github.com/AjayThorve)

Approvers:
  - Ray Douglass (https://github.com/raydouglass)
  - Ajay Thorve (https://github.com/AjayThorve)

URL: #575
@jakirkham
Member Author

Thanks Bradley! 🙏

Yeah, I think we can close this once cuDF is fixed

rapids-bot bot pushed a commit to rapidsai/dask-cuda that referenced this issue Feb 29, 2024
To eliminate hard-coding, generalize the GHA workflow logic to select one build for testing. This should simplify future Dask-CUDA updates.

xref: rapidsai/build-planning#25

Authors:
  - https://github.com/jakirkham

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)
  - Ray Douglass (https://github.com/raydouglass)

URL: #1318
rapids-bot bot pushed a commit to rapidsai/cudf that referenced this issue Mar 5, 2024
To eliminate hard-coding, generalize the GHA workflow logic to select one build for testing. This should simplify future updates.

This is a follow-up to #15174.

xref: rapidsai/build-planning#25

Authors:
  - Bradley Dice (https://github.com/bdice)

Approvers:
  - Jake Awe (https://github.com/AyodeAwe)
  - https://github.com/jakirkham

URL: #15191
@bdice
Contributor

bdice commented Mar 9, 2024

cuDF's PR is merged: rapidsai/cudf#15191

This should be safe to close now.

@bdice bdice closed this as completed Mar 9, 2024
younseojava pushed a commit to ROCm/dask-cuda-rocm that referenced this issue Apr 16, 2024
To eliminate hard-coding, generalize the GHA workflow logic to select one build for testing. This should simplify future Dask-CUDA updates.

xref: rapidsai/build-planning#25

Authors:
  - https://github.com/jakirkham

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)
  - Ray Douglass (https://github.com/raydouglass)

URL: rapidsai#1318