Improved CPU/GPU interoperability #5001

viclafargue · 2022-11-16T17:08:08Z

No description provided.

This reverts commit 09db802.

This reverts commit 4a79bea which contained a bad find-replace.

**Purpose of this PR** - Provide an implementation of CumlArray that can be backed by either host or device memory - Hook the host-compatible CumlArray into infrastructure for device and memory type selection - Ensure that device-host and host-device transfers are minimized for both GPU and CPU execution of existing CPU/GPU interoperable models **Non-goals of this PR** - This PR is not intended to provide a version of CUML that can be imported or run on a CPU-only system - This PR does not allow CUML to be compiled without nvcc - This PR is not intended to document the CPU/GPU interoperable estimator infrastructure introduced by #4908 and improved in #5001 - This PR is intended *only* to avoid breaking existing functionality. New functionality related to CPU-only execution which is not currently being used in the codebase will be tested in a separate PR **Tests performed on this PR** Besides the obvious (standard test suite), benchmarks have been run against this PR for a variety of conversions to and from the new CumlArray. Additionally, the standard cuML benchmark suite was run against this PR and compared to results from current branch-22.12. Results for both are described below. **Notable features of this PR not immediately related to its purpose** - Implementing the new CumlArray infrastructure required us to break many of the existing circular imports in the codebase - New [utilities](https://github.com/wphicks/cuml/blob/fea-xpu_infra/python/cuml/internals/safe_imports.py) for "safe" imports in the context of GPU-only or CPU-only installs were introduced **Anticipated follow-on PRs** - #5001 - #4970 - PR(s) to make all CPU/GPU-exclusive imports "safe" - PR(s) to add additional testing for CPU-only functionality Authors: - William Hicks (https://github.com/wphicks) - Victor Lafargue (https://github.com/viclafargue) - Corey J. Nolet (https://github.com/cjnolet) Approvers:

dantegd · 2022-12-16T20:53:59Z

rerun tests

codecov-commenter · 2022-12-17T00:27:38Z

Codecov Report

❗ No coverage uploaded for pull request base (branch-23.02@447bded). Click here to learn what that means.
Patch has no changes to coverable lines.

Additional details and impacted files

@@               Coverage Diff               @@
##             branch-23.02    #5001   +/-   ##
===============================================
  Coverage                ?   79.24%           
===============================================
  Files                   ?      191           
  Lines                   ?    12374           
  Branches                ?        0           
===============================================
  Hits                    ?     9806           
  Misses                  ?     2568           
  Partials                ?        0

Flag	Coverage Δ
dask	`45.79% <0.00%> (?)`
non-dask	`69.25% <0.00%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

dantegd · 2022-12-17T02:03:49Z

@gpucibot merge

**Purpose of this PR** - Provide an implementation of CumlArray that can be backed by either host or device memory - Hook the host-compatible CumlArray into infrastructure for device and memory type selection - Ensure that device-host and host-device transfers are minimized for both GPU and CPU execution of existing CPU/GPU interoperable models **Non-goals of this PR** - This PR is not intended to provide a version of CUML that can be imported or run on a CPU-only system - This PR does not allow CUML to be compiled without nvcc - This PR is not intended to document the CPU/GPU interoperable estimator infrastructure introduced by rapidsai#4908 and improved in rapidsai#5001 - This PR is intended *only* to avoid breaking existing functionality. New functionality related to CPU-only execution which is not currently being used in the codebase will be tested in a separate PR **Tests performed on this PR** Besides the obvious (standard test suite), benchmarks have been run against this PR for a variety of conversions to and from the new CumlArray. Additionally, the standard cuML benchmark suite was run against this PR and compared to results from current branch-22.12. Results for both are described below. **Notable features of this PR not immediately related to its purpose** - Implementing the new CumlArray infrastructure required us to break many of the existing circular imports in the codebase - New [utilities](https://github.com/wphicks/cuml/blob/fea-xpu_infra/python/cuml/internals/safe_imports.py) for "safe" imports in the context of GPU-only or CPU-only installs were introduced **Anticipated follow-on PRs** - rapidsai#5001 - rapidsai#4970 - PR(s) to make all CPU/GPU-exclusive imports "safe" - PR(s) to add additional testing for CPU-only functionality Authors: - William Hicks (https://github.com/wphicks) - Victor Lafargue (https://github.com/viclafargue) - Corey J. Nolet (https://github.com/cjnolet) Approvers:

Authors: - Victor Lafargue (https://github.com/viclafargue) - William Hicks (https://github.com/wphicks) - Corey J. Nolet (https://github.com/cjnolet) - Dante Gama Dessavre (https://github.com/dantegd) - Carl Simon Adorf (https://github.com/csadorf) Approvers: - Dante Gama Dessavre (https://github.com/dantegd) URL: rapidsai#5001

wphicks and others added 30 commits October 4, 2022 16:16

Update array import path

e14fee8

Update input_utils import path

e3d0737

Move type_utils to internals

0072c2a

Remove unused imports

c22fcec

Move import_utils to internals

2214d4e

Fix tests

5c03489

Using raft::KeyValuePair instead of cub::KeyValuePair

09db802

Begin adjusting CumlArray for host inputs

57d27e9

Revert "Using raft::KeyValuePair instead of cub::KeyValuePair"

0dbe8a4

This reverts commit 09db802.

Fix _check_internal_model

495c676

Generic testing

73bb16c

Adding UMAP

a232ff6

Adding LogisticRegression

edc4d84

Merge branch 'branch-22.10' into cpu-gpu-interop-models

3c3328e

Add error checking for array construction

9d83c4b

Finish initial to_output implementation for CumlArray

7b952d6

Update CumlArray construction methods for mem_type

fb08a3f

Adding LogisticRegression 2

e5f88cf

Use consistent methods for detecting mem accessibility

1ac78ed

Handle backward-compatibility for CumlArray to_output

df4bc5b

Move logger to internals

4a79bea

Guard imports in input_utils

20b55d4

Begin refactoring input_utils with optional dependencies

ac67b32

Revert "Move logger to internals"

cda3cc8

This reverts commit 4a79bea which contained a bad find-replace.

Move logger.pyx to internals

5da57ee

Update logger import path

97ba765

Update remaining conversions in input_utils

3c3794a

Reimplement is_array_like

259b1be

Avoid short-circuiting check for is_array_like

668dfb3

Implement determine_array_memtype

114be42

wphicks and others added 11 commits December 9, 2022 17:12

Import logger into common

8172a75

Revert change to CHANGELOG

85026ab

Merge branch 'branch-23.02' into fea-xpu_infra

43f8545

Correct __le__ method for CumlArray

cda3fbd

Remove old using_memory_type and set_global_memory_type

c7988c0

Remove old device type setters

71e3650

Add tests for stride computation and order detection

3526df1

Merge branch 'branch-23.02' into fea-xpu_infra

9916639

Correct mistake in merge

ac3d542

Improved CPU/GPU interoperability

4f61962

Modularization of the dispatch function

fa8b051

viclafargue force-pushed the cpu-gpu-interop-improvements branch from d8dcaaf to fa8b051 Compare December 14, 2022 16:29

Merge branch-23.02

78432b3

github-actions bot removed the CMake label Dec 15, 2022

viclafargue marked this pull request as ready for review December 15, 2022 18:05

viclafargue requested a review from a team as a code owner December 15, 2022 18:05

dantegd added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Dec 15, 2022

fixes

a43d502

dantegd approved these changes Dec 16, 2022

View reviewed changes

FIX Remove CumlArray from api.rst

15d3211

rapids-bot bot merged commit fa4b301 into rapidsai:branch-23.02 Dec 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved CPU/GPU interoperability #5001

Improved CPU/GPU interoperability #5001

viclafargue commented Nov 16, 2022

dantegd commented Dec 16, 2022

codecov-commenter commented Dec 17, 2022

dantegd commented Dec 17, 2022

Improved CPU/GPU interoperability #5001

Improved CPU/GPU interoperability #5001

Conversation

viclafargue commented Nov 16, 2022

dantegd commented Dec 16, 2022

codecov-commenter commented Dec 17, 2022

Codecov Report

dantegd commented Dec 17, 2022