Add GPU and CPU packages for ANN benchmarks #1773

dantegd · 2023-08-25T17:54:08Z

Builds on top of #1769

- Removes the C++-based libraft-ann-bench package
- Creates raft-ann-bench packages that include C++ tests as well as Python scripts
  - The raft-ann-bench package includes all tests for CPU and GPU
  - The raft-ann-bench-cpu package does not depend on CUDA or RAFT GPU code
- Update docs
- Test artifacts and scripts in CI
- Minor code cleaning

Some changes include:

- Use the RAPIDS_DATASET_ROOT_DIR environment variable to indicate the location of datasets (optional), consistent with other repos: https://docs.rapids.ai/maintainers/datasets/
- CPU and GPU packages are built in the existing GPU build GHA. Only the CUDA 12 jobs build the CPU packages.
- Small change to how scripts are invoked. For example, `python bench/ann/run.py --dataset deep-image-96-inner` is now `python -m raft-ann-bench.run --dataset deep-image-96-inner`. The scripts are still meant to be invoked from the command line.
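
As a rough illustration of the RAPIDS_DATASET_ROOT_DIR item, a wrapper script could resolve the dataset location from the variable and fall back to a default when it is unset. This is a minimal sketch, not the code in this PR: the helper name `resolve_dataset_dir` and the `./datasets` fallback are assumptions made for the example.

```shell
# Hypothetical helper: print the directory that holds the datasets.
# The "./datasets" fallback is an assumption for illustration only.
resolve_dataset_dir() {
    printf '%s\n' "${RAPIDS_DATASET_ROOT_DIR:-./datasets}"
}

# With the variable unset, the assumed fallback is printed.
unset RAPIDS_DATASET_ROOT_DIR
resolve_dataset_dir

# With the variable exported, its value takes precedence.
export RAPIDS_DATASET_ROOT_DIR=/data/ann
resolve_dataset_dir
```

Because the variable is optional, exporting it once in the shell should be enough for a later `python -m raft-ann-bench.run --dataset deep-image-96-inner` call to pick it up, assuming the scripts read it from the environment as described.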

Future improvements:

- Remove the use of popen to run Python scripts from other Python scripts.
- Improve printing and logging.
- Allow the package's functions to be called from Python scripts.

Closes #1744

divyegala · 2023-09-01T20:01:27Z

build.sh

@@ -152,7 +154,7 @@ function limitTests {
            # Remove the full LIMIT_TEST_TARGETS argument from list of args so that it passes validArgs function
            ARGS=${ARGS//--limit-tests=$LIMIT_TEST_TARGETS/}
            TEST_TARGETS=${LIMIT_TEST_TARGETS}
-	    echo "Limiting tests to $TEST_TARGETS"
+        echo "Limiting tests to $TEST_TARGETS"


I think the original indentation was correct

divyegala · 2023-09-01T20:03:31Z

ci/build_python.sh

+# Build ann-bench-cpu only in CUDA 12 jobs since it only depends on python
+# version
+if [[ ${CUDA_VERSION} == "11.8.0" ]]; then


The comment does not match the conditional. Also, I think the variable name that our CI containers use is RAPIDS_CUDA_VERSION. Can you verify if CUDA_VERSION is set?

dantegd · 2023-09-01

Yes, CUDA_VERSION is set. You can see it in the logs, which show the variable both with and without the RAPIDS_ prefix: https://github.com/rapidsai/raft/actions/runs/6051700515/job/16424916331#step:7:54

bdice · 2023-09-01

Please use RAPIDS_CUDA_VERSION -- but better yet, use this snippet copied from cugraph (with CUDA 12 to match the comment as @divyegala noted above):

Suggested change

-# Build ann-bench-cpu only in CUDA 12 jobs since it only depends on python
-# version
-if [[ ${CUDA_VERSION} == "11.8.0" ]]; then
+# Build ann-bench-cpu only in CUDA 12 jobs since it only depends on python
+# version
+RAPIDS_CUDA_MAJOR="${RAPIDS_CUDA_VERSION%%.*}"
+if [[ ${RAPIDS_CUDA_MAJOR} == "12" ]]; then

https://github.com/rapidsai/cugraph/blob/2b4118aee4af912d74ce1ebe7adc39cf596899ef/ci/build_python.sh#L49-L51

dantegd · 2023-09-02

We need to build in the CUDA 11 jobs for now, but otherwise will use it!
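
For reference, the `%%.*` expansion in the suggestion removes everything from the first dot onward, leaving only the major version. Below is a small sketch of that behavior. The helper names `cuda_major` and `should_build_cpu_package` are hypothetical and do not appear in the CI scripts.

```shell
# Hypothetical helper: print the major component of a dotted version string.
cuda_major() {
    version="$1"
    # "%%.*" drops the longest suffix matching ".*",
    # i.e. everything from the first dot to the end.
    printf '%s\n' "${version%%.*}"
}

cuda_major "11.8.0"   # prints 11
cuda_major "12.0.1"   # prints 12

# Hypothetical gate: succeed only when the major version is 12.
should_build_cpu_package() {
    [ "$(cuda_major "$1")" = "12" ]
}

if should_build_cpu_package "12.0.1"; then
    echo "would build the CPU package"
fi
```

Comparing on the major version avoids hard-coding a full version string such as "11.8.0", so the condition should keep working when the minor or patch version of the CI image changes.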

cpp/CMakeLists.txt

cjnolet · 2023-09-01T20:28:00Z

/ok to test

dantegd · 2023-09-01T22:50:04Z

/ok to test

cjnolet · 2023-09-02T00:40:59Z

/ok to test

cjnolet · 2023-09-02T00:58:08Z

/ok to test

Make `cpp/bench/ann/src/common/cuda_stub.hpp` more flexible and include it in all benchmarks (instead of only the `ANN_BENCH` target). This makes the targets (benchmark libs) that define `CPU_ONLY` not depend on CUDA headers, thus enabling builds without a GPU. This should relieve #1773 of doing extra workarounds in the bench headers to achieve the same effect.

Authors:
- Artem M. Chirkin (https://github.com/achirkin)

Approvers:
- Corey J. Nolet (https://github.com/cjnolet)

URL: #1792

cjnolet · 2023-09-02T03:28:40Z

/ok to test

cpp/bench/ann/src/common/util.hpp

Co-authored-by: Artem M. Chirkin

cjnolet · 2023-09-05T14:36:28Z

/ok to test

ajschmidt8

Approving ops-codeowner file changes

achirkin

LGTM, long awaited feature!

cjnolet · 2023-09-05T16:34:04Z

/merge

achirkin and others added 30 commits August 9, 2023 13:19

ANN-benchmarks: switch to use gbench

bd738ec

Disable NVTX if the nvtx3 headers are missing

7473c62

Merge branch 'branch-23.10' into enh-google-benchmarks

aa10d7c

Merge branch 'branch-23.10' into enh-google-benchmarks

bed126c

Merge remote-tracking branch 'upstream/branch-23.10' into python-ann-bench-use-gbench

09ea7a7

try to run gbench executable

2917886

Allow to compile ANN_BENCH without CUDA

49732b1

Merge remote-tracking branch 'rapidsai/branch-23.10' into enh-google-benchmarks

76cfb40

Fix style

9b588af

Adapt ANN benchmark python scripts

6d6c17d

Make the default behavior to produce one executable per benchmark

b89b27d

Fix style problems / pre-commit

163a40c

Merge branch 'branch-23.10' into enh-google-benchmarks

0bb51a3

Merge remote-tracking branch 'rapidsai/branch-23.10' into enh-google-benchmarks

2b9f649

Merge branch 'branch-23.10' into enh-google-benchmarks

9728f7e

Merge remote-tracking branch 'origin/branch-23.10' into enh-google-benchmarks

7b1bf01

Adding k and batch-size options to run.py

1daf2bf

Merge branch 'branch-23.10' - CONFIGS ONLY - dataset_memtype follows in the next commit

4e0a53e

Add dataset_memory_type/query_memory_type as build/search parameters

04893c9

middle of merge, not building

b24fcf7

Tuning guide

30f7467

Merge remote-tracking branch 'artem/enh-google-benchmarks' into enh-google-benchmarks

3e35121

compiling, index building successful, search failing

f927f69

Merge remote-tracking branch 'corey/enh-google-benchmarks' into python-ann-bench-use-gbench

404cd10

FEA first commit rebasing changes on gbench branch

2f19c44

FIX fixing straggling changes from rebase

e0586de

Fix FAISS using a destroyed stream from previous benchmark case

0eaa7e0

Merge remote-tracking branch 'artem/enh-google-benchmarks' into enh-google-benchmarks

9896963

Fixing issue in conf file and stubbing out parameter tuning guide

4062d6f

Adding CAGRA to tuning guide

7141c21

dantegd requested review from a team as code owners September 1, 2023 18:05

FIX pep8

c6014a9

cjnolet approved these changes Sep 1, 2023

dantegd mentioned this pull request Sep 1, 2023

[FEA] Testing of raft-ann-bench packages in CI #1798

Open

3 tasks

divyegala reviewed Sep 1, 2023

FIX docs and plot datasets path

5a12ce3

dantegd added 4 commits September 1, 2023 18:15

FIX found typo in cmake

fbdc1fa

FIX missing parameter in python

954aa87

FIX correct conditional

15b0dc0

FIX for single gpu arch detection in CMake

d863ce6

FIX PR review fixes and a {yea}

0d60c56

divyegala approved these changes Sep 2, 2023

Merge remote-tracking branch 'origin/branch-23.10' into dev-enh-google-benchmarks

fcc158a

achirkin reviewed Sep 4, 2023

cpp/bench/ann/src/common/util.hpp (outdated, resolved)

Update util.hpp

1274b21

Co-authored-by: Artem M. Chirkin

ajschmidt8 approved these changes Sep 5, 2023

achirkin approved these changes Sep 5, 2023

rapids-bot bot merged commit 2a89574 into rapidsai:branch-23.10 Sep 5, 2023

cjnolet mentioned this pull request Sep 10, 2023

[FEA] CPU-only package and environments for ANN-benchmarks #1735

Closed
