add data generation to benchmark documentation (#10677)
add device data generation to benchmark documentation

Authors:
  - Karthikeyan (https://github.com/karthikeyann)

Approvers:
  - Conor Hoekstra (https://github.com/codereport)
  - Nghia Truong (https://github.com/ttnghia)

URL: #10677
karthikeyann authored Apr 19, 2022
1 parent 31a5f44 commit 65b1cbd
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions cpp/docs/BENCHMARKING.md
@@ -35,6 +35,12 @@ provided in `cpp/benchmarks/synchronization/synchronization.hpp` to help with th
can also optionally clear the GPU L2 cache in order to ensure cache hits do not artificially inflate
performance in repeated iterations.

## Data generation

Helper functions for generating benchmark input data are available in [cpp/benchmarks/common/generate_input.hpp](/cpp/benchmarks/common/generate_input.hpp). These helpers generate the input data on the device, in contrast to the `column_wrapper` classes, where data generation happens on the host.
* `create_sequence_table` generates sequence columns that start at 0 in the first row and increase by 1 in each subsequent row.
* `create_random_table` generates a table filled with random data. The parameters of the random data are configurable.
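
To make the shape of the generated data concrete, here is a minimal host-side sketch of what the two kinds of columns contain. It is an illustration only and does not use the helpers in `generate_input.hpp`, which build their data on the device. The names `make_sequence_column` and `make_random_column`, and the choice of a value range and a seed as the configurable parameters, are assumptions made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <random>
#include <vector>

// Hypothetical host-side stand-in for a sequence column:
// row 0 holds 0 and each following row is larger by 1.
std::vector<std::int32_t> make_sequence_column(std::size_t num_rows)
{
  std::vector<std::int32_t> column(num_rows);
  std::iota(column.begin(), column.end(), 0);
  return column;
}

// Hypothetical host-side stand-in for a random column. The value range and
// the seed play the role of configurable random data parameters here.
std::vector<std::int32_t> make_random_column(std::size_t num_rows,
                                             std::int32_t lower,
                                             std::int32_t upper,
                                             unsigned seed)
{
  assert(lower <= upper);  // required by std::uniform_int_distribution
  std::mt19937 engine(seed);
  std::uniform_int_distribution<std::int32_t> dist(lower, upper);
  std::vector<std::int32_t> column(num_rows);
  for (auto& value : column) { value = dist(engine); }
  return column;
}
```

Fixing the seed makes repeated runs produce identical input, which is generally what a benchmark wants so that timings are comparable across runs.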

## What should we benchmark?

In general, we should benchmark all features over a range of data sizes and types, so that we can
