[FEA] Add file size counter to cuIO benchmarks #10154
Sample output:
cpp/benchmarks/io/csv/csv_reader.cpp
Outdated
@@ -132,6 +130,7 @@ void BM_csv_read_varying_options(benchmark::State& state)
auto const data_processed = data_size * cols_to_read.size() / view.num_columns();
state.SetBytesProcessed(data_processed * state.iterations());
state.counters["peak_memory_usage"] = mem_stats_logger.peak_memory_usage();
state.counters["file_size"] = source_sink.size();
or "encoded_size" maybe?
"encoded_file_size" ?
Codecov Report
@@ Coverage Diff @@
## branch-22.04 #10154 +/- ##
================================================
+ Coverage 10.37% 10.48% +0.10%
================================================
Files 119 122 +3
Lines 20149 20493 +344
================================================
+ Hits 2091 2148 +57
- Misses 18058 18345 +287
CC @GregoryKimball who sort of asked for this feature
Rest looks good
@gpucibot merge
Most cuIO benchmarks use dataframes of fixed size as input. Once a dataframe is written to a file in a given format, the file size can vary greatly depending on the encoding and compression.
This PR adds a counter that outputs the file size, since it is often correlated with the performance of readers/writers.
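
To illustrate why the encoded size is worth reporting alongside the in-memory size, here is a minimal standalone sketch. It does not use cuIO or Google Benchmark: `counter_map` is a hypothetical stand-in for `state.counters`, and `encode_as_text`, `encode_as_binary`, and `record_sizes` are made-up helpers, not part of this PR. It encodes the same fixed-size column two ways and records each resulting size as a counter.

```cpp
#include <cstddef>
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for benchmark::State::counters (name -> value).
using counter_map = std::map<std::string, double>;

// Encode a column as CSV-like text: one decimal value per line.
std::string encode_as_text(std::vector<std::int32_t> const& column)
{
  std::string out;
  for (auto const v : column) {
    out += std::to_string(v);
    out += '\n';
  }
  return out;
}

// Encode a column as fixed-width little-endian binary: 4 bytes per value.
std::string encode_as_binary(std::vector<std::int32_t> const& column)
{
  std::string out;
  out.reserve(column.size() * sizeof(std::int32_t));
  for (auto const v : column) {
    auto const u = static_cast<std::uint32_t>(v);
    for (int shift = 0; shift < 32; shift += 8) {
      out += static_cast<char>((u >> shift) & 0xFF);
    }
  }
  return out;
}

// Record the in-memory size and both encoded sizes as counters,
// mirroring the idea of reporting an encoded file size per benchmark run.
counter_map record_sizes(std::vector<std::int32_t> const& column)
{
  counter_map counters;
  counters["in_memory_size"] =
    static_cast<double>(column.size() * sizeof(std::int32_t));
  counters["text_encoded_size"] =
    static_cast<double>(encode_as_text(column).size());
  counters["binary_encoded_size"] =
    static_cast<double>(encode_as_binary(column).size());
  return counters;
}
```

Calling `record_sizes` on a column of small values gives a text encoding smaller than the binary one, while a column of large values gives the opposite, even though the in-memory size depends only on the row count. That is the variation the new counter is meant to expose.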