Sort metrics alphabetically in EXPLAIN ANALYZE output #12568

progval · 2024-09-21T08:47:20Z

Which issue does this PR close?

Rationale for this change

From ParquetExec metrics, with predicate pushdown enabled:

metrics=[output_rows=0, elapsed_compute=96ns, row_groups_matched_bloom_filter=0, row_groups_matched_statistics=21050, file_scan_errors=0, pushdown_rows_matched=0, row_groups_pruned_statistics=173576, row_groups_pruned_bloom_filter=21050, file_open_errors=0, num_predicate_creation_errors=0, bytes_scanned=25023432248, pushdown_rows_pruned=0, page_index_rows_pruned=0, predicate_evaluation_errors=0, page_index_rows_matched=0, time_elapsed_scanning_until_data=16.622753ms, time_elapsed_processing=72.280463608s, pushdown_eval_time=382ns, page_index_eval_time=3.177676ms, time_elapsed_scanning_total=16.661811ms, time_elapsed_opening=102.989073638s]

For example, pushdown_rows_matched and pushdown_rows_pruned (highlighted in the snippet) are far away from each other, even though they refer to roughly the same thing.

The unstable sort also makes it hard to compare multiple EXPLAIN ANALYZE results.

after this change, metrics for the same query look like this:

metrics=[output_rows=0, elapsed_compute=96ns, bytes_scanned=25023432248, file_open_errors=0, file_scan_errors=0, num_predicate_creation_errors=0, page_index_rows_matched=0, page_index_rows_pruned=0, predicate_evaluation_errors=0, pushdown_rows_matched=0, pushdown_rows_pruned=0, row_groups_matched_bloom_filter=0, row_groups_matched_statistics=21050, row_groups_pruned_bloom_filter=21050, row_groups_pruned_statistics=173576, page_index_eval_time=2.882359ms, pushdown_eval_time=382ns, time_elapsed_opening=104.629010525s, time_elapsed_processing=73.86660138s, time_elapsed_scanning_total=97.929057ms, time_elapsed_scanning_until_data=97.893962ms]

What changes are included in this PR?

Refinement of the partial order used in MetricsSet::sorted_for_display

Are these changes tested?

Yes

Are there any user-facing changes?

More readable output. There doesn't seem to be any snippet in the documentation that needs to be updated to match the new behavior.

alamb

Thank you @progval -- this is a very nice improvement in my mind

Weijun-H

LGTM! Thanks @progval

github-actions bot added the physical-expr Physical Expressions label Sep 21, 2024

Sort metrics alphabetically in EXPLAIN ANALYZE output

7c5238a

progval force-pushed the sort-metrics branch from f296420 to 7c5238a Compare September 21, 2024 08:48

alamb approved these changes Sep 21, 2024

View reviewed changes

Weijun-H approved these changes Sep 22, 2024

View reviewed changes

Weijun-H merged commit 3bd41bc into apache:main Sep 22, 2024
24 checks passed

bgjackma pushed a commit to bgjackma/datafusion that referenced this pull request Sep 25, 2024

Sort metrics alphabetically in EXPLAIN ANALYZE output (apache#12568)

10c5127

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sort metrics alphabetically in EXPLAIN ANALYZE output #12568

Sort metrics alphabetically in EXPLAIN ANALYZE output #12568

progval commented Sep 21, 2024

alamb left a comment

Weijun-H left a comment

Sort metrics alphabetically in EXPLAIN ANALYZE output #12568

Sort metrics alphabetically in EXPLAIN ANALYZE output #12568

Conversation

progval commented Sep 21, 2024

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

alamb left a comment

Choose a reason for hiding this comment

Weijun-H left a comment

Choose a reason for hiding this comment