Cleanup ExternalSorter metrics (#5885) #6364

tustvold · 2023-05-16T14:23:20Z

Which issue does this PR close?

Part of #5885
Part of #5108

Rationale for this change

In preparation for improving the memory accounting in ExternalSorter / SortPreservingMerge, I first wanted to sanitise what already existed. I will call out the various changes in the PR.

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

mem_used, spill_count and spilled_bytes are no longer included in BaselineMetrics, as they were only used by ExternalSorter.

CompositeMetricsSet and MemTrackingMetrics have been removed.

tustvold · 2023-05-16T14:23:43Z

datafusion/core/src/physical_plan/metrics/baseline.rs

-    /// count of spills during the execution of the operator
-    spill_count: Count,
-
-    /// total spilled bytes during the execution of the operator
-    spilled_bytes: Count,
-
-    /// current memory usage for the operator
-    mem_used: Gauge,


These were only used by ExternalSorter, so I figure they aren't really baseline metrics.

tustvold · 2023-05-16T14:25:20Z

datafusion/core/src/physical_plan/metrics/composite.rs

-/// multiple in-mem sort metrics and final merge-sort metrics from `SortPreservingMergeStream`.
-/// Therefore, We need a separation of metrics for which are final metrics (for output_rows accumulation),
-/// and which are intermediate metrics that we only account for elapsed_compute time.
-pub struct CompositeMetricsSet {


BaselineMetrics::intermediate removes the need for this.
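
The removed doc comment distinguishes final metrics (which accumulate output_rows) from intermediate metrics (which only account for elapsed_compute time). A minimal std-only sketch of that idea follows. `Metrics`, its fields and its `intermediate` method are hypothetical stand-ins rather than DataFusion's actual types, and the sketch assumes that an intermediate handle shares the compute-time counter while starting a fresh row counter.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;

/// Hypothetical stand-in for a baseline metrics set (not the DataFusion type).
#[derive(Debug, Default)]
struct Metrics {
    /// Shared between the final handle and any intermediate handles.
    elapsed_compute_ns: Arc<AtomicUsize>,
    /// Only meaningful on the final handle.
    output_rows: Arc<AtomicUsize>,
}

impl Metrics {
    /// Assumed behaviour: share `elapsed_compute_ns`, start a fresh `output_rows`.
    fn intermediate(&self) -> Metrics {
        Metrics {
            elapsed_compute_ns: Arc::clone(&self.elapsed_compute_ns),
            output_rows: Arc::default(),
        }
    }

    /// Adds compute time and produced rows to this handle's counters.
    fn record(&self, compute_ns: usize, rows: usize) {
        self.elapsed_compute_ns.fetch_add(compute_ns, Ordering::Relaxed);
        self.output_rows.fetch_add(rows, Ordering::Relaxed);
    }
}

/// Runs one intermediate in-memory sort and one final merge, returning
/// (elapsed_compute_ns, output_rows) as seen on the final handle.
fn intermediate_demo() -> (usize, usize) {
    let final_metrics = Metrics::default();
    let in_mem_sort = final_metrics.intermediate();

    // Intermediate work: its time counts, its rows must not be double counted.
    in_mem_sort.record(500, 100);
    // The final merge emits the same 100 rows to the operator's output.
    final_metrics.record(200, 100);

    (
        final_metrics.elapsed_compute_ns.load(Ordering::Relaxed),
        final_metrics.output_rows.load(Ordering::Relaxed),
    )
}
```

In this sketch the final handle sees the compute time of both phases but only the rows of the final merge, which is the separation a dedicated composite type was previously needed for.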

tustvold · 2023-05-16T14:27:19Z

datafusion/core/src/physical_plan/metrics/tracker.rs

-
-/// Wraps a [`BaselineMetrics`] and records memory usage on a [`MemoryReservation`]
-#[derive(Debug)]
-pub struct MemTrackingMetrics {


I think this construction may date from an earlier iteration of the memory tracking; as it stands now, it makes little sense.

The memory reported by the mem_used metric will be a somewhat arbitrary value: whatever the gauge last held at the point the plan finished. Additionally, there isn't any way to use the MemoryReservation in a fallible manner. It felt easier to just separate the concerns of reporting plan metrics from tracking runtime memory usage.
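
To make the separation concrete, here is a rough std-only sketch, not the code in this PR. `Budget`, `SpillMetrics` and `buffer_batches` are hypothetical names, and `Budget` is only a toy model of a fallible reservation, not DataFusion's `MemoryReservation`. The assumption is that the operator reacts to a failed reservation by spilling, and records that in metrics that know nothing about the budget.

```rust
/// Hypothetical memory budget (a toy model, not DataFusion's `MemoryReservation`).
struct Budget {
    limit: usize,
    used: usize,
}

impl Budget {
    /// Fallible: the caller learns the limit would be exceeded and can react.
    fn try_grow(&mut self, bytes: usize) -> Result<(), String> {
        if self.used + bytes > self.limit {
            return Err(format!(
                "cannot reserve {} bytes: {} of {} in use",
                bytes, self.used, self.limit
            ));
        }
        self.used += bytes;
        Ok(())
    }

    /// Releases everything currently reserved and returns how much that was.
    fn free(&mut self) -> usize {
        std::mem::take(&mut self.used)
    }
}

/// Plan-level metrics, kept separate from the runtime budget.
#[derive(Debug, Default)]
struct SpillMetrics {
    spill_count: usize,
    spilled_bytes: usize,
}

/// Buffers batches of the given sizes, spilling whenever a reservation fails.
/// Returns the metrics and the bytes still reserved when the input ends.
fn buffer_batches(batch_sizes: &[usize], limit: usize) -> Result<(SpillMetrics, usize), String> {
    let mut budget = Budget { limit, used: 0 };
    let mut metrics = SpillMetrics::default();
    for &size in batch_sizes {
        if budget.try_grow(size).is_err() {
            // Spill everything buffered so far, then retry the reservation.
            metrics.spill_count += 1;
            metrics.spilled_bytes += budget.free();
            budget.try_grow(size)?; // a single oversized batch is an error
        }
    }
    Ok((metrics, budget.used))
}
```

The second value returned by `buffer_batches` is whatever happened to be reserved when the input ran out, which is why reporting it as a plan metric says little about peak or typical usage.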

alamb · 2023-05-16T19:52:31Z

datafusion/core/src/physical_plan/sorts/sort.rs

@@ -56,6 +55,27 @@ use tempfile::NamedTempFile;
 use tokio::sync::mpsc::{Receiver, Sender};
 use tokio::task;

+struct ExternalSorterMetrics {


I agree it is much nicer to put the sorting metrics on the sorter rather than on BaselineMetrics.

alamb

This looks great to me. Thank you @tustvold.

I will also run the sort benchmark to make sure this doesn't cause a regression (I don't expect it will) and report back.

cc @yjshen, as I believe he contributed an early version of these metrics.

alamb · 2023-05-17T11:17:38Z

My benchmark run shows no change in performance, as expected:

--------------------
Benchmark sort.json
--------------------
┏━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query               ┃       sort ┃       sort ┃        Change ┃
┡━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ Qsort utf8          │ 63572.72ms │ 65037.97ms │     no change │
│ Qsort int           │ 77433.88ms │ 75017.20ms │     no change │
│ Qsort decimal       │ 65095.90ms │ 63913.09ms │     no change │
│ Qsort integer tuple │ 83230.11ms │ 80413.45ms │     no change │
│ Qsort utf8 tuple    │ 64949.20ms │ 61437.19ms │ +1.06x faster │
│ Qsort mixed tuple   │ 74037.84ms │ 70042.36ms │ +1.06x faster │
└─────────────────────┴────────────┴────────────┴───────────────┘

tustvold added the api change Changes the API exposed to users of the crate label May 16, 2023

github-actions bot added the core Core DataFusion crate label May 16, 2023

tustvold force-pushed the cleanup-sort-metrics branch from 93b24f0 to 09a798a Compare May 16, 2023 14:24

Cleanup ExternalSorter metrics (apache#5885)

eafefe4

tustvold force-pushed the cleanup-sort-metrics branch from 09a798a to eafefe4 Compare May 16, 2023 14:29

alamb reviewed May 16, 2023

alamb approved these changes May 16, 2023

tustvold merged commit cf81117 into apache:main May 17, 2023

richox pushed a commit to richox/arrow-datafusion that referenced this pull request May 29, 2023

blaze: Revert "Cleanup ExternalSorter metrics (apache#5885) (apache#6364)" This reverts commit cf81117.

c6ff0ea

richox pushed a commit to richox/arrow-datafusion that referenced this pull request May 29, 2023

blaze: Revert "Cleanup ExternalSorter metrics (apache#5885) (apache#6364)" This reverts commit cf81117.

a77f839

richox pushed a commit to richox/arrow-datafusion that referenced this pull request Jul 5, 2023

blaze: Revert "Cleanup ExternalSorter metrics (apache#5885) (apache#6364)" This reverts commit cf81117.

a836a07

richox pushed a commit to richox/arrow-datafusion that referenced this pull request Jul 5, 2023

blaze: Revert "Cleanup ExternalSorter metrics (apache#5885) (apache#6364)" This reverts commit cf81117.

fc65a6e
