Refine GpuHashAggregateExec.setupReference #2917

sperlingxx · 2021-07-13T11:28:45Z

Signed-off-by: sperlingxx [email protected]

Current PR refined GpuHashAggregateExec.setupReference to support TypedImperativeAggregate, which is required by #2916 . The primary work of current PR is reorganizing the code of boundInputReferences.

Originally, we workaround PartialMerge modes when we construct boundInputReferences. To be specific, the code piece dealing with aggregations which contain both PartialMerge and Partial mode is not generic enough. It regards GpuPivotFirst and GpuAverage as exceptional cases:

      case Partial =>
        // Partial with distinct case
        val updateExpressionsCudfAggsDistinct =
          updateExpressionsDistinct.filter(_.isInstanceOf[CudfAggregate])
              .map(_.asInstanceOf[CudfAggregate].ref)
        if (inputProjectionsDistinct.exists(p => !p.isInstanceOf[NamedExpression])) {
          // Case of distinct average we need to evaluate the "GpuCast and GpuIsNotNull" columns.
          // Refer to how input projections are setup for GpuAverage.
          // In the case where we have expressions to evaluate, pick the unique attributes
          // references from them as you only have one column for it before you start evaluating.
          distinctExpressions = inputProjectionsDistinct
          distinctAttributes = inputProjectionsDistinct.flatMap(ref =>
            ref.references.toSeq).distinct
        } else {
          distinctAttributes = updateAttributesDistinct
          distinctExpressions = updateExpressionsCudfAggsDistinct
        }

With the new implementation, we avoid all kinds of workaround. Instead, we fit all kinds of aggregation modes into three categories by the stage of AggregateExec in aggregation stack:

The first stage of aggregation stack, in terms of aggregation modes, it may contain aggregate expressions with Partial or Complete mode. It may also contain no aggregate expressions. For the first aggregation stage, the input projections are necessary, because it consume the outputs of non-Aggregate plans.
The final-like stages, including three conditions:
- the final (last) stage of aggregation stack
- the (partial) merge stage merely for nonDistinctAggAttributes (the second stage of aggregation stack with one distinct)
For final-like stages, we just pass through all output attributes of child plan.
The third stage of aggregation stack with one distinct, in terms of aggregation modes, it contains both Partial mode (for distinctAgg) and PartialMerge mode (for nonDistinctAgg). For this stage, we need to switch the position of distinctAttributes and nonDistinctAttributes to match the output schema of the previous stage:
- the schema of the 2nd stage's outputs: groupingAttributes ++ distinctAttributes ++ nonDistinctAggBufferAttributes
- the schema of the 3rd stage's aggregate expressions: nonDistinctMergeAggExpressions ++ distinctPartialAggExpressions

sperlingxx · 2021-07-13T11:30:37Z

build

abellina · 2021-07-13T13:25:39Z

Could you add to the description for this PR around what exactly was changed and why? Please also note this PR is in flight: #2859, and also makes changes to setupReferences.

Signed-off-by: sperlingxx <[email protected]>

sperlingxx · 2021-07-14T07:46:34Z

build

sperlingxx · 2021-07-14T07:48:31Z

Could you add to the description for this PR around what exactly was changed and why? Please also note this PR is in flight: #2859, and also makes changes to setupReferences.

I added the description. And I rebased this branch onto latest main branch.

Signed-off-by: sperlingxx <[email protected]>

sperlingxx · 2021-07-14T10:53:34Z

build

revans2 · 2021-07-14T14:07:59Z

@abellina and @jlowe could both of you take a look at this? You have a lot more experience with the aggregate code than I do. It looks okay to me, but I don't feel 100% qualified to approve it.

abellina · 2021-07-14T14:16:17Z

@revans2 yeap, I am looking.

abellina

Thanks for the added details @sperlingxx. I am mostly curious on the update expressions, as we used to use them before in the PartialMerge + Partial case https://github.com/NVIDIA/spark-rapids/blob/branch-21.08/sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala#L679. I think the comments need to be adjusted but it would be nice to make sure that you found that input projections is all that is needed in that case (perhaps the shapes match?)

Tagging @kuhushukla and @nartal1 as they may be more familiar with both distinct and pivot.

Had other minor nits.

sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala

abellina · 2021-07-14T14:54:59Z

sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala

-    // Pick merge non-distinct for PartialMerge
-    val mergeExpressionsNonDistinct =
-      nonDistinctAggExpressions
+    // - PartialMerge with Partial mode: we use the inputProjections or distinct update expressions


Comments mention distinct update expressions, but I am not finding the use of update expressions in your change, only input projections.

I updated this part along with comments, in order to verify and clarify "input projections is all that is needed in that case".

abellina · 2021-07-14T15:02:53Z

sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala

+    // - Final or PartialMerge-only mode: we pick the columns in the order as handed to us.
+    // - Partial or Complete mode: we use the inputProjections or distinct update expressions.
+    val boundInputReferences =
+    if (modeInfo.hasPartialMerge && modeInfo.uniqueModes.contains(Partial)) {


Suggested change

if (modeInfo.hasPartialMerge && modeInfo.uniqueModes.contains(Partial)) {

if (modeInfo.hasPartialMerge && modeInfo.hasPartialMode) {

bonus points => hasPartialMerge => hasPartialMergeMode

We used modeInfo.uniqueModes.contains(Partial) instead of modeInfo.hasPartialMode here, because hasPartialMode is set as hasPartialMerge || uniqueModes.contains(Partial). I adjusted AggregateModeInfo to make its members more consistent with their names.

Signed-off-by: sperlingxx <[email protected]>

sperlingxx · 2021-07-15T07:03:59Z

build

abellina

I had a couple of nits, but they are minor. Approving ahead of time. Thanks for making the tweaks.

sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala

Co-authored-by: Alessandro Bellina <[email protected]>

sperlingxx · 2021-07-16T01:19:51Z

build

abellina

LGTM

sperlingxx requested review from abellina, revans2 and nartal1 July 13, 2021 11:28

sperlingxx mentioned this pull request Jul 13, 2021

[FEA] Support GpuCollectList and GpuCollectSet as TypedImperativeAggregate #2916

Closed

sperlingxx added feature request New feature or request task Work required that improves the product but is not user facing and removed feature request New feature or request labels Jul 13, 2021

rework GpuHashAgg.setupRef

3dcce50

Signed-off-by: sperlingxx <[email protected]>

sperlingxx force-pushed the rework_setup_ref branch from 46559ff to 3dcce50 Compare July 14, 2021 06:22

refine

cf39647

Signed-off-by: sperlingxx <[email protected]>

fix

0af7991

Signed-off-by: sperlingxx <[email protected]>

abellina requested changes Jul 14, 2021

View reviewed changes

refine

ca40f1a

Signed-off-by: sperlingxx <[email protected]>

abellina previously approved these changes Jul 15, 2021

View reviewed changes

sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala Outdated Show resolved Hide resolved

sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala Outdated Show resolved Hide resolved

Update sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala

d9e3ea1

Co-authored-by: Alessandro Bellina <[email protected]>

sperlingxx dismissed abellina’s stale review via d9e3ea1 July 16, 2021 01:18

Update sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala

1bb0e81

Co-authored-by: Alessandro Bellina <[email protected]>

sperlingxx requested a review from abellina July 16, 2021 01:19

abellina approved these changes Jul 16, 2021

View reviewed changes

sperlingxx merged commit 25197b8 into NVIDIA:branch-21.08 Jul 16, 2021

sperlingxx deleted the rework_setup_ref branch July 16, 2021 03:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refine GpuHashAggregateExec.setupReference #2917

Refine GpuHashAggregateExec.setupReference #2917

sperlingxx commented Jul 13, 2021 •

edited

Loading

sperlingxx commented Jul 13, 2021

abellina commented Jul 13, 2021

sperlingxx commented Jul 14, 2021

sperlingxx commented Jul 14, 2021

sperlingxx commented Jul 14, 2021

revans2 commented Jul 14, 2021

abellina commented Jul 14, 2021

abellina left a comment

abellina Jul 14, 2021

sperlingxx Jul 15, 2021

abellina Jul 14, 2021

abellina Jul 14, 2021

sperlingxx Jul 15, 2021

sperlingxx commented Jul 15, 2021

abellina left a comment

sperlingxx commented Jul 16, 2021

abellina left a comment

	if (modeInfo.hasPartialMerge && modeInfo.uniqueModes.contains(Partial)) {
	if (modeInfo.hasPartialMerge && modeInfo.hasPartialMode) {

Refine GpuHashAggregateExec.setupReference #2917

Refine GpuHashAggregateExec.setupReference #2917

Conversation

sperlingxx commented Jul 13, 2021 • edited Loading

sperlingxx commented Jul 13, 2021

abellina commented Jul 13, 2021

sperlingxx commented Jul 14, 2021

sperlingxx commented Jul 14, 2021

sperlingxx commented Jul 14, 2021

revans2 commented Jul 14, 2021

abellina commented Jul 14, 2021

abellina left a comment

Choose a reason for hiding this comment

abellina Jul 14, 2021

Choose a reason for hiding this comment

sperlingxx Jul 15, 2021

Choose a reason for hiding this comment

abellina Jul 14, 2021

Choose a reason for hiding this comment

abellina Jul 14, 2021

Choose a reason for hiding this comment

sperlingxx Jul 15, 2021

Choose a reason for hiding this comment

sperlingxx commented Jul 15, 2021

abellina left a comment

Choose a reason for hiding this comment

sperlingxx commented Jul 16, 2021

abellina left a comment

Choose a reason for hiding this comment

sperlingxx commented Jul 13, 2021 •

edited

Loading