[BUG] Sum and multiply aggregations promote unsigned input types to a signed output #10149

jlowe · 2022-01-27T19:24:04Z

Describe the bug
When performing aggregations, the output types are often upscaled to help combat overflow situations. For example, performing a sum aggregation on an INT32 column will produce an INT64 result. However performing a sum aggregation on a UINT32 column produces an INT64 result rather than a UINT64 result.

Steps/Code to reproduce bug
Perform a sum aggregation with an input column of UINT32 and note that the result is INT64. Here's a snippet of a session doing this with the cudf Java API in the Spark REPL shell:

scala> import ai.rapids.cudf._
import ai.rapids.cudf._

scala> val t = new Table(ColumnVector.fromInts(0), ColumnVector.fromUnsignedInts(0))
t: ai.rapids.cudf.Table = Table{columns=[ColumnVector{rows=1, type=INT32, nullCount=Optional[0], offHeap=(ID: 5 7feec1b5ac90)}, ColumnVector{rows=1, type=UINT32, nullCount=Optional[0], offHeap=(ID: 9 7feec1b5a280)}], cudfTable=140663428850592, rows=1}

scala> t.groupBy(0).aggregate(GroupByAggregation.sum().onColumn(1))
res0: ai.rapids.cudf.Table = Table{columns=[ColumnVector{rows=1, type=INT32, nullCount=Optional.empty, offHeap=(ID: 10 7ff0b7039860)}, ColumnVector{rows=1, type=INT64, nullCount=Optional.empty, offHeap=(ID: 11 7ff0b7039760)}], cudfTable=140671839344304, rows=1}

Expected behavior
Unsigned input types should be promoted to unsigned output types for any aggregations where the sign of the result cannot change for unsigned inputs (e.g.: sum and multiply)

Additional context
See @jrhemstad's comment at #10102 (comment)

The text was updated successfully, but these errors were encountered:

github-actions · 2022-02-26T20:02:49Z

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

github-actions · 2022-05-27T20:02:53Z

This issue has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

GregoryKimball · 2022-06-30T18:41:30Z

From #10102 (comment):

I think the machinery in question was added before unsigned support and then was just never updated. It should be updated to use uint64 for unsigned integer types:

cudf/cpp/include/cudf/detail/aggregation/aggregation.hpp

Lines 1140 to 1147 in 1246116

    
           // Summing/Multiplying integers of any type, always use int64_t accumulator 
        
           template <typename Source, aggregation::Kind k> 
        
           struct target_type_impl< 
        
             Source, 
        
             k, 
        
             std::enable_if_t<std::is_integral<Source>::value && is_sum_product_agg(k)>> { 
        
             using type = int64_t; 
        
           };

could be updated to:

// Summing/Multiplying integers of any type, always use int64_t accumulator
template <typename Source, aggregation::Kind k>
struct target_type_impl<
  Source,
  k,
  std::enable_if_t<std::is_integral<Source>::value && is_sum_product_agg(k)>> {
  using type = std::conditional_t<std::is_signed_v<Source>, int64_t, uint64_t>>;
};

… Unsigned Output for Sum and Multiply (#14679) During aggregation, output types are modified to prevent overflow. Presently, summing INT32 yields INT64, but summing UINT32 still results in INT64 instead of UINT64. This pull request resolves Issue #[10149](#10149) to ensure the correct output type is used when summing or multiplying integers. Authors: - Suraj Aralihalli (https://github.com/SurajAralihalli) - Karthikeyan (https://github.com/karthikeyann) - Nghia Truong (https://github.com/ttnghia) Approvers: - Nghia Truong (https://github.com/ttnghia) - Shruti Shivakumar (https://github.com/shrshi) - Karthikeyan (https://github.com/karthikeyann) URL: #14679

GregoryKimball · 2024-02-01T05:29:26Z

The work in #14679 to address this issue ended up needed to be reverted in #14907 due to a performance regression reported in #14886.

In addition to adding back the changes in #14679, we also need to:

add specializations in device_atomics.cuh for uint32_t and uint64_t as per this comment
for clarity, update namespace for atomicAdd in aggregation.cuh to specify the cudf::detail:: namespace
also for clarity, extend the cudf::detail:: namespace updates to atomicMul atomicMin atomicMax in aggregation.cuh

bdice · 2024-02-03T05:08:10Z

I started an experiment in this direction before I re-read this issue and realized @SurajAralihalli was assigned here. With apologies to @SurajAralihalli, I think I have a good start on the atomics refactoring in #14962. I would like to get that PR merged first, because it should be a standalone improvement, and then we can revisit the changes that were originally reverted.

karthikeyann · 2024-02-03T06:12:02Z

The revert can be undone after merging #14962. I tested a similar fix while debugging this issue with @SurajAralihalli .
Although, a thorough testing on other types in similar scenario is required to ensure other bugs are not hidden. (perhaps test chrono types, decimal types).

SurajAralihalli · 2024-02-03T21:53:11Z

I started an experiment in this direction before I re-read this issue and realized @SurajAralihalli was assigned here. With apologies to @SurajAralihalli, I think I have a good start on the atomics refactoring in #14962. I would like to get that PR merged first, because it should be a standalone improvement, and then we can revisit the changes that were originally reverted.

Thanks @bdice for letting me know!

…operators to detail namespace. (#14962) This PR does a thorough refactoring of `device_atomics.cuh`. - I moved all atomic-related functions to `cudf::detail::` (making this an API-breaking change, but most likely a low-impact break) - I added all missing operators for natively supported types to `atomicAdd`, `atomicMin`, `atomicMax`, etc. as discussed in #10149 and #14907. - This should prevent fallback to the `atomicCAS` path for types that are natively supported for those atomic operators, which we suspect as the root cause of the performance regression in #14886. - I kept `atomicAdd` rather than `cudf::detail::atomic_add` in locations where a native CUDA overload exists, and the same for min/max/CAS operations. Aggregations are the only place where we use the special overloads. We were previously calling the native CUDA function rather than our special overloads in many cases, so I retained the previous behavior. This avoids including the additional headers that implement an unnecessary level of wrapping for natively supported overloads. - I enabled native 2-byte CAS operations (on `unsigned short int`) that eliminate the do-while loop and extra alignment-checking logic - The CUDA docs don't state this, but some forum posts claim this is only supported by compute capability 7.0+. We now have 7.0 as a lower bound for RAPIDS so I'm not concerned by this as long as builds/tests pass. - I improved/cleaned the documentation and moved around some code so that the operators were in a logical order. - I assessed the existing tests and it looks like all the types are being covered. I'm not sure if there is a good way to enforce that certain types (like `uint64_t`) are passing through native `atomicAdd` calls. Authors: - Bradley Dice (https://github.com/bdice) Approvers: - David Wendt (https://github.com/davidwendt) - Suraj Aralihalli (https://github.com/SurajAralihalli) URL: #14962

jlowe added bug Something isn't working Needs Triage Need team to review and classify libcudf Affects libcudf (C++/CUDA) code. labels Jan 27, 2022

jlowe mentioned this issue Jan 27, 2022

[FEA] Support SUM aggregation (groupby and reduction at least) with upcasting the output. #10102

Closed

github-actions bot added the inactive-30d label Feb 26, 2022

github-actions bot added the inactive-90d label May 27, 2022

GregoryKimball added Spark Functionality that helps Spark RAPIDS and removed Needs Triage Need team to review and classify labels Jun 26, 2022

GregoryKimball added the good first issue Good for newcomers label Jun 30, 2022

github-actions bot removed the inactive-90d label Jun 30, 2022

SurajAralihalli mentioned this issue Dec 28, 2023

Fix Aggregation Type Promotion: Ensure Unsigned Input Types Result in Unsigned Output for Sum and Multiply #14679

Merged

3 tasks

GregoryKimball closed this as completed Jan 24, 2024

GregoryKimball reopened this Feb 1, 2024

GregoryKimball assigned SurajAralihalli Feb 1, 2024

GregoryKimball removed the good first issue Good for newcomers label Feb 1, 2024

bdice mentioned this issue Feb 3, 2024

Add missing atomic operators, refactor atomic operators, move atomic operators to detail namespace. #14962

Merged

3 tasks

vyasr removed the inactive-30d label Feb 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Sum and multiply aggregations promote unsigned input types to a signed output #10149

[BUG] Sum and multiply aggregations promote unsigned input types to a signed output #10149

jlowe commented Jan 27, 2022

github-actions bot commented Feb 26, 2022

github-actions bot commented May 27, 2022

GregoryKimball commented Jun 30, 2022

GregoryKimball commented Feb 1, 2024 •

edited

Loading

bdice commented Feb 3, 2024

karthikeyann commented Feb 3, 2024

SurajAralihalli commented Feb 3, 2024

[BUG] Sum and multiply aggregations promote unsigned input types to a signed output #10149

[BUG] Sum and multiply aggregations promote unsigned input types to a signed output #10149

Comments

jlowe commented Jan 27, 2022

github-actions bot commented Feb 26, 2022

github-actions bot commented May 27, 2022

GregoryKimball commented Jun 30, 2022

GregoryKimball commented Feb 1, 2024 • edited Loading

bdice commented Feb 3, 2024

karthikeyann commented Feb 3, 2024

SurajAralihalli commented Feb 3, 2024

GregoryKimball commented Feb 1, 2024 •

edited

Loading