[REVIEW] Adding floating point specialization to comparators for NaNs #3239

rgsl888prabhu · 2019-10-29T20:35:02Z

Added floating point specialization to comparators for NaNs and test cases to evaluate.

codecov · 2019-10-29T22:11:15Z

Codecov Report

Merging #3239 into branch-0.11 will not change coverage.
The diff coverage is n/a.

@@             Coverage Diff              @@
##           branch-0.11    #3239   +/-   ##
============================================
  Coverage        87.13%   87.13%           
============================================
  Files               49       49           
  Lines             9269     9269           
============================================
  Hits              8077     8077           
  Misses            1192     1192

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6bca9f6...ef59466. Read the comment docs.

harrism

I have some concerns about the semantics and the overheads.

cpp/include/cudf/table/row_operators.cuh

jrhemstad · 2019-10-30T14:39:13Z

@rgsl888prabhu a pattern like this is more along the lines of what I was envisioning:
https://wandbox.org/permlink/BEXfhMZWdwDKBprS

template <typename Element, std::enable_if_t<cudf::is_relationally_comparable<
                                  Element, Element>()>* = nullptr>
  __device__ weak_ordering operator()(size_type lhs_element_index,
                                      size_type rhs_element_index) const
      noexcept {
    if (has_nulls) {
      bool const lhs_is_null{lhs.nullable() and lhs.is_null(lhs_element_index)};
      bool const rhs_is_null{rhs.nullable() and rhs.is_null(rhs_element_index)};

      if (lhs_is_null and rhs_is_null) {  // null <? null
        return weak_ordering::EQUIVALENT;
      } else if (lhs_is_null) {  // null <? x
        return (null_precedence == null_order::BEFORE) ? weak_ordering::LESS
                                                       : weak_ordering::GREATER;
      } else if (rhs_is_null) {  // x <? null
        return (null_precedence == null_order::AFTER) ? weak_ordering::LESS
                                                      : weak_ordering::GREATER;
      }
    }

    Element const lhs_element = lhs.element<Element>(lhs_element_index);
    Element const rhs_element = rhs.element<Element>(rhs_element_index);

    return compare(lhs_element, rhs_element);
  }

This should vastly simplify the implementation you have here.

rgsl888prabhu · 2019-10-30T14:47:22Z

@rgsl888prabhu a pattern like this is more along the lines of what I was envisioning:
https://wandbox.org/permlink/BEXfhMZWdwDKBprS

template <typename Element, std::enable_if_t<cudf::is_relationally_comparable<
                                  Element, Element>()>* = nullptr>
  __device__ weak_ordering operator()(size_type lhs_element_index,
                                      size_type rhs_element_index) const
      noexcept {
    if (has_nulls) {
      bool const lhs_is_null{lhs.nullable() and lhs.is_null(lhs_element_index)};
      bool const rhs_is_null{rhs.nullable() and rhs.is_null(rhs_element_index)};

      if (lhs_is_null and rhs_is_null) {  // null <? null
        return weak_ordering::EQUIVALENT;
      } else if (lhs_is_null) {  // null <? x
        return (null_precedence == null_order::BEFORE) ? weak_ordering::LESS
                                                       : weak_ordering::GREATER;
      } else if (rhs_is_null) {  // x <? null
        return (null_precedence == null_order::AFTER) ? weak_ordering::LESS
                                                      : weak_ordering::GREATER;
      }
    }

    Element const lhs_element = lhs.element<Element>(lhs_element_index);
    Element const rhs_element = rhs.element<Element>(rhs_element_index);

    return compare(lhs_element, rhs_element);
  }

This should vastly simplify the implementation you have here.

@jrhemstad So, we will have two sets of functions one for floating point and other for non-floating type.

jrhemstad · 2019-10-30T14:51:55Z

@jrhemstad So, we will have two sets of functions one for floating point and other for non-floating type.

Yes, but it's limited to the logic that needs to be specialized between floating/non-floating types.

cpp/include/cudf/table/row_operators.cuh

jrhemstad

One small change, otherwise looks great!

cpp/include/cudf/table/row_operators.cuh

jrhemstad

I think you just need to merge the latest branch-0.11 into your PR to get CI to pass.

rgsl888prabhu · 2019-10-31T20:41:33Z

@harrism

harrism

Very clean solution.

harrism · 2019-11-01T05:37:37Z

cpp/include/cudf/table/row_operators.cuh

+* @brief A specialization for floating-point `Element` type rerlational comparison
+* to derive the order of the elements with respect to `lhs`. Specialization is to
+* handle `nan` in the order shown below.
+* `[-Inf, -ve, 0, -0, +ve, +Inf, NaN, NaN, null] (for null_order::AFTER)`


Why does NaN appear twice in these?

To show NaN == NaN

code changes and test cases

205f4c2

rgsl888prabhu requested review from a team as code owners October 29, 2019 20:35

CHANGELOG.md

0aa6b5c

rgsl888prabhu self-assigned this Oct 29, 2019

rgsl888prabhu changed the title ~~[WIP] Adding floating point specialization to comparators for NaNs~~ [REVIEW] Adding floating point specialization to comparators for NaNs Oct 29, 2019

rgsl888prabhu added 3 - Ready for Review Ready for review by team libcudf++ labels Oct 29, 2019

harrism requested changes Oct 30, 2019

View reviewed changes

rgsl888prabhu added 2 commits October 30, 2019 10:44

review changes

cc5b04d

typo

0295f45

jrhemstad requested changes Oct 30, 2019

View reviewed changes

cpp/include/cudf/table/row_operators.cuh Outdated Show resolved Hide resolved

cpp/include/cudf/table/row_operators.cuh Outdated Show resolved Hide resolved

cpp/include/cudf/table/row_operators.cuh Outdated Show resolved Hide resolved

review changes

ec3d914

jrhemstad requested changes Oct 31, 2019

View reviewed changes

cpp/include/cudf/table/row_operators.cuh Outdated Show resolved Hide resolved

using cuda isnan

3e6770b

jrhemstad approved these changes Oct 31, 2019

View reviewed changes

merging with 0.11

2938ce2

jrhemstad requested a review from harrism October 31, 2019 20:42

harrism approved these changes Nov 1, 2019

View reviewed changes

Merge branch 'branch-0.11' into 3226_adding_floating_pt_spclization

ef59466

jrhemstad merged commit 4ecaeae into rapidsai:branch-0.11 Nov 1, 2019

jrhemstad mentioned this pull request Nov 1, 2019

[BUG] nan_as_null parameter affects output of sort_values. #2191

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[REVIEW] Adding floating point specialization to comparators for NaNs #3239

[REVIEW] Adding floating point specialization to comparators for NaNs #3239

rgsl888prabhu commented Oct 29, 2019

codecov bot commented Oct 29, 2019 •

edited

Loading

harrism left a comment

jrhemstad commented Oct 30, 2019

rgsl888prabhu commented Oct 30, 2019

jrhemstad commented Oct 30, 2019

jrhemstad left a comment

jrhemstad left a comment

rgsl888prabhu commented Oct 31, 2019

harrism left a comment

harrism Nov 1, 2019

jrhemstad Nov 1, 2019

[REVIEW] Adding floating point specialization to comparators for NaNs #3239

[REVIEW] Adding floating point specialization to comparators for NaNs #3239

Conversation

rgsl888prabhu commented Oct 29, 2019

codecov bot commented Oct 29, 2019 • edited Loading

Codecov Report

harrism left a comment

Choose a reason for hiding this comment

jrhemstad commented Oct 30, 2019

rgsl888prabhu commented Oct 30, 2019

jrhemstad commented Oct 30, 2019

jrhemstad left a comment

Choose a reason for hiding this comment

jrhemstad left a comment

Choose a reason for hiding this comment

rgsl888prabhu commented Oct 31, 2019

harrism left a comment

Choose a reason for hiding this comment

harrism Nov 1, 2019

Choose a reason for hiding this comment

jrhemstad Nov 1, 2019

Choose a reason for hiding this comment

codecov bot commented Oct 29, 2019 •

edited

Loading