Support Unary Operations in Masked UDF #9409
Conversation
Codecov Report

```diff
@@            Coverage Diff             @@
##        branch-21.12    #9409      +/-   ##
================================================
+ Coverage      10.79%   11.05%   +0.26%
================================================
  Files            116      117       +1
  Lines          18869    20250    +1381
================================================
+ Hits            2036     2239     +203
- Misses         16833    18011    +1178
```

Continue to review the full report at Codecov.
@isVoid @brandon-b-miller Thanks for tagging me on this! It's a really interesting part of the code. I have a few comments/questions.
Co-authored-by: Bradley Dice <[email protected]>
This looks good on the whole - it looks like you've got to grips with extending UDF support with relative ease! There are a couple of comments on the diff on expanding the set of supported functions and tightening up the typing a bit, but I don't see anything major that needs changing.
```diff
@@ -10,6 +11,26 @@
     operator.pow,
 ]

+unary_ops = [
```
All the unary math ops supported by the CUDA target can be found in Numba's cudamath.py, starting at this line: https://github.com/numba/numba/blob/master/numba/cuda/cudamath.py#L10 - it may be worth adding the complete set?
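As a sketch of what registering "the complete set" might look like (the list below is illustrative and hand-collected from the unary functions typed in Numba's `cudamath.py`; the linked file is authoritative), the expanded `unary_ops` list could be:

```python
import math

# Hypothetical expansion of the unary_ops list to cover the unary math
# functions typed for the CUDA target in Numba's cudamath.py.
# Sketch only; verify each entry against the linked file before merging.
unary_ops = [
    math.acos, math.acosh, math.asin, math.asinh,
    math.atan, math.atanh,
    math.cos, math.cosh, math.sin, math.sinh,
    math.tan, math.tanh,
    math.ceil, math.floor, math.trunc,
    math.degrees, math.radians,
    math.exp, math.expm1,
    math.log, math.log10, math.log1p,
    math.fabs, math.sqrt,
    math.erf, math.erfc, math.gamma, math.lgamma,
]

# Every entry is a unary callable from the math module.
assert all(callable(op) for op in unary_ops)
```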
I was able to get many unary ops in, except `math.trunc`. It raises:

```
NotImplementedError: No definition for lowering <built-in function trunc>(float64,) -> float64
```

I believe it should have been typed as `trunc(float64) -> int64`; maybe something is not registered correctly?
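For context on why `trunc(float64) -> int64` is the expected signature: in plain CPython, `math.trunc` on a float returns an `int`, which is the behavior the CUDA typing would mirror.

```python
import math

# In CPython, math.trunc maps a float to an int, which is why a
# trunc(float64) -> int64 CUDA signature is the expected typing
# (rather than the float64 -> float64 seen in the error above).
result = math.trunc(3.9)
assert result == 3
assert isinstance(result, int)

# Negative values truncate toward zero, unlike floor.
assert math.trunc(-3.9) == -3
assert math.floor(-3.9) == -4
```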
Besides, `math.log` appears to be both a binary op and a unary op. Can we simply register `math.log` in binaryop as well as in unaryop to support both usages?

Lastly, is there a place for all the `operator` ops? Sorry for cramming all the questions in one place!
For `trunc`, the lack of an implementation in CUDA might be a bug in Numba - I'll check into it and get back to you.

For `log` with two arguments, this appears not to be supported by the CUDA target (probably not for any good technical reason). If the CUDA target did support it, just registering `log` as both a unary and a binary op would work, because when one typing fails, Numba carries on trying others until it finds a successful one.
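To illustrate the dual arity in question: in plain Python, `math.log` is unary (natural log) with one argument and binary (log in an arbitrary base) with two, which is why it would need to appear in both op lists.

```python
import math

# math.log is unary with one argument (natural logarithm)...
assert math.log(math.e) == 1.0

# ...and binary with two (logarithm of x in the given base).
assert abs(math.log(8, 2) - 3.0) < 1e-12
```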
For all the operators, there's https://github.com/numba/numba/blob/master/numba/cpython/numbers.py - you have to look for all the instances of `lower_builtin` in that file. It's the exact same code as the CPU target, so it isn't duplicated in the CUDA target; instead, the CUDA target pulls it in by "magic"... I started trying to trace exactly why the typing is registered for the CUDA target, but I went through several layers and still didn't get to the bottom of it. However, the lowering is pulled in as a side effect of this import in the CUDA target context: https://github.com/numba/numba/blob/master/numba/cuda/target.py#L88
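For reference, the unary callables from the `operator` module that such a list might register look like this in plain Python (the list name is illustrative; `neg` and `invert` come up in this thread, the rest are candidate additions):

```python
import operator

# Candidate unary ops from the operator module (sketch; which of these
# the CUDA target actually lowers must be checked case by case).
unary_operator_ops = [
    operator.neg,     # -x
    operator.pos,     # +x
    operator.invert,  # ~x (bitwise NOT, integers only)
    operator.abs,     # abs(x)
    operator.not_,    # not x
]

assert operator.neg(5) == -5
assert operator.invert(0) == -1   # ~0 == -1 in two's complement
assert operator.abs(-3) == 3
assert operator.not_(True) is False
```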
Thanks for pointing out the `operator` locations. I was able to add `invert`. For `abs`, there's an error:

```
TypingError: No implementation of function Function(<built-in function abs>) found for signature:
 >>> abs(int32)
```

Which seems strange, because the lowering for integer types is here: https://github.com/numba/numba/blob/2a792155c3dce43f86b9ff93802f12d39a3752dc/numba/cpython/numbers.py#L565
Unless... it's not registered for the CUDA target?
rerun tests

rerun tests

@gpucibot merge

thanks @isVoid ! 🚀
This PR adds support for several unary operations in masked UDFs, including the trigonometric functions `sin`, `cos`, and `tan`; the rounding functions `ceil` and `floor`; the sign function `neg`; and the logical function `not`.

Closes #9405
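The null-propagation semantics these masked UDFs target can be sketched in pure Python (the function name and the `(value, valid)` pair representation below are illustrative, not cudf's actual implementation, which compiles the UDF with Numba for the GPU):

```python
import math

def apply_masked_unary(op, value, valid):
    # Sketch of masked-UDF semantics: if the input element is null
    # (valid == False), the result is null too and the op is never
    # evaluated; otherwise the unary op is applied to the value.
    if not valid:
        return None, False
    return op(value), True

# A valid input is transformed; a null input stays null.
assert apply_masked_unary(math.sin, 0.0, True) == (0.0, True)
assert apply_masked_unary(math.sin, 123.0, False) == (None, False)
```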