Enable segmented_gather in Java package #10669

Merged: 4 commits, Apr 19, 2022

sperlingxx
Contributor

This PR enables the cuDF API segmented_gather in the Java package. segmented_gather is essential for implementing Spark array functions such as arrays_zip (NVIDIA/spark-rapids#5229).
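
To illustrate what a segmented gather does, here is a minimal host-side sketch in plain C++. It is not the cuDF implementation and uses no cuDF types: `ListRow` and `segmented_gather_sketch` are hypothetical names made up for this example. It assumes that each row of the gather map indexes into the matching row of the source, that negative indices count from the end of the row, and that out-of-range indices yield a null element; the PR text itself does not state these details.

```cpp
#include <cstddef>
#include <optional>
#include <stdexcept>
#include <vector>

// One row of a list column; std::nullopt marks a null element.
using ListRow = std::vector<std::optional<int>>;

// For each row i, pick the elements of source[i] at the positions in gather_map[i].
// Assumptions of this sketch: negative indices count from the end of the row,
// and out-of-range indices produce a null element.
std::vector<ListRow> segmented_gather_sketch(
    const std::vector<ListRow>& source,
    const std::vector<std::vector<int>>& gather_map) {
  if (source.size() != gather_map.size()) {
    throw std::invalid_argument("source and gather map must have the same row count");
  }
  std::vector<ListRow> result;
  result.reserve(source.size());
  for (std::size_t i = 0; i < source.size(); ++i) {
    const ListRow& row = source[i];
    const long long n = static_cast<long long>(row.size());
    ListRow out;
    out.reserve(gather_map[i].size());
    for (int idx : gather_map[i]) {
      // Wrap negative indices so that -1 refers to the last element of the row.
      const long long pos = idx < 0 ? idx + n : idx;
      if (pos < 0 || pos >= n) {
        out.push_back(std::nullopt);  // out of range: emit a null
      } else {
        out.push_back(row[static_cast<std::size_t>(pos)]);
      }
    }
    result.push_back(std::move(out));
  }
  return result;
}
```

With a source of {{1, 2, 3}, {4, 5}} and a gather map of {{2, 0}, {-1, 5}}, this sketch returns {{3, 1}, {5, null}}: every output row is built only from the source row at the same position.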

sperlingxx requested a review from a team as a code owner on April 15, 2022 09:38
sperlingxx added the "non-breaking" (Non-breaking change) label on Apr 15, 2022
github-actions bot added the "Java" (Affects Java cuDF API.) label on Apr 15, 2022
sperlingxx added the "feature request" (New feature or request) label on Apr 15, 2022

codecov bot commented Apr 15, 2022

Codecov Report

Merging #10669 (323a341) into branch-22.06 (8f5a044) will increase coverage by 0.06%.
The diff coverage is n/a.

@@               Coverage Diff                @@
##           branch-22.06   #10669      +/-   ##
================================================
+ Coverage         86.34%   86.41%   +0.06%     
================================================
  Files               142      142              
  Lines             22356    22334      -22     
================================================
- Hits              19304    19300       -4     
+ Misses             3052     3034      -18     
Impacted Files                               Coverage Δ
python/cudf/cudf/core/series.py              95.15% <0.00%> (-0.13%) ⬇️
python/cudf/cudf/core/dataframe.py           93.75% <0.00%> (-0.01%) ⬇️
python/cudf/cudf/core/groupby/groupby.py     91.72% <0.00%> (+0.22%) ⬆️
python/cudf/cudf/core/column/string.py       89.22% <0.00%> (+0.24%) ⬆️
python/cudf/cudf/core/tools/datetimes.py     84.49% <0.00%> (+0.30%) ⬆️
python/cudf/cudf/core/column/lists.py        92.79% <0.00%> (+1.27%) ⬆️
python/cudf/cudf/utils/cudautils.py          65.74% <0.00%> (+5.90%) ⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

try {
cudf::jni::auto_set_device(env);
auto src_col = reinterpret_cast<cudf::lists_column_view *>(source_column);
auto gather_map = reinterpret_cast<cudf::lists_column_view *>(gather_map_list);
Contributor

Please cast these to a cudf::column_view and then create a cudf::lists_column_view from them. Otherwise, nothing checks that the columns passed in are really lists. If they are not, really bad things could happen.

Contributor Author

Done.
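
The pattern requested in the review can be sketched with stand-in types. This is not the code from the PR, and `TypeId`, `ColumnViewSketch`, `ListsColumnViewSketch`, and `checked_list_size` are hypothetical names invented for this example. The idea, assuming the list-specific view validates its input on construction as the reviewer describes, is to cast the raw handle to the generic view type first and let the wrapper's constructor reject non-list columns.

```cpp
#include <cstdint>
#include <stdexcept>

// Hypothetical stand-in for a column's type tag.
enum class TypeId { INT32, STRING, LIST };

// Hypothetical stand-in for a generic column view.
struct ColumnViewSketch {
  TypeId type;
  int size;
};

// Hypothetical stand-in for a list-specific view. Its constructor validates
// that the wrapped column is a list, so misuse fails loudly and early.
class ListsColumnViewSketch {
 public:
  explicit ListsColumnViewSketch(const ColumnViewSketch& col) : col_(col) {
    if (col.type != TypeId::LIST) {
      throw std::invalid_argument("column is not a list column");
    }
  }
  int size() const { return col_.size; }

 private:
  ColumnViewSketch col_;
};

// Mimics a JNI entry point that receives a native address as an integer handle.
int checked_list_size(std::intptr_t handle) {
  // Cast the handle to the generic view type it was created as...
  auto const* col = reinterpret_cast<const ColumnViewSketch*>(handle);
  // ...then construct the list view, which performs the type check.
  ListsColumnViewSketch lists(*col);
  return lists.size();
}
```

Casting the handle straight to the list view type would skip the constructor, and with it the only place where the column type is verified. In this sketch, a handle to a non-list column makes `checked_list_size` throw `std::invalid_argument` instead of reading memory under a wrong assumption.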

@sperlingxx
Contributor Author

@gpucibot merge

rapids-bot bot merged commit 17d49fa into rapidsai:branch-22.06 on Apr 19, 2022
sperlingxx deleted the segment_gather_jni branch on April 19, 2022 01:05
Labels: feature request (New feature or request); Java (Affects Java cuDF API.); non-breaking (Non-breaking change)