-
Notifications
You must be signed in to change notification settings - Fork 916
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[REVIEW] concatenate row items using a separator defined per row #5204
Conversation
- this Closes rapidsai#3726 - this emulates `concatenate_ws` spark functionality - provides option for a global separator and global column null replacements - skips null values in a row to perform concatenation
Codecov Report
@@ Coverage Diff @@
## branch-0.15 #5204 +/- ##
==============================================
Coverage ? 88.38%
==============================================
Files ? 55
Lines ? 10489
Branches ? 0
==============================================
Hits ? 9271
Misses ? 1218
Partials ? 0
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good.
One minor thing. I've been trying to consistently use the plural strings whenever referring to a strings column in the documentation.
- add more tests (all empty string test)
Co-authored-by: David <[email protected]>
Co-authored-by: David <[email protected]>
Co-authored-by: David <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
[sc] i have checked with @revans2 offline and he did concur that the alternate api's requested in the ticket will result in as much (or more) intermediaries compared to existing append api. further, this api is required to support the |
This looks fine to me @rwlee does the API look like what you expected? |
Just out of curiosity, what can this new API achieve that wasn’t possible with the old concatenate. Eg could you achieve the same result by creating a If so, could this be better achieved by making the existing API more generic and take a null replacement per column of the values table? I’m not prescribing that this is how it should be done, just trying to understand the API requirements. |
|
Fair enough |
Retargeting to 0.15 since we enter code freeze tonight. |
Make sure you communicate with a maintainer when you open PRs. This one was not on my radar and therefore had not been added to the 0.14 project board. |
@devavret is this good to go? i have now re-targeted this against 0.15. |
@davidwendt @devavret - can one of you please merge this? |
concatenate_ws
spark functionality