Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Only exclude join keys that are indices from key columns #12271

Merged
merged 1 commit into from
Nov 30, 2022

Conversation

wence-
Copy link
Contributor

@wence- wence- commented Nov 30, 2022

Description

Previously, if any of the join keys were indices, we assumed that they
all were, and provided an empty set of key columns with matching names
in the left and right dataframe. This does the wrong thing for mixed
join keys (on a combination of index and normal columns), producing
more output columns than is correct. To avoid this, only skip matching
key names if they name indices.

Closes #11550.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Previously, if any of the join keys were indices, we assumed that they
all were, and provided an empty set of key columns with matching names
in the left and right dataframe. This does the wrong thing for mixed
join keys (on a combination of index and normal columns), producing
more output columns than is correct. To avoid this, only skip matching
key names if they name indices.

Closes rapidsai#11550.
@wence- wence- requested a review from a team as a code owner November 30, 2022 19:18
@wence-
Copy link
Contributor Author

wence- commented Nov 30, 2022

cc @eriknw

@github-actions github-actions bot added the Python Affects Python cuDF API. label Nov 30, 2022
@wence- wence- added 3 - Ready for Review Ready for review by team non-breaking Non-breaking change labels Nov 30, 2022
@codecov
Copy link

codecov bot commented Nov 30, 2022

Codecov Report

❗ No coverage uploaded for pull request base (branch-23.02@d970c5a). Click here to learn what that means.
Patch has no changes to coverable lines.

Additional details and impacted files
@@               Coverage Diff               @@
##             branch-23.02   #12271   +/-   ##
===============================================
  Coverage                ?   88.21%           
===============================================
  Files                   ?      137           
  Lines                   ?    22690           
  Branches                ?        0           
===============================================
  Hits                    ?    20017           
  Misses                  ?     2673           
  Partials                ?        0           

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

@shwina shwina added the bug Something isn't working label Nov 30, 2022
@shwina
Copy link
Contributor

shwina commented Nov 30, 2022

@gpucibot merge

@rapids-bot rapids-bot bot merged commit e321bf1 into rapidsai:branch-23.02 Nov 30, 2022
@wence- wence- deleted the wence/fix/issue-11550 branch November 30, 2022 22:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3 - Ready for Review Ready for review by team bug Something isn't working non-breaking Non-breaking change Python Affects Python cuDF API.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] cudf incorrect when merging on both index level and column when specifying left_on and right_on
2 participants