REF: de-duplicate IntervalIndex._intersection #41929

Merged: 1 commit into pandas-dev:master from ref-setops-4 on Jun 10, 2021

jbrockmendel
Member

made possible by #41863

Contributor

@jreback jreback left a comment

wow luv it!

@jreback
Contributor

jreback commented Jun 10, 2021

failures look spurious

@jreback jreback merged commit 5940c9c into pandas-dev:master Jun 10, 2021
@jbrockmendel jbrockmendel deleted the ref-setops-4 branch June 10, 2021 23:09
@simonjayhawkins simonjayhawkins added Index Related to the Index class or subclasses Interval Interval data type Refactor Internal refactoring of code labels Jun 11, 2021
@simonjayhawkins simonjayhawkins added this to the 1.3 milestone Jun 11, 2021
@jorisvandenbossche
Member

@jbrockmendel
Member Author

This might have caused a big slowdown [...] (the range of commits also includes some other commits)

The indexing bugfix might play a part, but it does seem likely that this is the culprit. Profiling it, it looks like drop_duplicates is the big time sink.
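For anyone who wants to look at this kind of profile themselves, here is a minimal sketch using cProfile on `IntervalIndex.intersection`. The index sizes, the breaks, and the helper name `profile_intersection` are all assumptions made for illustration. This is not the benchmark referenced above, and the hot spots it reports will depend on the pandas version installed.

```python
import cProfile
import io
import pstats

import numpy as np
import pandas as pd


def profile_intersection(n=10_000):
    """Profile IntervalIndex.intersection on two partially overlapping indexes.

    Hypothetical helper: n and the choice of breaks are arbitrary.
    """
    # left covers (0, 1], ..., (n-1, n]; right starts halfway through left.
    left = pd.IntervalIndex.from_breaks(np.arange(n + 1))
    right = pd.IntervalIndex.from_breaks(np.arange(n // 2, n + n // 2 + 1))

    profiler = cProfile.Profile()
    profiler.enable()
    result = left.intersection(right)
    profiler.disable()

    # Collect the 15 most expensive calls by cumulative time into a string.
    stream = io.StringIO()
    stats = pstats.Stats(profiler, stream=stream)
    stats.sort_stats("cumulative").print_stats(15)
    return result, stream.getvalue()


result, report = profile_intersection()
print(report)
```

Sorting by cumulative time makes it easier to see which helper under `intersection` dominates the call, which is the kind of reading that would point at something like `drop_duplicates`.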

@jbrockmendel
Member Author

I've got an IntervalArray.unique branch in the works that cuts this from 4x down to 2x; still a ways to go.

Labels
Index Related to the Index class or subclasses Interval Interval data type Refactor Internal refactoring of code
4 participants