Fix compile time explosion for minkowski distance #1254
As explained in rapidsai#1246 (comment), ptxas chokes on the minkowski distance when `VecLen==4` and `IdxT==uint32_t`. This PR removes the `veclen == 4` specialization for the minkowski distance.
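To illustrate the kind of change described above, here is a minimal host-side sketch of capping the vector length for a single distance op so that its `VecLen == 4` instantiation is never generated. Every name here (`l2_op`, `minkowski_op`, `max_veclen`, `launch`, `dispatch`) is hypothetical and chosen for illustration; this is not RAFT's actual dispatch code, and the alignment rule is a simplified assumption.

```cpp
#include <cstddef>
#include <type_traits>

// Hypothetical tags standing in for distance operations (not RAFT types).
struct l2_op {};
struct minkowski_op {};

// Largest vector length an op may be instantiated with. Default: 4.
template <typename OpT>
struct max_veclen : std::integral_constant<int, 4> {};

// Assumption for this sketch: cap minkowski at 2, so the expensive
// <minkowski_op, 4> instantiation is never requested from the compiler.
template <>
struct max_veclen<minkowski_op> : std::integral_constant<int, 2> {};

// Stand-in for a kernel launch; returns the vector length it was built with.
template <typename OpT, int VecLen>
int launch()
{
  return VecLen;
}

// Ops capped at 2: try VecLen 2, otherwise fall back to 1.
template <typename OpT>
int dispatch_impl(std::size_t row_bytes, std::size_t elem_bytes,
                  std::integral_constant<int, 2>)
{
  if (row_bytes % (2 * elem_bytes) == 0) { return launch<OpT, 2>(); }
  return launch<OpT, 1>();
}

// Ops allowed up to 4: try VecLen 4 first, then reuse the capped path.
template <typename OpT>
int dispatch_impl(std::size_t row_bytes, std::size_t elem_bytes,
                  std::integral_constant<int, 4>)
{
  if (row_bytes % (4 * elem_bytes) == 0) { return launch<OpT, 4>(); }
  return dispatch_impl<OpT>(row_bytes, elem_bytes,
                            std::integral_constant<int, 2>{});
}

// Entry point: the overload is selected at compile time from max_veclen,
// so launch<minkowski_op, 4> is never instantiated.
template <typename OpT>
int dispatch(std::size_t row_bytes, std::size_t elem_bytes)
{
  return dispatch_impl<OpT>(row_bytes, elem_bytes,
                            typename max_veclen<OpT>::type{});
}
```

Because the cap is resolved through overload selection rather than a runtime branch, the capped op's `VecLen == 4` code path is never compiled at all, which is the property that matters for compile time. In this sketch, a row of four 4-byte elements dispatches to `VecLen == 4` for `l2_op` but to `VecLen == 2` for `minkowski_op`.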
Codecov Report
Base: 87.99% // Head: 87.99% // No change to project coverage

Coverage diff:
Metric     branch-23.04   #1254
Coverage   87.99%         87.99%
Files      21             21
Lines      483            483
Hits       425            425
Misses     58             58
Not sure how I missed this one originally but I'd like to keep the (modern) naming conventions more consistent across the APIs.
Also, I want to make sure the seemingly lower compile times aren't the result of sccache (which I suspect they are) giving us false hope here. This is what happened before the release: we saw 45-minute compile times because of sccache, got excited, and then realized after the sccache reset that the compiles were still taking >6 hours. I'd like to do some more profiling of this before merging.
Did you reset sccache in commit 585fc47 or do we still have to test this?
I tried to but that didn't seem to work. I'm going to build locally without sccache in the meantime since my workstation specs are similar to CI.
Here's the ninja trace log:
The trace above is from building RAFT on my local machine with the
Looks like the high offenders are now (in no particular order):
I notice that many of the slow source files are taking 1.5 hours or more to compile.
I'm going ahead and approving this guy because I think the build issues related to these changes have been fixed. The other build issues predate these changes, I believe.
/merge
@ahendriksen just leaving a note for later that we should pull the
LGTM
As explained in rapidsai#1246 (comment), ptxas chokes on the minkowski distance when `VecLen==4` and `IdxT==uint32_t`. This PR removes the veclen == 4 specialization for the minkowski distance.

Follow up to: rapidsai#1239

Authors:
- Allard Hendriksen (https://github.com/ahendriksen)
- Corey J. Nolet (https://github.com/cjnolet)

Approvers:
- Corey J. Nolet (https://github.com/cjnolet)
- Sean Frye (https://github.com/sean-frye)

URL: rapidsai#1254