[FEA] Update cmake to only build for present GPU #2902
Labels
CMake: CMake build issue
feature request: New feature or request
libcudf: Affects libcudf (C++/CUDA) code.
Is your feature request related to a problem? Please describe.
As of 388dd12, libcudf's full build compile time using ninja is: 7m 40s
The longer the compile time, the slower the development process. As such, we should do what we can to reduce compile time. Currently, libcudf's cmake configuration builds a fatbin for both sm60 and sm70.
If I disable the fatbin and only generate PTX for my target architecture (compute_70), compile time is: 3m 38s
However, this comes at a significant runtime cost: as soon as you try and run any tests, the CUDA runtime will JIT compile all of the library's PTX into SASS (which takes a very long time).
Instead, if we generate a fatbin for only a single GPU (e.g., compute_70,sm_70), compile time is: 4m 54s
Describe the solution you'd like
Update the CMake build to only generate a fatbin for the architecture present on the machine.
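One possible way to express this request, shown here only as a minimal sketch and not as libcudf's actual build configuration: CMake 3.24 and later accept the special value native for CMAKE_CUDA_ARCHITECTURES, which compiles for the GPUs detected on the build machine. The project name example_project and the target example_lib are hypothetical placeholders, and the sketch assumes a build machine with a visible GPU and a CMake version newer than what was available when this issue was filed.

```cmake
# Sketch only: assumes CMake >= 3.24, where the "native" value is supported.
cmake_minimum_required(VERSION 3.24)

# Default to the GPU(s) present on the build machine, but let the user
# override from the command line, e.g. -DCMAKE_CUDA_ARCHITECTURES="60;70".
if(NOT DEFINED CMAKE_CUDA_ARCHITECTURES)
  set(CMAKE_CUDA_ARCHITECTURES native)
endif()

# Hypothetical project name, for illustration only.
project(example_project LANGUAGES CXX CUDA)

# Hypothetical target; it inherits CMAKE_CUDA_ARCHITECTURES through its
# CUDA_ARCHITECTURES property.
add_library(example_lib STATIC example.cu)
```

The three configurations timed above roughly correspond to explicit values of the same variable: "60;70" builds for both architectures, 70-virtual emits only PTX for compute_70, and 70 emits both SASS for sm_70 and PTX for compute_70. A developer could pass any of these with -DCMAKE_CUDA_ARCHITECTURES=... to override the detected default.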
Additional context
Related: #1099 #581