BUG fix BatchedLevelAlgo DtClsTest & DtRegTest failing tests #3690

venkywonka · 2021-04-01T04:47:07Z

This PR fixes the regressions in BatchedLevelAlgo/DtClsTestF and BatchedLevelAlgo/DtRegTestF, where the quantiles argument passed to the grow_tree function pointed to uninitialized memory instead of the quantiles computed for each column.
It also replaces the old method of computing quantiles (preprocess_quantiles) with the new, more accurate one (computeQuantiles) and removes an unnecessary memory allocation to tempmem in the setup phase of the test fixture.
This fixes the failing BatchedLevelAlgo/DtRegTestF tests reported in issue "[BUG] CUDA 11.2 libcuml++ C++ test failures EDIT: Updated with 11.2 update 2" #3406.
It also fixes the failing BatchedLevelAlgo/DtClsTestF tests in PR "ENH Decision Tree new backend computeSplitClassificationKernel histogram calculation and occupancy optimization" #3616.
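
To illustrate what "quantiles computed for each column" means, here is a minimal host-side sketch. It is not the cuML `computeQuantiles` implementation: the function name `column_quantiles`, its signature, the column-major layout, and the binning rule are all assumptions made for illustration. The point is only that the quantile buffer has to be filled per column before tree growth reads it.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper for illustration only; NOT the cuML computeQuantiles API.
// Assumes `data` is column-major: column c occupies
// data[c * n_rows .. (c + 1) * n_rows).
// Returns n_bins quantile values per column, stored column by column.
std::vector<float> column_quantiles(const std::vector<float>& data,
                                    std::size_t n_rows, std::size_t n_cols,
                                    std::size_t n_bins) {
  assert(n_rows > 0 && n_bins > 0);
  assert(data.size() == n_rows * n_cols);

  // The output buffer is fully written below, so no entry is left as garbage.
  std::vector<float> quantiles(n_bins * n_cols);
  std::vector<float> sorted(n_rows);

  for (std::size_t c = 0; c < n_cols; ++c) {
    // Sort a copy of the column so the input stays untouched.
    std::copy(data.begin() + c * n_rows, data.begin() + (c + 1) * n_rows,
              sorted.begin());
    std::sort(sorted.begin(), sorted.end());

    for (std::size_t b = 0; b < n_bins; ++b) {
      // Last row of bin b when the sorted rows are split into n_bins groups
      // of near-equal size (an assumed binning rule).
      std::size_t idx = ((b + 1) * n_rows) / n_bins;
      idx = idx > 0 ? idx - 1 : 0;
      quantiles[c * n_bins + b] = sorted[idx];
    }
  }
  return quantiles;
}
```

With two columns of four rows each and `n_bins = 2`, the call returns two thresholds per column, one at the median position and one at the column maximum.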

* updated to new method for quantiles computation
* deleted unused tempmem

teju85

Changes LGTM. Thanks @venkywonka for the quick fix.

codecov-io · 2021-04-01T07:15:26Z

Codecov Report

Merging #3690 (78ec480) into branch-0.19 (fd9ec89) will increase coverage by 1.77%.
The diff coverage is 100.00%.

@@               Coverage Diff               @@
##           branch-0.19    #3690      +/-   ##
===============================================
+ Coverage        80.70%   82.48%   +1.77%     
===============================================
  Files              227      227              
  Lines            17615    17541      -74     
===============================================
+ Hits             14217    14469     +252     
+ Misses            3398     3072     -326

| Flag | Coverage Δ | |
|---|---|---|
| dask | `46.40% <100.00%> (+1.40%)` | ⬆️ |
| non-dask | `74.46% <100.00%> (+1.54%)` | ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ | |
|---|---|---|
| ...cuml/_thirdparty/sklearn/preprocessing/__init__.py | `100.00% <ø> (ø)` | |
| ...on/cuml/_thirdparty/sklearn/preprocessing/_data.py | `64.36% <ø> (+1.24%)` | ⬆️ |
| ...hirdparty/sklearn/preprocessing/_discretization.py | `83.33% <ø> (-0.88%)` | ⬇️ |
| ...l/_thirdparty/sklearn/preprocessing/_imputation.py | `64.25% <ø> (+1.45%)` | ⬆️ |
| ...cuml/_thirdparty/sklearn/utils/skl_dependencies.py | `80.00% <ø> (+26.07%)` | ⬆️ |
| python/cuml/cluster/__init__.py | `100.00% <ø> (ø)` | |
| python/cuml/cluster/agglomerative.pyx | `96.47% <ø> (ø)` | |
| python/cuml/cluster/dbscan.pyx | `98.19% <ø> (-1.81%)` | ⬇️ |
| python/cuml/cluster/kmeans.pyx | `91.95% <ø> (ø)` | |
| python/cuml/common/array_sparse.py | `96.29% <ø> (+1.95%)` | ⬆️ |

... and 92 more

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update daf0d14...78ec480.

JohnZed · 2021-04-01T19:46:41Z

@gpucibot merge

…ogram calculation and occupancy optimization (#3616)

* This PR introduces:
  * A faster way to calculate the histograms containing splits in `ML::DecisionTree::computeSplitClassificationKernel`. These histograms are used for node splitting in decision trees for classification.
  * A change in the default `gridDim.x` in the launch configuration of the above kernel from `4` to a value based on the occupancy calculator and the other grid dimensions, improving occupancy toward the theoretical limit.
* Previously, too many atomic adds to shared memory limited the kernel times. This is now avoided by using block-wide sum-scans to obtain the same histogram with fewer atomic writes to shared memory.
* The resulting kernel-time speedups are significant (up to 30x for some nodes).
* `computeSplitRegressionKernel` has different shared-memory write patterns that deserve their own PR for optimization 😬
* Tests will pass once #3690 is merged.

Authors:
- Venkat (https://github.com/venkywonka)

Approvers:
- AJ Schmidt (https://github.com/ajschmidt8)
- Philip Hyunsu Cho (https://github.com/hcho3)
- Thejaswi. N. S (https://github.com/teju85)
- John Zedlewski (https://github.com/JohnZed)

URL: #3616
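
The idea of replacing many atomic adds with a sum-scan can be sketched sequentially on the host. This is an analogy under stated assumptions, not the kernel from #3616: the function names, the use of sorted split thresholds as bins, and the "count of samples at or below each threshold" histogram are all hypothetical. The naive version touches every qualifying bin for every sample; the scan version writes once per sample and then runs a prefix sum, which plays the role of the block-wide scan.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical names; a sequential CPU analogy, not the cuML kernel.

// Naive: each sample increments every bin whose threshold is >= the sample.
// On a GPU, each of these increments would be an atomic add.
std::vector<int> cumulative_hist_naive(const std::vector<float>& values,
                                       const std::vector<float>& thresholds) {
  std::vector<int> hist(thresholds.size(), 0);
  for (float v : values) {
    for (std::size_t b = 0; b < thresholds.size(); ++b) {
      if (v <= thresholds[b]) ++hist[b];
    }
  }
  return hist;
}

// Scan-based: each sample increments only the first bin it falls into,
// then an inclusive prefix sum turns per-bin counts into cumulative counts.
std::vector<int> cumulative_hist_scan(const std::vector<float>& values,
                                      const std::vector<float>& thresholds) {
  assert(std::is_sorted(thresholds.begin(), thresholds.end()));
  std::vector<int> hist(thresholds.size(), 0);
  for (float v : values) {
    // First threshold that is >= v; samples above all thresholds are skipped.
    auto it = std::lower_bound(thresholds.begin(), thresholds.end(), v);
    if (it != thresholds.end()) ++hist[it - thresholds.begin()];
  }
  std::partial_sum(hist.begin(), hist.end(), hist.begin());
  return hist;
}
```

Both functions return the same histogram for sorted thresholds, but the scan version performs at most one increment per sample instead of up to one per bin.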

venkywonka added 2 commits April 1, 2021 10:01

🛠 fix garbage quantiles bug

4a4e3cc

* updated to new method for quantiles computation
* deleted unused tempmem

🎨 clang format fix

78ec480

venkywonka requested a review from a team as a code owner April 1, 2021 04:47

github-actions bot added the CUDA/C++ label Apr 1, 2021

venkywonka added the bug, CUDA / C++, non-breaking, and tests labels and removed the CUDA/C++ label Apr 1, 2021

teju85 approved these changes Apr 1, 2021

venkywonka mentioned this pull request Apr 1, 2021

ENH Decision Tree new backend computeSplitClassificationKernel histogram calculation and occupancy optimization #3616

Merged

JohnZed approved these changes Apr 1, 2021

rapids-bot bot merged commit 538a2be into rapidsai:branch-0.19 Apr 1, 2021

dantegd mentioned this pull request Apr 5, 2021

[BUG] CUDA 11.2 libcuml++ C++ test failures EDIT: Updated with 11.2 update 2 #3406

Closed

8 tasks
