
[REVIEW] Fix memory leak and unnecessary allocs in TSNE::get_distances #2542

Merged · 1 commit into rapidsai:branch-0.15 · Jul 12, 2020

Conversation

zbjornson (Contributor)

c3ab759 left behind two extra allocations. Those allocations also leaked because `delete knn_input, sizes` only deletes `knn_input`. (It should also be `delete[]`.)

Finally, modernizes the vector initialization syntax.

@zbjornson zbjornson requested a review from a team as a code owner July 11, 2020 04:08
@GPUtester (Contributor)

Can one of the admins verify this patch?

std::vector<int> sizes_vec(1);
input_vec.push_back(knn_input[0]);
sizes_vec.push_back(sizes[0]);
// TODO make brute_force_knn take a const float*
Member

We could add a convenience wrapper for the brute force kNN that accepts a single float* and calls the version that accepts multiple. Would you like to do that in this PR?

zbjornson (Contributor, Author)

Oops, unclear comment. I was only trying to avoid the `const_cast`s here and on line 58R, not the vector. But it looks like making it const-correct would be hard, so I've removed the comment.

@cjnolet added labels "3 - Ready for Review" (Ready for review by team), "bug" (Something isn't working), "CUDA / C++" (CUDA issue) on Jul 11, 2020
@cjnolet (Member) left a comment

Changes LGTM. Thanks for the contribution!

@JohnZed (Contributor) commented Jul 11, 2020

Ok to test

@cjnolet cjnolet merged commit 069a229 into rapidsai:branch-0.15 Jul 12, 2020
@zbjornson zbjornson deleted the bug-tsne-memleak branch July 12, 2020 00:50
@zbjornson (Contributor, Author)

Just realized that this fixed another bug that caused double the memory usage.

`std::vector<float *> input_vec(1);` constructs a vector with one default-inserted element (constructor definition 4), then `push_back()` expands the vector so it has size 2. Downstream, the knn code branches on `input.size() > 1` to allocate a lot of temp memory (and do extra work):

if (input.size() > 1) {
all_D.resize(input.size() * k * n, userStream);
all_I.resize(input.size() * k * n, userStream);

The only other place I see that issue is here, but it's in a test, so not necessarily wrong:

std::vector<float *> input_vec(1);
std::vector<int> sizes_vec(1);
input_vec.push_back(ptrs[0]);
sizes_vec.push_back(sizes[0]);

@cjnolet (Member) commented Jul 12, 2020

@zbjornson feel free to make the change to the kNN test in #2548.

zbjornson added a commit to zbjornson/cuml that referenced this pull request Jul 21, 2020