Improve performance of quadtree point-to-polyline join #362

trxcllnt · 2021-02-25T06:48:10Z

Rewrites quadtree_point_to_nearest_polyline via thrust in the style of quadtree_point_in_polygon to get a 5x speed boost.

Benchmarked locally with NYC taxi 169M float32 points (RMM pool mode enabled):

quadtree_point_to_nearest_polyline (before)
1min ± 10.4 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

quadtree_point_to_nearest_polyline (after)
9.49 s ± 15 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Timings for every routine in the benchmark after these changes:

quadtree_on_points
66.2 ms ± 64.3 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

polygon_bounding_boxes
322 µs ± 375 ns per loop (mean ± std. dev. of 7 runs, 1000 loops each)

join_quadtree_and_bounding_boxes
11.4 ms ± 8.19 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

quadtree_point_in_polygon
5.48 s ± 54.4 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

polyline_bounding_boxes
258 µs ± 2.37 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

join_quadtree_and_bounding_boxes
11.5 ms ± 17.5 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

quadtree_point_to_nearest_polyline
9.49 s ± 15 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

… of quadtree_point_in_polygon to get a 5x perf boost

thomcom · 2021-03-26T14:51:34Z

Mind iterating on this one's passing status when you get the chance @trxcllnt ?

trxcllnt · 2021-03-26T16:49:43Z

@thomcom yeah for sure. We might also want to wait on this PR because, while it's faster, it requires allocating much more temporary memory than the current implementation. I think it's possible to apply some of the techniques in the hausdorff iterator to eliminate that extra memory, but I'd need to explore it in depth.

edit: The new implementation that doesn't use as much intermediate memory done in e79caa9 🎉

…quadtree-point-to-nearest-polyline-speed-boost

…mpatible with reduce_by_key

trxcllnt · 2021-04-07T21:33:36Z

@harrism I re-requested your review for the new version of quadtree_point_to_nearest_polyline.cu

…quadtree-point-to-nearest-polyline-speed-boost

cpp/src/join/quadtree_point_to_nearest_polyline.cu

trxcllnt · 2021-04-16T18:38:56Z

rerun tests

harrism · 2021-05-04T20:43:07Z

@harrism I re-requested your review for the new version of quadtree_point_to_nearest_polyline.cu

I totally missed this. I'm sorry.

harrism

Looks great. Performance looks great. Several comments, but only one absolutely required change (missing stream sync).

cpp/src/join/quadtree_point_in_polygon.cu

cpp/src/utility/point_to_nearest_polyline.cuh

cpp/tests/join/point_in_polygon_test_small.cpp

cpp/src/join/quadtree_point_to_nearest_polyline.cu

…quadtree-point-to-nearest-polyline-speed-boost

trxcllnt · 2021-05-12T02:52:19Z

rerun tests

harrism

Just a couple more fixes.

cpp/src/utility/point_to_nearest_polyline.cuh

cpp/src/join/quadtree_point_to_nearest_polyline.cu

Co-authored-by: Mark Harris <[email protected]>

trxcllnt · 2021-05-20T19:32:35Z

@gpucibot merge

cwharris · 2021-05-20T19:40:10Z

I had questions regarding how the tests were updated due to the new sortedness of the outputs, and @trxcllnt answered those offline. basically, user's weren't guaranteed any sortedness prior to this change, and now we are sorting the output ascending, so no breaking change.

lgtm.

implement quadtree_point_to_nearest_polyline with thrust in the style…

a5f47c0

… of quadtree_point_in_polygon to get a 5x perf boost

trxcllnt requested a review from a team as a code owner February 25, 2021 06:48

trxcllnt requested review from thomcom, zhangjianting and harrism February 25, 2021 06:48

trxcllnt added 3 - Ready for Review Ready for review by team improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 25, 2021

free intermediate local_point_offsets in quadtree_point_in_polygon

ca2e630

harrism approved these changes Mar 2, 2021

View reviewed changes

trxcllnt added 2 commits March 31, 2021 17:12

Merge branch 'branch-0.19' of github.com:rapidsai/cuspatial into fix/…

d90ac5f

…quadtree-point-to-nearest-polyline-speed-boost

update for set_element_async API change

b843069

github-actions bot added the libcuspatial Relates to the cuSpatial C++ library label Mar 31, 2021

trxcllnt added 2 commits April 7, 2021 16:18

add p2np iterator that yields point/polyline/distances in an order co…

e79caa9

…mpatible with reduce_by_key

clean up p2np includes

e07483b

trxcllnt requested a review from harrism April 7, 2021 21:33

Merge branch 'branch-0.19' of github.com:rapidsai/cuspatial into fix/…

7cf6461

…quadtree-point-to-nearest-polyline-speed-boost

trxcllnt added the 5 - DO NOT MERGE Hold off on merging; see PR for details label Apr 8, 2021

use thrust binary searches instead of linear search

c061759

trxcllnt commented Apr 8, 2021

View reviewed changes

cpp/src/join/quadtree_point_to_nearest_polyline.cu Outdated Show resolved Hide resolved

trxcllnt changed the base branch from branch-0.19 to branch-0.20 April 8, 2021 17:02

trxcllnt added the 2 - In Progress Currenty a work in progress label Apr 8, 2021

trxcllnt mentioned this pull request Apr 9, 2021

[FEA]Point-to-polyline nearest neighbor distance #342

Open

trxcllnt added 3 commits April 9, 2021 12:10

fix GCC 9 RVO warning/error

1fa3514

use new Thrust 1.12.0 make_zip_iterator instead of our custom helper

cc1cb96

update copyrights

5a33ca7

github-actions bot added the Python Related to Python code label Apr 14, 2021

trxcllnt removed 2 - In Progress Currenty a work in progress 5 - DO NOT MERGE Hold off on merging; see PR for details Python Related to Python code cmake Related to CMake code or build configuration java labels Apr 15, 2021

harrism requested changes May 6, 2021

View reviewed changes

trxcllnt added 2 commits May 5, 2021 21:58

Merge branch 'branch-0.20' of github.com:rapidsai/cuspatial into fix/…

53adc9e

…quadtree-point-to-nearest-polyline-speed-boost

use C++17 structured binding

43bbd91

trxcllnt requested a review from a team as a code owner May 6, 2021 04:13

github-actions bot added the Python Related to Python code label May 6, 2021

trxcllnt added 4 commits May 5, 2021 23:21

avoid the copies incurred by shrinking the device_uvectors

2fb630e

remove dead code

ced903b

call cudaMemsetAsync with an int instead of a float/double

f523461

factor common code into point.cuh

1bc89b4

trxcllnt requested a review from harrism May 12, 2021 05:35

harrism requested changes May 18, 2021

View reviewed changes

cpp/src/utility/point_to_nearest_polyline.cuh Outdated Show resolved Hide resolved

cpp/src/join/quadtree_point_to_nearest_polyline.cu Outdated Show resolved Hide resolved

trxcllnt and others added 2 commits May 18, 2021 07:03

Apply suggestions from code review

82f0569

Co-authored-by: Mark Harris <[email protected]>

fix lint

c663e5e

harrism approved these changes May 20, 2021

View reviewed changes

cwharris approved these changes May 20, 2021

View reviewed changes

rapids-bot bot merged commit 6eb044e into rapidsai:branch-21.06 May 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of quadtree point-to-polyline join #362

Improve performance of quadtree point-to-polyline join #362

trxcllnt commented Feb 25, 2021

thomcom commented Mar 26, 2021

trxcllnt commented Mar 26, 2021 •

edited

Loading

trxcllnt commented Apr 7, 2021 •

edited

Loading

trxcllnt commented Apr 16, 2021

harrism commented May 4, 2021

harrism left a comment

trxcllnt commented May 12, 2021

harrism left a comment

trxcllnt commented May 20, 2021

cwharris commented May 20, 2021 •

edited

Loading

Improve performance of quadtree point-to-polyline join #362

Improve performance of quadtree point-to-polyline join #362

Conversation

trxcllnt commented Feb 25, 2021

thomcom commented Mar 26, 2021

trxcllnt commented Mar 26, 2021 • edited Loading

trxcllnt commented Apr 7, 2021 • edited Loading

trxcllnt commented Apr 16, 2021

harrism commented May 4, 2021

harrism left a comment

Choose a reason for hiding this comment

trxcllnt commented May 12, 2021

harrism left a comment

Choose a reason for hiding this comment

trxcllnt commented May 20, 2021

cwharris commented May 20, 2021 • edited Loading

trxcllnt commented Mar 26, 2021 •

edited

Loading

trxcllnt commented Apr 7, 2021 •

edited

Loading

cwharris commented May 20, 2021 •

edited

Loading