Implement predict_per_tree() in FIL #5303

Merged: 19 commits merged into rapidsai:branch-23.04 from the predict_per_tree branch, Mar 30, 2023

Conversation

@hcho3 (Contributor) commented Mar 27, 2023

No description provided.

@github-actions bot added the CUDA/C++ and Cython / Python labels Mar 27, 2023
@hcho3 added the non-breaking and improvement labels Mar 27, 2023
@hcho3 marked this pull request as ready for review March 27, 2023 22:35
@hcho3 requested review from a team as code owners March 27, 2023 22:35
@hcho3 changed the title from "Implement predict_per_tree()" to "Implement predict_per_tree() in FIL" Mar 27, 2023
@wphicks (Contributor) left a comment

Looking great! I left some inline feedback on details, but overall this is in very good shape.

The one larger thing I'd like to see us clean up is how we handle the shared-memory-to-global-memory fallback. Rather than having custom logic at every point where we touch it, it would be nice to encapsulate all of that in shared_memory_buffer. Specifically, I would love to see the signature of fill change to:

```cpp
fill(index_type element_count, T value = T{}, T* fallback_buffer = nullptr)
```

Then we don't need any special logic for whether or not we're using the fallback. Before we launch the kernel, we allocate global memory if we detect that there isn't enough shared memory; otherwise, we have an empty buffer. We then pass that pointer through the layers, untouched, down to shared_memory_buffer. If it sees that the fill would overflow shared memory, it fills the fallback buffer and returns that pointer instead. That keeps the logic the same for all the different prediction types, and it encapsulates any special handling in the shared_memory_buffer object.
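To make that concrete, here is a minimal sketch of what such a fill() could look like, assuming a simplified shared_memory_buffer; the member names and layout below are illustrative, not the actual cuML implementation:

```cpp
#include <cstdint>

using index_type = std::uint32_t;  // placeholder for cuML's real index_type

// Illustrative stand-in for shared_memory_buffer; the members are assumptions.
template <typename T>
struct shared_memory_buffer {
  T* data;           // pointer into dynamic shared memory
  index_type size;   // number of elements available in shared memory

  // Fill element_count entries with value. If the request fits in shared
  // memory, use it; otherwise fall back to the caller-provided global-memory
  // buffer (allocated on the host before launch; nullptr when unused).
  __device__ T* fill(index_type element_count,
                     T value = T{},
                     T* fallback_buffer = nullptr) {
    T* out = (element_count <= size) ? data : fallback_buffer;
    for (auto i = threadIdx.x; i < element_count; i += blockDim.x) {
      out[i] = value;
    }
    __syncthreads();
    return out;  // callers consume whichever buffer was actually filled
  }
};
```

With this shape, callers never branch on which memory space is in use; they simply use the returned pointer.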

Other than that, I think we're looking pretty great!

Inline review threads (all resolved):
cpp/include/cuml/experimental/fil/detail/infer.hpp (1, outdated)
cpp/include/cuml/experimental/fil/detail/infer/gpu.cuh (4, outdated)
cpp/include/cuml/experimental/fil/output_kind.hpp (1, outdated)
python/cuml/experimental/fil/fil.pyx (4, of which 3 outdated)
@hcho3 mentioned this pull request Mar 28, 2023
@hcho3 (Contributor, Author) commented Mar 29, 2023

FYI, I renamed the enum throughout the codebase so that it's not confused with the array type.

  • output_type -> infer_type or predict_type, depending on context.
  • output_kind -> infer_kind
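
For illustration only, the renamed enum might look roughly like this; the exact enumerators and values are assumptions based on the rename described above, not copied from the PR:

```cpp
// Illustrative sketch of the renamed enum; enumerator names and values
// are assumptions, not the actual cuML definition.
enum class infer_kind : unsigned char {
  default_kind = 0,  // standard aggregated model output
  per_tree     = 1   // one output per tree, used by predict_per_tree()
};
```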

@wphicks (Contributor) left a comment

Beautiful! Everything about this is great. I had one small request about where we put the output_t logic, but it shouldn't hold up the merge.

I'm testing for perf regressions now, and we can merge as soon as that gets cleared. If we get that output_t change in before that's done, great, but otherwise I'm fine with merging as is.

Inline review thread (resolved):
cpp/include/cuml/experimental/fil/detail/output_type.hpp (outdated)
@wphicks (Contributor) commented Mar 30, 2023

/merge

@rapids-bot merged commit ecd4d02 into rapidsai:branch-23.04 Mar 30, 2023
@hcho3 deleted the predict_per_tree branch March 30, 2023 22:20
wphicks added a commit that referenced this pull request Apr 4, 2023
wphicks added a commit to wphicks/cuml that referenced this pull request Apr 6, 2023: "This reverts commit ecd4d02 to avoid a race condition."