Define k-core API and tests #2712

ChuckHastings · 2022-09-21T17:23:37Z

Define k-core C++ API and tests.

Closes #2631
Closes #2632
Closes #2633
Closes #2635

codecov-commenter · 2022-09-22T02:16:20Z

Codecov Report

Base: 60.04% // Head: 60.04% // No change to project coverage 👍

Coverage data is based on head (59bce1d) compared to base (3eb2b40).
Patch has no changes to coverable lines.

❗ Current head 59bce1d differs from pull request most recent head 3f7db87. Consider uploading reports for the commit 3f7db87 to get more accurate results

Additional details and impacted files

@@              Coverage Diff              @@
##           branch-22.10    #2712   +/-   ##
=============================================
  Coverage         60.04%   60.04%           
=============================================
  Files               111      111           
  Lines              6184     6184           
=============================================
  Hits               3713     3713           
  Misses             2471     2471

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

naimnv · 2022-09-23T12:58:23Z

cpp/include/cugraph/algorithms.hpp

+ *
+ * @return edge list for the graph
+ */
+template <typename vertex_t, typename edge_t, typename weight_t, bool multi_gpu>


Would it be a better idea to pass bool transpose=false as template parameter and then use transpose as a named parameter to the function instead of implicit false?

We try to avoid excess compilation of different templated functions, as this increased compile time.

Most of our functions are implemented based on an assumption about whether the graph is stored transposed or not. For example, pagerank assumes that store_transposed=true. Implementing pagerank if store_transposed=false would result in a great deal more synchronization (and communication in a multi-gpu environment). There are a few examples that will work on a graph in either orientation.

Supporting this would require providing an implementation that would work with either orientation of the graph. It seems like as long as this matches the requirement for core_number (which has to be called first) then just one orientation is sufficient.

naimnv · 2022-09-23T12:59:16Z

cpp/src/cores/k_core_impl.cuh

+template <typename vertex_t, typename edge_t, typename weight_t, bool multi_gpu>
+std::tuple<rmm::device_uvector<vertex_t>,


Same here as mentioned in my previous comment.

See above reaction.

seunghwak · 2022-09-23T17:59:12Z

cpp/include/cugraph/algorithms.hpp

+ * @tparam edge_t Type of edge identifiers. Needs to be an integral type.
+ * @tparam weight_t Type of edge weights. Needs to be a floating point type.
+ * @tparam multi_gpu Flag indicating whether template instantiation should target single-GPU (false)
+ * @param  graph           cuGraph graph in coordinate format


I assume this comment is outdated.

seunghwak · 2022-09-23T18:05:58Z

cpp/include/cugraph/algorithms.hpp

+std::tuple<rmm::device_uvector<vertex_t>,
+           rmm::device_uvector<vertex_t>,
+           std::optional<rmm::device_uvector<weight_t>>>
+k_core(raft::handle_t const& handle,


Something to think about...

So, we're passing core numbers here. So in this case, should we really have both this function and induced subgraph?

seeing core numbers, we can easily extract a vertex list in a single thrust call (e.g. thrust::copy_if). Then, we can pass a vertex list to induced subgraph.

Not sure this function adds enough convenience to justify increase in compile time/binary size.

Maybe OK, if this calls explicitly instantiated induced_subgraph, increases in compile time/binary size might be minimal...

Still debating between whether this function should take core_numbers or just call core_numbers internally.... (to maximize user convenience, if not, a user can just separately call core_numbers, something like thurst::copy_if, and induced_subgraph).

I had that debate internally. Ultimately I concluded that if we're going to expose a complete k-core implementation at the python level we should probably expose it at the lower levels as well. It seems like something that would be useful.

I do wonder if we should make passing in core_numbers optional, and if they are not passed then also call core_numbers. This would allow a simple caller that just wants to extract a 3-core from the graph to be able to call the function directly and get the complete answer, while a more sophisticated case might call core_number once and reuse the result to extract the different subgraphs as required.

Yeah... +1 for taking core_numbers as std::optional.

Still a bit not-sure about "Ultimately I concluded that if we're going to expose a complete k-core implementation at the python level we should probably expose it at the lower levels as well."

At the python layer, user convenience might be more important than anything else. And providing an additional python function that internally calls core_number, find vertex list, and induced subgraph has no toll on compile time/binary size as python is an interpreted language.

Not sure we should provide the same level of convenience for C++ users, as we assume C++ users are more advanced and they might be OK about finding k-core by composing core_numbers and induced subgraph.

But this is a much bigger topic, and we may have this discussion again in the future.

And another thing to consider is that this may incur some sort of dependency within a same algorithm level in our C++ software hierarchy. Most algorithms are composed of primitives and thrust calls. But now algorithms are composed of primitives, thrust calls, and other algorithms. I am not sure what kind of complications this can incur in the future.

Composing C++ algorithms in the C layer might be OK, but I am not sure whether we should support use cases that can be supported simply by composing few existing algorithms in the C++ level.

Fair point.

I will change core_numbers to be optional.

We can explore dropping the k_core from the C++ layer if having the algorithms depend on other algorithms becomes an issue. I see the point that adding it at the C layer may be sufficient.

… user deletes the original but needs to recreate

jnke2016 · 2022-09-24T04:22:17Z

I don't see where these k-core functions are implemented: cugraph_core_result_get_src_vertices, cugraph_core_result_get_dst_vertices, cugraph_core_result_get_weights. Please can you point me to those? I am getting the error below in pylibcugraph

cugraph_core_result_get_src_vertices' is not a constant, variable or function identifier

ChuckHastings · 2022-09-24T04:52:12Z

Good catch, Joseph. Adding the missing functions.

ChuckHastings · 2022-09-24T04:57:08Z

I see another issue, need to fix something else also.

jnke2016 · 2022-09-24T05:15:34Z

Ok thanks. It looks like cugraph_k_core too is missing

ChuckHastings · 2022-09-26T15:16:09Z

rerun tests

ChuckHastings · 2022-09-26T22:24:52Z

@gpucibot merge

Define k-core API and tests

0264144

ChuckHastings requested review from a team as code owners September 21, 2022 17:23

ChuckHastings self-assigned this Sep 21, 2022

ChuckHastings added the 3 - Ready for Review label Sep 21, 2022

ChuckHastings added this to the 22.10 milestone Sep 21, 2022

ChuckHastings added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Sep 21, 2022

fix clang-format issues

cdf5f0a

BradReesWork requested review from seunghwak, naimnv and robertmaynard September 22, 2022 14:24

naimnv reviewed Sep 23, 2022

View reviewed changes

naimnv approved these changes Sep 23, 2022

View reviewed changes

BradReesWork approved these changes Sep 23, 2022

View reviewed changes

seunghwak reviewed Sep 23, 2022

View reviewed changes

add mechanism to create result from core_number for case where python…

650d017

… user deletes the original but needs to recreate

seunghwak approved these changes Sep 23, 2022

View reviewed changes

address PR comments

3d06e3d

ChuckHastings added the DO NOT MERGE Hold off on merging; see PR for details label Sep 24, 2022

rlratzel mentioned this pull request Sep 24, 2022

Added SamplingResult cdef class to return cupy "views" for PLC sampling algos instead of copying result data #2684

Merged

Add a bunch of missing things

2c00479

ChuckHastings requested a review from jnke2016 September 25, 2022 01:19

jnke2016 approved these changes Sep 26, 2022

View reviewed changes

fix clang-format issues

7caaba6

ChuckHastings removed the DO NOT MERGE Hold off on merging; see PR for details label Sep 26, 2022

Merge branch 'branch-22.10' into fea_k_core_api

3f7db87

rapids-bot bot merged commit f0c1e99 into rapidsai:branch-22.10 Sep 26, 2022

ChuckHastings deleted the fea_k_core_api branch December 2, 2022 18:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define k-core API and tests #2712

Define k-core API and tests #2712

ChuckHastings commented Sep 21, 2022

codecov-commenter commented Sep 22, 2022 •

edited

Loading

naimnv Sep 23, 2022

ChuckHastings Sep 23, 2022

naimnv Sep 23, 2022

ChuckHastings Sep 23, 2022

seunghwak Sep 23, 2022

ChuckHastings Sep 23, 2022

seunghwak Sep 23, 2022

seunghwak Sep 23, 2022

ChuckHastings Sep 23, 2022

seunghwak Sep 23, 2022

seunghwak Sep 23, 2022

ChuckHastings Sep 23, 2022

jnke2016 commented Sep 24, 2022 •

edited

Loading

ChuckHastings commented Sep 24, 2022

ChuckHastings commented Sep 24, 2022

jnke2016 commented Sep 24, 2022

ChuckHastings commented Sep 26, 2022

ChuckHastings commented Sep 26, 2022

		template <typename vertex_t, typename edge_t, typename weight_t, bool multi_gpu>
		std::tuple<rmm::device_uvector<vertex_t>,

Define k-core API and tests #2712

Define k-core API and tests #2712

Conversation

ChuckHastings commented Sep 21, 2022

codecov-commenter commented Sep 22, 2022 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jnke2016 commented Sep 24, 2022 • edited Loading

ChuckHastings commented Sep 24, 2022

ChuckHastings commented Sep 24, 2022

jnke2016 commented Sep 24, 2022

ChuckHastings commented Sep 26, 2022

ChuckHastings commented Sep 26, 2022

codecov-commenter commented Sep 22, 2022 •

edited

Loading

jnke2016 commented Sep 24, 2022 •

edited

Loading