Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Factor table: Connected components of university relations #415

Merged
merged 2 commits into from
Aug 17, 2022

Conversation

dszakallas
Copy link
Member

@dszakallas dszakallas commented Aug 13, 2022

This PR attempts to solve #404 with a GraphX implementation.

It generates connected components on the full raw graph without deletes, between persons who attended university in overlapping periods of time.

By ignoring deletes, we guarantee that any two disjoint nodes in the result have been so during its full span.
This means that different components can be used as samples for ldbc/ldbc_snb_bi#77
(a) guaranteed that no path exists

  • Requires spark-graphx. Have to test whether it is provided on EMR: Yes, it is.
  • We could think about introducing a build switch whether to bundle spark-graphx into the jar. Maybe a --with-spark-graphx or --include-spark-graphx. (I am not sure about with as my understanding is that is often used to denote feature switches, and this is not a feature, more like a platform-dependent build directive, that would be used to create the same "whole datagen distribution", no feature is technically added or removed here)

@dszakallas dszakallas requested a review from szarnyasg August 14, 2022 19:20
@dszakallas dszakallas force-pushed the cc-person-uni-knows branch from 667d7c0 to 45fe54d Compare August 15, 2022 22:45
@dszakallas dszakallas force-pushed the cc-person-uni-knows branch from 45fe54d to 153d98b Compare August 16, 2022 21:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants