Add fourth column for graph name #1337

Merged 19 commits into ad-freiburg:master on Aug 21, 2024

@joka921 joka921 commented May 3, 2024

All six permutations (SPO, SOP, OSP, OPS, PSO, POS) now have an additional column for the graph name. It is the fourth column (before the two columns for the patterns). For now, all internal triples get a special internal graph name, and all other triples belong to the default graph.

IMPORTANT: This changes the index format. It also slows down the indexing by about 15% (which is reasonable, given that the number of columns increased from 5 to 6).
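A minimal sketch of the six-column row layout described above, not the actual QLever code. The names `Id`, `Row`, `makeRow`, `sortRows`, and the two graph constants are hypothetical, and using the graph column as a sort tie-breaker is an assumption that this description does not confirm.

```cpp
#include <algorithm>
#include <array>
#include <cstdint>
#include <tuple>
#include <vector>

// Hypothetical row layout: three permuted triple columns, then the graph ID
// (fourth column), then two pattern columns, for six columns in total.
using Id = std::uint64_t;
using Row = std::array<Id, 6>;

// Hypothetical IDs for the two graphs mentioned in the description.
constexpr Id kDefaultGraph = 0;
constexpr Id kInternalGraph = 1;

// Build a row for the permutation given by `order`, which holds indices
// into {S, P, O}. For example, {1, 2, 0} yields the POS permutation.
Row makeRow(const std::array<Id, 3>& spo, const std::array<int, 3>& order,
            Id graph, Id patternA = 0, Id patternB = 0) {
  return {spo[order[0]], spo[order[1]], spo[order[2]],
          graph, patternA, patternB};
}

// Sort by the three triple columns. Assumption: the graph column acts as a
// tie-breaker; the pattern columns are treated as pure payload.
void sortRows(std::vector<Row>& rows) {
  std::sort(rows.begin(), rows.end(), [](const Row& a, const Row& b) {
    return std::tie(a[0], a[1], a[2], a[3]) <
           std::tie(b[0], b[1], b[2], b[3]);
  });
}
```

With this layout, the same triple stored in two graphs produces two adjacent rows that differ only in the fourth column.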

joka921 added 2 commits May 2, 2024 17:25
This is still very hacky, and we have to integrate all the other things.

codecov bot commented May 3, 2024

Codecov Report

Attention: Patch coverage is 94.81481% with 7 lines in your changes missing coverage. Please review.

Project coverage is 89.36%. Comparing base (8578b34) to head (55f0c83).

Files                             Patch %   Lines
src/index/StxxlSortFunctors.h     76.92%    0 Missing and 3 partials ⚠️
src/engine/IndexScan.cpp          60.00%    1 Missing and 1 partial ⚠️
src/engine/HasPredicateScan.cpp   0.00%     1 Missing ⚠️
src/index/IndexImpl.cpp           98.64%    0 Missing and 1 partial ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1337      +/-   ##
==========================================
- Coverage   89.38%   89.36%   -0.03%     
==========================================
  Files         345      345              
  Lines       24881    24945      +64     
  Branches     3307     3312       +5     
==========================================
+ Hits        22241    22293      +52     
- Misses       1500     1501       +1     
- Partials     1140     1151      +11     


sonarqubecloud bot commented May 3, 2024

Quality Gate passed

Issues
6 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

See analysis details on SonarCloud

@joka921 joka921 marked this pull request as ready for review May 3, 2024 18:39
@hannahbast hannahbast left a comment

1-1 with Johannes, with some minor changes left.

I am amazed at how non-invasive this significant change is!

@hannahbast hannahbast changed the title Add the graph column as an additional payload to the index. It is not yet used. Add fourth column for graph name (not yet used) Aug 19, 2024

@hannahbast hannahbast left a comment

Awesome!

It remains to fix the failing macOS build.


@hannahbast hannahbast left a comment

Another quick round with Johannes + tested this on Wikidata (ca. 15% slower)

@hannahbast hannahbast changed the title Add fourth column for graph name (not yet used) Add fourth column for graph name Aug 21, 2024
@hannahbast hannahbast merged commit 1a4282c into ad-freiburg:master Aug 21, 2024
17 checks passed

joka921 added a commit that referenced this pull request Aug 27, 2024
Add a parser for N-Quads and write the graph label to the new column introduced by #1337. Now `IndexBuilderMain` supports three file types: `nt`, `ttl`, and `nq`.
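To illustrate how an N-Quads statement differs from an N-Triples one, here is a deliberately simplified sketch, not the parser added in that commit. It assumes every term is free of whitespace (for example, plain IRIs) and that the final `.` is separated by a space. `Quad`, `parseSimpleLine`, and `kDefaultGraphName` are hypothetical names.

```cpp
#include <optional>
#include <sstream>
#include <string>
#include <vector>

struct Quad {
  std::string subject, predicate, object, graph;
};

// Hypothetical placeholder for the name of the default graph.
const std::string kDefaultGraphName = "<default-graph>";

// Parse one statement of the form `s p o .` or `s p o g .`.
// Simplification: terms must not contain whitespace, so literals with
// spaces are not handled here.
std::optional<Quad> parseSimpleLine(const std::string& line) {
  std::istringstream in(line);
  std::vector<std::string> tokens;
  for (std::string tok; in >> tok;) tokens.push_back(tok);

  // Every statement must end with a standalone dot.
  if (tokens.empty() || tokens.back() != ".") return std::nullopt;
  tokens.pop_back();

  if (tokens.size() == 3) {
    // No graph label: the statement goes to the default graph.
    return Quad{tokens[0], tokens[1], tokens[2], kDefaultGraphName};
  }
  if (tokens.size() == 4) {
    // The fourth term is the graph label.
    return Quad{tokens[0], tokens[1], tokens[2], tokens[3]};
  }
  return std::nullopt;
}
```

A statement without a fourth term falls back to the default graph, which matches the behaviour described in #1337 for triples without an explicit graph.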
hannahbast pushed a commit that referenced this pull request Sep 27, 2024
This continues work from #1337 (add a column with the graph name to the index), #1444 (add parser for N-Quads), and #1482 (add graph info to block metadata). Queries with `FROM` and/or `GRAPH` with a fixed IRI can now be processed. Processing queries with `FROM NAMED` or `GRAPH` with a variable will be implemented in a future PR.
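Conceptually, restricting a scan to fixed graphs amounts to filtering on the graph column. The sketch below shows only that idea under stated assumptions: `GraphId`, `ScanRow`, and `filterByGraphs` are hypothetical names, and the real implementation may instead use the block metadata from #1482 to skip whole blocks.

```cpp
#include <cstdint>
#include <unordered_set>
#include <vector>

using GraphId = std::uint64_t;

// Hypothetical scan result row: a triple plus the ID of its graph.
struct ScanRow {
  std::uint64_t subject, predicate, object;
  GraphId graph;
};

// Keep only the rows whose graph is one of the graphs selected by the query
// (for example, the fixed IRIs given in `FROM` or `GRAPH` clauses).
std::vector<ScanRow> filterByGraphs(
    const std::vector<ScanRow>& rows,
    const std::unordered_set<GraphId>& activeGraphs) {
  std::vector<ScanRow> result;
  for (const ScanRow& row : rows) {
    if (activeGraphs.count(row.graph) > 0) result.push_back(row);
  }
  return result;
}
```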
@joka921 joka921 deleted the add-graph-column branch December 18, 2024 09:59