We encountered an error while parsing the UMLS (TTL) ontology:
I, [2024-05-08T22:01:02.600753 #1470563] INFO -- : ["Starting to process http://data.bioontology.org/ontologies/MDRGER/submissions/8"]
I, [2024-05-08T22:01:02.606373 #1470563] INFO -- : ["Starting to process MDRGER/submissions/8"]
I, [2024-05-08T22:01:02.801761 #1470563] INFO -- : ["Using UMLS turtle file found, skipping OWLAPI parse"]
E, [2024-05-08T22:01:11.151685 #1470563] ERROR -- : ["Error sending data to triple store - 400 RestClient::BadRequest: MALFORMED DATA: Turtle parser error while parsing an input stream on or around line 500000: Expected mandatory token '.', got 'eof'"]
This problem is related to PR #122, which introduced chunked data loading. The feature fails for TTL files longer than 500000 lines when used with the AllegroGraph triplestore, because of its strict Turtle parser. AllegroGraph expects each load to contain complete Turtle statements terminated by a period (.), but chunked loading splits the file by line count and can cut a statement before its end. We have not tested this with 4store, so a similar issue might exist there.
UMLS ontologies are processed differently from the other types: the .ttl file is loaded into the triplestore instead of owlapi.xrdf.
The chunked load works only for the N-Triples format, not for Turtle.
The fix here is either to skip the chunked load for TTL files, or to chunk them differently: by number of Turtle blocks rather than by number of lines.
We did not run into this bug at AgroPortal, because we don't have UMLS or any other TTL ontology.
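To illustrate chunking by Turtle blocks rather than by line count, here is a minimal Python sketch. It is not the code from PR #122 or from the loader itself. The function name `chunk_turtle` and its `max_lines` parameter are hypothetical. It assumes the layout typical of generated TTL files: each statement ends on a line whose last character is `.`, prefix directives each fit on one line, and only `"""` long literals span lines. It is a heuristic, not a full Turtle parser.

```python
from typing import Iterable, Iterator, List


def chunk_turtle(lines: Iterable[str], max_lines: int = 500_000) -> Iterator[str]:
    """Yield Turtle chunks that are only cut on a statement boundary.

    Hypothetical helper: a chunk may exceed max_lines slightly, because it
    is extended until the current statement is complete.
    """
    prefixes: List[str] = []   # directives, repeated at the top of every chunk
    body: List[str] = []       # statement lines collected for the current chunk
    in_long_literal = False    # True while inside a """...""" literal
    mid_statement = False      # True while a statement is still open

    for raw in lines:
        line = raw.rstrip("\r\n")
        stripped = line.strip()
        was_in_literal = in_long_literal

        # An odd number of triple quotes opens or closes a long literal.
        if line.count('"""') % 2 == 1:
            in_long_literal = not in_long_literal

        # Directives are kept aside so each chunk can be parsed on its own.
        at_top_level = not mid_statement and not was_in_literal
        if at_top_level and stripped.lower().startswith(
            ("@prefix", "@base", "prefix ", "base ")
        ):
            prefixes.append(line)
            continue

        body.append(line)
        if not stripped or stripped.startswith("#"):
            continue  # blank and comment lines do not change statement state

        # Assumption: outside long literals, a trailing '.' closes a statement.
        mid_statement = in_long_literal or not stripped.endswith(".")

        if not mid_statement and len(body) >= max_lines:
            yield "\n".join(prefixes + body) + "\n"
            body = []

    if body:
        yield "\n".join(prefixes + body) + "\n"
```

Each yielded chunk starts with the prefix directives seen so far and ends with a complete statement, so a strict parser should not hit `eof` in the middle of a statement. Known gaps in this sketch: `'''` long literals, comments placed at the end of a statement line, and `@base` redefinitions are not handled.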