CLI for the Ontogen version control system for RDF datasets
Explore the docs »
Changelog
·
Report Bug
·
Request Feature
·
Discussions
og
is the command-line interface for the Ontogen version control system for RDF datasets in SPARQL triple stores. It provides an easy-to-use interface for version control capabilities similar to Git, but tailored for RDF datasets.
Here's a basic example of how to use the Ontogen CLI:
$ og init --adapter Oxigraph
Initialized empty Ontogen repository in /Users/JohnDoe/example
$ og setup
Set up Ontogen repository
$ og add data.ttl
$ og commit --message "Initial commit"
[(root-commit) 6fc09c94768204983d0409d28e0796ec3f17cef46e57c5cb1248424d3922040d] Initial commit
3 insertions, 0 deletions, 0 overwrites
$ og log --changes
ec8108e3f4 - Initial commit (now) <John Doe [email protected]>
<http://www.example.org/employee38>
+ <http://www.example.org/familyName> "Smith" ;
+ <http://www.example.org/firstName> "John" ;
+ <http://www.example.org/jobTitle> "Assistant Designer" .
For more examples, setup instruction and a command reference, please refer to the User Guide
While Ontogen aims to provide a robust version control system for RDF datasets, it's important to note its current limitations:
- Single Graph Support: The current version only supports versioning of individual graphs within an RDF dataset. Versioning of multi-graph datasets is not yet implemented.
- Cryptic Graph Names: Due to the current implementation, graph names are automatically generated UUID URIs and can not be changed.
- Limited Configuration Updates: There's currently no way to update and sync repository metadata and configuration from the configuration files in the file system with the respective copy in the store, after the initial repository setup.
- Performance with Large Datasets: Ontogen is not yet suitable for versioning large datasets. Adding substantial amounts of data in a single commit can hit query size limits in some triple stores. Additionally, certain queries become prohibitively slow with very large datasets (be sure to use the latest version to at least prevent timeouts).
I'm actively working on addressing these limitations in future versions. The first three points will be addressed during the current follow-up funding period by the NLnet Foundation. For now, Ontogen is best suited for smaller to medium-sized datasets and experimental use.
Marcel Otto - @[email protected] - @MarcelOttoDE - [email protected]
This project is funded through NGI Assure, a fund established by NLnet with financial support from the European Commission's Next Generation Internet program.
JetBrains supports the project with complimentary access to its development environments.
Distributed under the MIT License. See LICENSE.md
for more information.