The Knowledge Graph Creator (kgcreator) is a tool for automating the generation of RDF data for Knowledge Graphs from raw text data read from an input directory path.
The Knowledge Graph Creator creates an output file containing RDF triples suitable for loading into any linked data/semantic web data store.
This Python command line utility is one of the example programs in my book Practical Python Artificial Intelligence Programming that is available on Leanpub.com or available (in the future, this book is under development) to read for free on my web site https://markwatson.com.
Install this tool using pip
:
pip install kgcreator
pip install spacy
python -m spacy download en_core_web_sm
For help, run:
kgcreator --help
kgcreator --inputdir=test_data --outputfile=out.rdf --outputfileneo4j=out.cypher
You can also use:
python -m kgcreator --help
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd kgcreator
python -m venv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
Start by identifying:
- Names of entities (node labels).
- Names of relationships.
- Names of properties for nodes and relationships.
- Constraints to be defined.
- Indexes required.
- The most important queries?