Release Candidate Neo4j-ETL 1.2.0-RC1
Pre-release
Pre-release
After GraphConnect NYC we did a lot of work to fix various bugs and to enhance the ETL tool adding support for all relational databases with a JDBC driver. We did also some backend optimizations and few changes to the UI.
- Multi schema support, the ETL tool now can “parse” more than one schema at time
- Additional driver support. The ETL tool comes with two embedded JDBC drivers (for MySQL and PostgreSQL) but you can set up an external JDBC driver for: Microsoft SQL server, IBM DB2 and Oracle (the list is not limited to these drivers but they are the default ones in the combobox). You can add a jar using “--driver” parameter.
- The resulting mapping file can now be written to a file without output redirection using “--output-mapping-file”.
- Fetch size has been added to a default value of 10,000 records. It will be configurable in future releases
- The ETL tool has undergone additional testing with Microsoft SQL sample databases AdventureWorks and WideWorldImporters and a DB2 sample database in addition to the previous tests
- When importing through cypher all fields are now mapped correctly according to their data type
- First version for parallel generation of the CSV. The concurrency will be configurable in future releases
- Now the ETL tool creates a separate directory for each schema/catalog when writing the CSV files
- Schema names are well-separated from table names. The ETL tools doesn't rely anymore on splitting names with "." in order to separate the schema name from the table name. These changes reflects also on the mapping.json file where the name of the schema is explicitly written.