This repository has been archived by the owner on Sep 28, 2024. It is now read-only.

THIS IS PURE ANCIENT.

This is 4 years old and likely doesn't work anymore. You're of course welcome to fork & update, but I won't be providing support for this. Good luck!

sstableloader-csv

This tool parses CSV files into sstables for Apache Cassandra. It was written for a project that needed to parse CSV files of roughly 75 million rows.

Usage

Run the run.sh script with three arguments: <keyspace> <column> <input>

For example:

   ./run.sh Test Data test.csv

Requirements

  • $CASSANDRA_HOME must be set to the location of the apache-cassandra.jar & associated libraries.
  • $CASSANDRA_CONFIG must be set to the location of your cassandra.yaml config.
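
If you want to fail early when either variable is unset, a small pre-flight check along these lines may help. This is only a sketch: the class name EnvCheckSketch and the method missing are made up for illustration and are not part of this repository.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class EnvCheckSketch {
    // The two variables named in the Requirements list above.
    static final String[] REQUIRED = {"CASSANDRA_HOME", "CASSANDRA_CONFIG"};

    // Returns the names of required variables that are unset or blank.
    static List<String> missing(Map<String, String> env) {
        List<String> out = new ArrayList<>();
        for (String name : REQUIRED) {
            String value = env.get(name);
            if (value == null || value.trim().isEmpty()) {
                out.add(name);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> unset = missing(System.getenv());
        if (!unset.isEmpty()) {
            System.err.println("Missing environment variables: " + unset);
            System.exit(1);
        }
        System.out.println("Environment looks OK.");
    }
}
```

The check only confirms that the variables are set. It does not verify that the paths they point to actually contain the Cassandra jars or config file.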

Note

The number of columns and their names are hard-coded in DataImport.java, so you'll need to change these yourself. There is also no error handling in the parsing, because the data I was handling was known to be error-free.
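
If your data is less clean, a field-count check is probably the first thing to add. The sketch below is not the code from DataImport.java: the class name CsvRowSketch, the method parseLine, and the column names are all hypothetical, and it assumes a plain comma-separated line with no quoted fields.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class CsvRowSketch {
    // Hypothetical column names; the real ones are hard-coded in DataImport.java.
    static final String[] COLUMNS = {"id", "name", "value"};

    // Splits one CSV line on commas and maps each field to its column name.
    // Naive on purpose: quoted fields and embedded commas are not handled.
    static Map<String, String> parseLine(String line) {
        String[] fields = line.split(",", -1); // -1 keeps trailing empty fields
        if (fields.length != COLUMNS.length) {
            throw new IllegalArgumentException(
                "expected " + COLUMNS.length + " fields but got " + fields.length);
        }
        Map<String, String> row = new LinkedHashMap<>();
        for (int i = 0; i < COLUMNS.length; i++) {
            row.put(COLUMNS[i], fields[i].trim());
        }
        return row;
    }

    public static void main(String[] args) {
        System.out.println(parseLine("1,alice,42"));
    }
}
```

Rejecting a malformed line with an exception keeps bad rows out of the generated sstables. For a very large file you may prefer to log the line number and skip the row instead.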