Organization of utils for loading states using Pentaho and Python, with flexible
configuration. Setup local configuration for paths by copying load_conf.json
into load_conf_local.json
which is ignored in git, and pass it to the loader
with the --config_file
argument.
loader.py
is also the entrypoint to the etl
container in the Docker Compose
file, so if you want to run commands through this you can just add the arguments
to the end. Ex:
Instead of: python loader.py transform -s wa --input_file=wa_test_file.csv
Run: docker-compose run etl transform -s wa --input_file=wa_test_file.csv
- Load dates with
docker-compose run etl dates
- PENDING: Load census data (after creating
dimensionaldata/census.csv
with thegetCensus.py
script) withdocker-compose run etl dimdata -s oh
- Load precincts with
docker-compose run etl precincts -s oh --input_file=test/oh.csv
- Run transformer with
docker-compose run etl transform -s oh --input_file=test/oh.csv
- Load transformed data with
docker-compose run etl load -s oh --input_file=test/oh_output.csv --reporter_key=2