Please note that this repository has been deprecated, as the data sources it relied on for aggregation have dried up.
You can find the latest aggregated data file at https://raw.githubusercontent.com/Recidiviz/covid19-aggregator/master/output/merged_data.csv.
Create a Python 3 virtualenv:
virtualenv -p python3 .virtualenv
source .virtualenv/bin/activate
Install dependencies:
pip install -r requirements.txt
Fetch the latest file(s) from each source (e.g., covidprisondata.com) and add them to the source's directory
under the data/
directory (e.g., data/covidprisondata.com
).
Note that some sources, like Recidiviz, are fetched automatically from a URL specified in constants.py
.
python src/aggregate.py
Outputs will be put in the output
dir.
This will overwrite the previously committed files in output/
.
Be sure you're in the Python 3 virtualenv, then run:
./run_tests.sh