This repo contains the code that helped produce the September HRI report about building permit fraud. We found over 10,000 falsified permits!
Read about our report in the nytimes, politico, and amNY.
make, fish, psql, python3, pip, pyvenv, csvkit, jq, pandoc, bbler, nycdb
See the Makefile for a better sense of what's going on. Note that downloading the data will take WEEKS!
make jobs.csv
This produces a csv of all department of buildings jobs for likely rent-stabilized buildings since 2016. It requires setting up a running instance of nycdb.
make possible_liars.csv
This will generate a list of potential liars by downloading tax bills and job filings pages from city websites. Expect this task to produce LOTS of errors and take many weeks. See bbler for the scripts to parse and download tax bills and job filings.
make liars.csv
Filter the list of liars to those whose permit applications contain falsified or suspicious information.
make liars.zip
This will create a zip file with a folder for each permit containing all the PDFs and documents so each lying landlord can be manually verify as needed.
To build the report make report
To download a list of buildings with lead paint violations: make lead_paint_bbls.csv
. This is used for statistics in the report.
In the folder notebook
there is a Jupiter notebook used to derive the statistics in the report.