Tanzanian-Water-Pumps-and-a-Data-Revolution Link to Notebook
Derived from a competition put on by Driven-Data: a company that holds international competitions based on statistical modeling in order to gather data for and improve solutions for the common good. In this case the common good is life and sustainability throughout the Eastern Coastal country of Africa "Tanzania"
Using data from Taarifa and the Tanzanian Ministry of Water, can you predict which pumps are functional, which need some repairs, and which don't work at all? This is an intermediate-level practice competition. Predict one of these three classes based on a number of variables about what kind of pump is operating, when it was installed, and how it is managed. A smart understanding of which waterpoints will fail can improve maintenance operations and ensure that clean, potable water is available to communities across Tanzania.
This project began as a basic coding competition to practice my statistical inferencing but, once I started looking further into the issues I became immediately hooked. First and foremost, I had to use my google translate chrome extension for the very first time 😃 but really the true grab at my attention was the DATA REVOLUTION that is occurring all throughout Tanzania. From ward to ward they are literally building something out of (almost less than nothing). From rags to riches their communities were starved for infrastructure remodeling and true-explicit intent for the well-being of its population. With a few cheap drones they started building maps to plot out dangerous flood-zones. The right place to start since everytime the rains came whole regions were being destroyed. The value of the DATA derived from those low-cost fly-overs is turning out be priceless.
Dar Ramani Huria OpenStreetMap Video
Stark proof of their Data-Driven impact thus far:
Plotly project map from my notebook showing an interactive view of the locations of the Tazanian waterpumps and their functional status.
https://colab.research.google.com/drive/1qrB47n1W9erzD2BzXnFYHPp-faIOuLj_#scrollTo=dbdljEmCVpf0
Jul.17.9_30.PM._.Zoom.Earth.-.Google.Chrome.2022-07-26.08-29-10_Trim.mp4
Google Earth Weather Radar: https://zoom.earth/#view=-7.87,36.19,6z/date=2022-07-17,21:30,-4/map=live/overlays=crosshair
https://www.drivendata.org/competitions/7/pump-it-up-data-mining-the-water-table/page/23/
You can source the data and also join the competition if you'd like by following the link above. Or if you are more comfortable with the Kaggle workflow then you can find the data and most of the information here: https://www.kaggle.com/code/danielmartinalarcon/tanzanian-water-pumps/data