
Background Information
Skills and Proficiency:
Python, NumPy, Pandas, Machine Learning Algorithms

Problem:
Can you predict which water pumps are faulty?
Using data from Taarifa and the Tanzanian Ministry of Water, can you predict which pumps are functional, which need some repairs, and which don't work at all? This is an intermediate-level practice competition. Predict one of these three classes based on a number of variables about what kind of pump is operating, when it was installed, and how it is managed. A smart understanding of which waterpoints will fail can improve maintenance operations and ensure that clean, potable water is available to communities across Tanzania.

Process:
Data Cleaning and Wrangling
Creating Submission Pipeline
Applying Base Model
Using an ensemble model (RandomForestClassifier), which gave an accuracy of 0.8189
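
The four steps above can be sketched roughly as follows. This is a minimal illustration, not the code that was actually submitted: the data is a small random stand-in generated by a hypothetical make_toy_data helper, the column names (id, gps_height, construction_year, extraction_type, status_group) and cleaning rules are assumptions, DummyClassifier is an assumed placeholder for the base model, and the RandomForestClassifier settings are illustrative only. Because the labels are random, the accuracies it produces say nothing about the 0.8189 reported here.

```python
import numpy as np
import pandas as pd
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# The three target classes described in the problem statement.
LABELS = ["functional", "functional needs repair", "non functional"]


def make_toy_data(n_rows=300, seed=0):
    """Hypothetical stand-in for the competition files (random values, not real data)."""
    rng = np.random.default_rng(seed)
    return pd.DataFrame({
        "id": np.arange(n_rows),
        "gps_height": rng.integers(0, 2000, n_rows),
        # Assumption: 0 marks an unknown construction year.
        "construction_year": rng.choice([0, 1995, 2005, 2010], n_rows),
        "extraction_type": rng.choice(["gravity", "handpump", None], n_rows),
        "status_group": rng.choice(LABELS, n_rows),
    })


def clean(df):
    """Step 1, data cleaning and wrangling: fill gaps and encode categoricals."""
    out = df.copy()
    year = out["construction_year"].replace(0, np.nan)
    out["construction_year"] = year.fillna(year.median())
    out["extraction_type"] = out["extraction_type"].fillna("unknown")
    return pd.get_dummies(out, columns=["extraction_type"], dtype=int)


def make_submission(model, features, ids):
    """Step 2, submission pipeline: one predicted class per waterpoint id."""
    return pd.DataFrame({
        "id": ids.to_numpy(),
        "status_group": model.predict(features),
    })


data = clean(make_toy_data())
X = data.drop(columns=["id", "status_group"])
y = data["status_group"]
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Step 3, base model: always predicts the most frequent class.
baseline = DummyClassifier(strategy="most_frequent").fit(X_train, y_train)
baseline_acc = baseline.score(X_val, y_val)

# Step 4, ensemble model: a random forest on the same features.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)
forest_acc = forest.score(X_val, y_val)

# Held-out rows stand in for the unlabeled competition test set here.
submission = make_submission(forest, X_val, data.loc[X_val.index, "id"])
# submission.to_csv("submission.csv", index=False)  # file name is an assumption
```

Building the submission function before tuning any model means every later model, from the baseline to the forest, can be scored and exported the same way.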
Result:
I submitted my model for evaluation on drivendata.org, where the competition was hosted. I ranked in the top 10% of all competitors with a model accuracy of 0.8189.