Mastering Machine Learning – Ep.1: Imputing Missing Values With Scikit-learn

Author: Andrea Giussani, Data Scientist at Cloud Academy

***Participants are strongly encouraged to set up the environment before the webinar starts. A detailed guide can be found at this GitHub repo.***

By some estimates, cleaning data takes up as much as 90% of a data scientist's day. More often than not, data sets are incomplete or incorrect, so a significant amount of time goes into corrective action rather than analysis. We will look at a preprocessing technique called imputation, which fills in missing values with estimates derived from the data that is available.

This session will cover:
– SimpleImputer
– IterativeImputer
– KNNImputer
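
As a rough preview of the three imputers listed above, here is a minimal sketch using scikit-learn's impute module. The toy array and the parameter choices (strategy="mean", n_neighbors=2, random_state=0) are assumptions made purely for illustration; they are not taken from the webinar material. Note that IterativeImputer is still marked experimental in scikit-learn, so it has to be enabled explicitly before it can be imported.

```python
import numpy as np

# IterativeImputer is experimental: this import must come first to enable it.
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer, KNNImputer, SimpleImputer

# Hypothetical toy data: two columns, with one missing value in each.
X = np.array([
    [1.0, 2.0],
    [2.0, np.nan],
    [3.0, 6.0],
    [np.nan, 8.0],
    [5.0, 10.0],
])

# SimpleImputer: replace each missing value with a per-column statistic
# (here the column mean, computed from the observed entries only).
simple = SimpleImputer(strategy="mean")
X_simple = simple.fit_transform(X)

# IterativeImputer: model each feature with missing values as a function
# of the other features, and refine the estimates over several rounds.
iterative = IterativeImputer(random_state=0)
X_iterative = iterative.fit_transform(X)

# KNNImputer: fill each missing value with the average of that feature
# over the nearest rows, measured on the features both rows have observed.
knn = KNNImputer(n_neighbors=2)
X_knn = knn.fit_transform(X)

for name, result in [
    ("SimpleImputer", X_simple),
    ("IterativeImputer", X_iterative),
    ("KNNImputer", X_knn),
]:
    print(name)
    print(result)
```

All three follow the usual scikit-learn fit/transform pattern, so in practice the imputer would be fitted on the training data only and then applied to the test data with transform.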