Data preprocessing is one of the most important steps in Machine Learning. This step cannot be avoided especially if data is in unstructured form. In this post, I’ll discuss the different steps using Scikit-Learn and Pandas.

“I’m assuming that you have some basic knowledge of Numpy and Pandas. If you don’t know Numpy and Pandas then first learn these topics.”