This tutorial explains how to preprocess data using the pandas library. Preprocessing is the process of doing a pre-analysis of data, in order to transform them into a standard and normalized format.

Preprocessing involves the following aspects:

  • missing values
  • data standardization
  • data normalization
  • data binning

In this tutorial we deal only with missing values.

You can download the source code of this tutorial as a Jupyter notebook from my Github Data Science Repository.

#data-preprocessing #python #data-science #data-analysis #pandas

Data Preprocessing with Python Pandas — Part 1 Missing Data
1.55 GEEK