Before starting any data science project, it is important to pre-process and explore the data. Today we will cover the basics of exploratory data analysis (EDA) in Python and show how even simple EDA can be extremely helpful for preliminary data analysis.

The approach we will follow today is to ask some questions and try to answer them from the data.

We will use the supermarket sales dataset from Kaggle.

1. Familiarize yourself with the dataset and variables:

The dataset consists of historical sales data from 3 branches of a supermarket company over 3 months, from January to March 2019. The following table summarizes the variables and their descriptions.

[Table: summary of the variables and their descriptions]
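A first pass at getting familiar with the variables might look like the sketch below. It assumes pandas is the library in use, which the text so far does not confirm, and it reads a few made-up rows from an in-memory string instead of the real Kaggle file. The column names (`Invoice ID`, `Branch`, `Date`, `Total`) and the helpers `load_sales` and `summarize_variables` are illustrative assumptions, not names taken from the variable table.

```python
import io

import pandas as pd

# Hypothetical sample rows standing in for the Kaggle CSV. The column names
# and values are assumptions for illustration, not the article's data.
SAMPLE_CSV = """Invoice ID,Branch,Date,Total
101,A,1/5/2019,548.97
102,C,3/8/2019,80.22
103,A,3/3/2019,340.53
104,B,1/27/2019,489.05
"""


def load_sales(source):
    """Read the sales data and parse the Date column as datetimes."""
    df = pd.read_csv(source)
    # Dates are assumed to be written as month/day/year.
    df["Date"] = pd.to_datetime(df["Date"], format="%m/%d/%Y")
    return df


def summarize_variables(df):
    """Return one row per variable: dtype, missing count, distinct values."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing": df.isna().sum(),
        "unique": df.nunique(),
    })


sales = load_sales(io.StringIO(SAMPLE_CSV))
summary = summarize_variables(sales)
print(summary)

# Check that the date range matches the period the dataset claims to cover.
print(sales["Date"].min(), sales["Date"].max())
```

With the real file, `load_sales` would be given the path to the downloaded CSV instead of the in-memory string. The per-variable summary is a quick way to cross-check the data against the variable descriptions before asking any questions of it.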


Exploratory Data Analysis using Supermarket Sales Data in Python