Introduction

Exploratory data analysis is one of the best practices used in data science today. While starting a career in Data Science, people generally don’t know the difference between Data analysis and exploratory data analysis. There is not a very big difference between the two, but both have different purposes.

Exploratory Data Analysis(EDA): Exploratory data analysis is a complement to  inferential statistics, which tends to be fairly rigid with rules and formulas. At an advanced level, EDA involves looking at and describing the data set from different angles and then summarizing it.

Data Analysis: Data Analysis is the statistics and probability to figure out trends in the data set. It is used to show historical data by using some analytics tools. It helps in drilling down the information, to transform metrics, facts, and figures into initiatives for improvement.

Exploratory Data Analysis(EDA)

We will explore a Data set and perform the exploratory data analysis. The major topics to be covered are below:

— Handle Missing value
— Removing duplicates
— Outlier Treatment
— Normalizing and Scaling( Numerical Variables)
— Encoding Categorical variables( Dummy Variables)
— Bivariate Analysis

#data-analysis #statistics #exploratory-data-analysis #data-science #python

Exploratory Data Analysis (EDA) From Scratch in Python
5.70 GEEK