An Elegant Way to Make Data Tell Stories: Exploratory Data Analysis
Data can tell great stories, and getting it to convey the right story is an art. The way to learn this art is exploratory data analysis (EDA). EDA simply means using statistical and probabilistic methods to understand what the data is trying to tell us.
As a data scientist, you will spend a major share of your time understanding the data and selecting only the characteristics that need to be passed to the machine learning model. Only when the input data makes sense can the model perform at its full potential.
One of the really tough things is figuring out what questions to ask. Once you figure out the question, then the answer is relatively easy — Elon Musk
It is often challenging to find the right question when starting from a clean slate. But by constantly asking _Why_, we can understand the behaviour of the data and derive insights.
Now we can dive into some common starting points for EDA. Having been a fan of Pokemon since childhood, I will use the _Pokemon 🌟 dataset_ from Kaggle to walk through the EDA process step by step.
Come let’s catch ’em all :)
Before describing the general steps to perform the EDA, let’s take a look at an important tool.
The pandas library is a fast, powerful and easy-to-use tool built on top of Python. In my personal experience as a data scientist, pandas is my everyday bread and butter. Much of the programming logic can be implemented in just one or two lines of code, which is part of what makes the library so popular. It can handle thousands of rows of data without demanding much computing power. Moreover, the functionality it provides is simple and quite effective.
Even for our Pokemon dataset EDA, we will use pandas both to understand the data and to visualise it.
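To give a feel for what "one or two lines of code" looks like, here is a minimal sketch of a first look at a dataset with pandas. It is only an illustration under stated assumptions: the column names (`Name`, `Type 1`, `HP`, `Attack`) and the file name `pokemon.csv` mentioned in the comment are assumed, and the four rows are a tiny hand-typed stand-in so the snippet runs without downloading anything from Kaggle.

```python
import io

import pandas as pd

# Stand-in for the Kaggle CSV: a few hand-typed rows so the sketch runs anywhere.
# With the real file you would call pd.read_csv("pokemon.csv") instead
# (the file name and the column names here are assumptions).
SAMPLE_CSV = """Name,Type 1,HP,Attack
Bulbasaur,Grass,45,49
Charmander,Fire,39,52
Squirtle,Water,44,48
Pikachu,Electric,35,55
"""

df = pd.read_csv(io.StringIO(SAMPLE_CSV))

# How many rows and columns are there?
print(df.shape)

# What do the first few rows look like?
print(df.head())

# Which type did pandas infer for each column?
print(df.dtypes)

# Summary statistics (count, mean, std, min, quartiles, max) for numeric columns
summary = df.describe()
print(summary)

# Number of missing values per column
missing = df.isnull().sum()
print(missing)
```

Each of these calls is a single line, and together they answer the first questions worth asking of any new dataset: how big it is, what it contains, and whether anything is missing.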