Campus Recruitment: EDA and Classification — Part 1

Campus Recruitment: EDA and Classification — Part 1

Day 13 and 14 of 100 Days of Data Science. Welcome back to my 100 Days of Data Science Challenge Journey. On days 13 and 14, I work on Campus Recruitment Dataset available on Kaggle.

Welcome back to my 100 Days of Data Science Challenge Journey. On days 13 and 14, I work on Campus Recruitment Dataset available on Kaggle.

You can read my previous stories here:


This project is going to be divided into two parts. In this part, I will cover the following topics.

Content:

  • Introduction
  • Loading and Exploring Data
  • Exploring features and relation with the target class

1. Introduction

_Campus placement or campus recruiting is a program conducted within universities or other educational institutions to provide jobs to students nearing completion of their studies. — [Wikipedia_](https://en.wikipedia.org/wiki/Campus_placement)

Problem Statement:

XYZ University wants to build a machine learning model to know whether a student will get placed or not. So that they can provide special attention and help them to get a job. The given dataset can be treated as a classification or regression problem. In this project, I am going to treat this classification problem, where the task is to find whether a candidate will be placed or not. This is a binary classification problem.

2. Loading and Exploring Data

2.1 Loading required libraries and reading the data into Python DataFrame

import numpy as np
import pandas as pd

## data visualization
import matplotlib.pyplot as plt
import seaborn as sns

## setting colors for all graphs
colors = ['#e79c2a','#d54062', '#ebdc87', '#ffa36c']
sns.set_palette(sns.color_palette(colors))

Now it’s time to loading dataset in Python DataFrame. Before that, you have to download the dataset to your system. You can download data from here. Now let’s load data.

data = pd.read_csv("./Placement_Data_Full_Class.csv")

100daysofcode data-analysis data-science python data-visualization

Bootstrap 5 Complete Course with Examples

Bootstrap 5 Tutorial - Bootstrap 5 Crash Course for Beginners

Nest.JS Tutorial for Beginners

Hello Vue 3: A First Look at Vue 3 and the Composition API

Building a simple Applications with Vue 3

Deno Crash Course: Explore Deno and Create a full REST API with Deno

How to Build a Real-time Chat App with Deno and WebSockets

Convert HTML to Markdown Online

HTML entity encoder decoder Online

Data Visualization With Python | Data Visualization | Python For Data Science

🔥To access the slide deck used in this session for Free, click here: https://bit.ly/GetPDF_DataV_P 🔥 Great Learning brings you this live session on 'Data Vis...

Python for Data Science | Data Science With Python | Python Data Science Tutorial

🔥Intellipaat Python for Data Science Course: https://intellipaat.com/python-for-data-science-training/In this python for data science video you will learn e...

Applied Data Science with Python Certification Training Course -IgmGuru

Master Applied Data Science with Python and get noticed by the top Hiring Companies with IgmGuru's Data Science with Python Certification Program. Enroll Now

An introduction to exploratory data analysis in python

Many a time, I have seen beginners in data science skip exploratory data analysis (EDA) and jump straight into building a hypothesis function or model. In my opinion, this should not be the case.

Exploratory Data Analysis is a significant part of Data Science

Data science is omnipresent to advanced statistical and machine learning methods. For whatever length of time that there is data to analyse, the need to investigate is obvious.