The strength of a linear relationship between two quantitative variables can be measured using Correlation. It is a statistical method that is very easy in order to calculate and to interpret. It is generally represented by ‘r’ known as the coefficient of correlation.

This is the reason why it is highly misused by professionals because correlation cannot be termed for causation. It is not necessary that if two variables have a correlation then one is dependent on the other and similarly if there is no correlation between two variables it is possible that they might have some relation. This is where PPS(Predictive Power Score) comes into the role.

Predictive Power Score works similar to the coefficient of correlation but has some additional functionalities like:

  • It works on both Linear and Non-Linear Relationships
  • Can be applied to both Numeric and Categorical columns
  • It finds more patterns in the data.

In this article, we will explore how we can use the Predictive Power Score to replace correlation.

Implementation:

PPS is an open-source python library so we will install it like any other python library using pip install ppscore.

  1. Importing required libraries

We will import ppscore along with pandas to load a dataset that we will work on.

import ppscore as pps

import pandas as pd

  1. Loading the Dataset

We will be using different datasets to explore different functionalities of PPS. We will first import an advertising dataset of an MNC which contains the target variable as ‘Sales’ and features like ‘TV’, ‘Radio’, etc.

df = pd.read_csv(‘advertising.csv’)

df.head()

  1. Finding Relation using PPScore

We will use some basic functions defined in ppscore.

  1. Finding the Relationship score

PP Score lies between 0(No Predictive Power) to 1(perfect predictive power), in this step we will find PPScore/Relationship between the target variable and the featured variable in the given dataset.

pps.score(df, "Sales", "TV")

#developers corner #coefficient of correlation #correlation analysis #dependency #heatmap #linear regression #replace correlation #visualization

Effective Way To Replace Correlation With Predictive Power Score(PPS)
3.50 GEEK