How to clean and analyze anonymized, self-reported salary data from Google Sheets into Python/Jupyter Notebook.

I recently was told about self-reported, anonymous salary information and so was curious to analyze these data and see what patterns emerge.

The increased transparency promoted by the sharing of this information publicly is beneficial for understanding and avoiding pay inequities experienced by women and people of color. These data can be looked through to find one’s age group, level of experience, and similar roles, to see what level of pay they can expect. One can use this information to determine if they are being underpaid, or to choose what career path might be most worth getting in to.

In this article, I’ll work through the first step of the data science pipeline, data cleaning, and end with a couple of visualizations to understand these data better. I’ll end with some future next steps for these data.

#python #excel #data-science #developer

Exploring Salary Data Trends in Excel and Python
1.75 GEEK