Overview

A large amount of football play by play data was published by Wyscout in May 2019 on figshare.

These data are JSON format and it is complicated to process, so I tried to preprocess these data to DataFrame format with Python for later analysis. I’m going to use FIFA World Cup 2018 data. Use Google Colab, so you don’t need to build any dev environment on your laptop.

This is not sports analytics article but TIPS for preprocessing with pandas.

After this process, I visualized passing data. Please see also that article.

#data-science #pandas #data-preprocessing #football #python

Preprocess   FIFA World Cup data with Python
2.25 GEEK