In this video, I will be showing you how you can use the Vaex Python library that is to handle billion of rows in a matter of seconds. Vaex is an out-of-core DataFrame that works similar to Pandas and allows you to visualize and explore big tabular datasets. Vaex also allow the calculation of statistical parameters such as mean, sum, count, standard deviation, etc, on an N-dimensional grid for more than a billion samples/rows per second.

👉 Code https://github.com/dataprofessor/python/blob/main/vaex.ipynb
👉 Install: pip install vaex.
conda install -c conda-forge vaex
👉 Documentation https://vaex.io/docs/
👉 GitHub https://github.com/vaexio/vaex

🔔 Subscribe:https://www.youtube.com/channel/UCV8e2g4IWQqK71bbzGDEI4Q

#vaex #python #data-science

Vaex - Fast data frame for Data Science (Handle billion rows in seconds)
3.70 GEEK