Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs.

#apache-spark #pyspark #python

An Oversimplified Introduction to PySpark for Programmers
1.10 GEEK