I am a python programmer who started to work with PySpark a couple of months ago and realized that it is quite a “big” deal. I really had to work at collecting and understanding some basic terms in order to write my first program in spark. Ergo, I decided to compile this guide for my peers.
This is a condensed article for starting with PySpark if you are making the jump from Python. It is aimed at answering beginners’ questions and cleaning out some jargon.
#spark #data-science #big-data #python