Introductory Jupyter notebook for PySpark 1.6
EDA and getting to know some basic PySpark 1.6 commands
Some extra functions similar to SQL and pandas: aggregation, joining tables
MLlib
Linear Regression
Visualisation
Convert a Spark DataFrame to a pandas DataFrame to use matplotlib/seaborn
Setting up Spark
Connecting Spark to a Jupyter notebook