Skip to content

geowynn/Pyspark-1.6-for-Beginners

Repository files navigation

Pyspark-1.6-for-Beginners

Introductory jupyter notebook for Pyspark 1.6


Contents

EDA & Getting to know some basic Pyspark 1.6 commands

Some extra functions similar to SQL and Pandas Aggregation Joining tables

MLLib

Linear Regression

Visualisation

Convert spark df to pandas df to use matplotlib/seaborn

Setting up Spark

Connecting to Jupyter notebook

About

Introductory jupyter notebook for Pyspark 1.6

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published