These are various Apache Spark data analysis projects done in Jupyter notebooks. Some of these analyses were conducted on the ODROID XU4 mini cluster, which the more recent ones are being performed on the Personal Compute Cluster. Since the XU4 mini cluster is a significantly constrained system, the projects done there are limited in scope. If you are looking to repeat some of these projects, the Personal Compute Cluster versions are more current.
-
Notifications
You must be signed in to change notification settings - Fork 7
A collection of data analysis projects done using PySpark via Jupyter notebooks.
DIYBigData/spark-data-analysis-projects
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A collection of data analysis projects done using PySpark via Jupyter notebooks.
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published