Lab 01 - Batch & ETL processing of Big Data with Spark SQL

To complete this lab, follow these steps:

Download a ZIP copy of this repository to your local machine and uncompress the contents.
Make sure your cluster is deployed.
Using the Azure Portal, navigate to your deployed cluster.
Select Dashboard and then select Jupyter Notebook.
Log in with the cluster admin credentials.
On the Jupyter home page, select Upload, then browse to the copy of Lab01.ipynb that you downloaded from this repository.
Select Upload.
In the listing, select Lab01.ipynb to open the notebook.
Follow the prompts within the notebook to complete the lab.
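
If you prefer to script the first step, the following is a minimal sketch (not part of the lab itself) that uncompresses the downloaded ZIP and lists the notebooks it contains, so you can find Lab01.ipynb before uploading it. The names `repo.zip` and `lab-files` are placeholders; substitute the actual path of your download and a folder of your choice.

```python
import zipfile
from pathlib import Path


def extract_notebooks(zip_path, dest_dir):
    """Extract a ZIP archive and return the paths of the .ipynb files inside it."""
    dest = Path(dest_dir)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
    # Repository ZIPs usually nest files under a top-level folder,
    # so search the extracted tree recursively.
    return sorted(dest.rglob("*.ipynb"))


if __name__ == "__main__":
    # Placeholder names (assumptions): adjust to your own download location.
    archive_path = Path("repo.zip")
    if archive_path.exists():
        for notebook in extract_notebooks(archive_path, "lab-files"):
            print(notebook)
```

The printed paths show where Lab01.ipynb landed on disk, which is the file to pick in the Jupyter Upload dialog.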

HINT: If you get stuck with the lab, take a look at the provided solution notebook called lab01-complete.ipynb.
