Spark Labs - Scala

Welcome to the Spark labs bundle. This is the Scala track.

For the Instructor

Create a lab bundle as follows:

    $ ./assemble-labs.sh

Labs

The instructor will provide the lab bundle.

Download Data

The VMs already have the data loaded. This is for your own reference.

Link to Full Dataset (Note: large download, ~300 MB)

  • Click the link above to download, or
  • use wget from the command line:
    $ wget "https://s3.amazonaws.com/elephantscale-public/data/data.zip"
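
If you download the data outside the VM, the wget step above can be wrapped in a small helper. This is a minimal sketch, not part of the lab bundle: the function name `fetch_data` and the `DEST_DIR` location are assumptions chosen for illustration, and it presumes `wget` and `unzip` are installed.

```shell
#!/bin/sh
# Sketch only: DEST_DIR and fetch_data are hypothetical names,
# not something the lab bundle defines.
DATA_URL="https://s3.amazonaws.com/elephantscale-public/data/data.zip"
DEST_DIR="${DEST_DIR:-$HOME/spark-labs-data}"

fetch_data() {
    mkdir -p "$DEST_DIR" || return 1
    # -c resumes a partial download; -P saves the file under DEST_DIR
    wget -c -P "$DEST_DIR" "$DATA_URL" || return 1
    # -l lists the archive contents without extracting anything
    unzip -l "$DEST_DIR/data.zip"
}
```

Call `fetch_data` after sourcing the snippet. Because the archive is large, resuming with `-c` avoids restarting from scratch if the connection drops.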

Labs

1 - Scala Primer

2 - Spark Intro

3 - Spark Core

4 - DataFrames and Datasets

5 - API

Practice Labs for end of day 2

6 - MLlib

7 - GraphX

8 - Streaming

Structured Streaming

This is the new recommended API for streaming.

Classic Streaming

9 - Operations

10 - Spark and Hadoop (all the Hadoop labs are grouped here)

Practice Labs