Skip to content

Latest commit

 

History

History
40 lines (27 loc) · 1.33 KB

README.md

File metadata and controls

40 lines (27 loc) · 1.33 KB

SparkEC2 docker

This is a dockerized version of the Spark-EC2 script, which will allow you to run the script in Windows (without Cygwin) or from any other environment as long as you have Docker.

Installation

The Docker image is part of the Docker Hub (it can be found here) so you only need to pull the image from docker:

docker pull edrevo/spark-ec2-docker

Usage

The hard way

The manual way of running the docker is with the following command-line:

docker run -t -i -v /c/Users/<% PATH TO YOUR SSH KEY %>:/id_rsa -e AWS_ACCESS_KEY_ID="<% YOUR AWS ID%>" -e AWS_SECRET_ACCESS_KEY="<% YOUR AWS SECRET %>" edrevo/spark-ec2-docker <% ARGUMENTS FOR SCRIPT %>

The easy way

In order to simplify the execution, you can download the batch script, edit it to add your SSH key path and place it in your PATH.

Once you have done that, you can start using the spark-ec2 script directly:

.\spark-ec2.bat launch -s 4 -t c3.4xlarge -r eu-west-1 -k test "my-test-cluster"

The only difference is you don't need to pass the SSH key path or configure any AWS environment variables, since that was already configured in the batch file.