You can clone this repository as follows
$ git clone [email protected]:elephantscale/hadoop-spark.git
- Dev environment setup
- Hadoop setup
- Spark Shell
- RDDs
- Dataframes
- Hive and Spark
- Spark and YARN
- Spark Applications
- "Learning Spark"
- "Advanced Analytics With Spark"
- "Mastering Apache Spark" (free online book) by Jacek Laskowski
- Hadoop Weekly - weekly digest of Big Data news and tech articles
- Apache Spark
- Spark mailing lists