Skip to content

Latest commit

 

History

History
176 lines (127 loc) · 9.14 KB

README.md

File metadata and controls

176 lines (127 loc) · 9.14 KB

HDInsight Developer's Guide

This guide is intended to provide a curated set of documentation useful to any developer, data scientist or big data engineer getting started or growing their experience with Azure HDInsight.

The delivery goal of this guide is to package this online format into the format of a digital book.

The table of contents follows, links to new content will open in the same window remaining in GitHub, while links to existing content that will soon be merged with this repo will open the Azure Docs.

Overview

Azure HDInsight and Hadoop Architecture

Configuring the Cluster

Configuring Identity and Access Controls

Monitoring and managing the HDInsight cluster

Developing Hive applications

Hive samples

Developing Spark applications

Use Spark with notebooks

Use Spark with IntelliJ

Spark samples

Developing Spark ML applications

Deep Learning with Spark

Developing R scripts on HDInsight

Developing Spark Streaming applications

Optimizing Spark Performance

Use HBase

Use Phoenix with HBase on HDInsight

Apache Open Source Ecosystem

Advanced Scenarios and Deep Dives

Troubleshooting