- If you are using hortonworks sandbox, you can execute command below to get latest notebooks:
curl -sSL https://raw.githubusercontent.com/hortonworks-gallery/zeppelin-notebooks/master/update_all_notebooks.sh | sudo -u zeppelin -E sh
- Please note that if you have changed or created your own notebooks they will be moved to a folder: /opt/incubator-zeppelin/notebook/old_yyyyMMdd-HHmmss, you can copy them back to /opt/incubator-zeppelin/notebook/ directory
Copy the "json" link URL from the table below and paste it into Zeppelin's import from URL tool.
You can add notebooks by making Github pull request, but we request you also update the table below by adding a record for your notebook. To generate the 'view' link for the notebook:
- Uploading the notebook into this repo and obtain the url (it should look like this: https://github.com/hortonworks-gallery/zeppelin-notebooks/blob/master/2A94M5J1Z/note.json)
- Generate a link to preview the notebook by pasting the url from #1 into ZeppelinHub viewer (https://www.zeppelinhub.com/viewer). This will generate a preview page for your notebook (at a url that looks something like https://www.zeppelinhub.com/viewer/notebooks/aHR0cHM6Ly9yYXcuZ2l0aHVidXNlcmNvbnRlbnQuY29tL2hvcnRvbndvcmtzLWdhbGxlcnkvemVwcGVsaW4tbm90ZWJvb2tzL21hc3Rlci8yQTk0TTVKMVovbm90ZS5qc29u/)
- Copy the url from #2 of the notebook Include the preview link under the 'View' link in table below. You can follow examples from existing notebooks for this at the markdown version of README here
Description | Components | Code | View | Author | Comments |
---|---|---|---|---|---|
Starter Bank demo | Spark/SparkSQL | json | view | NFLabs | Default notebook installed by Zeppelin |
Australian Spending dataset | Spark/SparkSQL | json | view | Ned Shawa | |
Australian Spending dataset | Hive | json | view | Ned Shawa | |
Map Earthquake dataset | Spark/SparkSQL | json | view | NFLabs | |
IoT Trucking demo | Spark/SparkSQL | json | view | Nauman Fakhar, Dhruv Kumar | IoT Truck Demo from Hadoop Summit |
Intra-day stock prices | Phoenix | json | view | Ali Bajwa | |
Magellan: Geospatial Analytics Using Spark | Spark | json | view | Ram Sriharsha | Magellan Blog as Zeppelin Notebook. Requires Spark 1.4.1 See here for more details |
Twitter sentiment analysis using Spark Streaming | Spark streaming | json | view | Guilherme Braccialli | |
PySpark tutorial: Analyzing network intrusion dataset with Python and Spark | PySpark | json | view | Saptak Sen | |
Single view demo | Hive/Sqoop/HDF | json | view | Ali Bajwa/Ajay Singh | Notebook version of Single view lab |
Predicting Airline Delays | Spark/MLLib | json | view | Ofer Mendelevitch | Related to this Blog post and part 2 |
Sensors Predictive Analysis | SparkSQL/PySpark | json | view | Vedant Jain | Adding Predictions to this Blog post |
Analyzing Server Web Logs | SparkSQL/PySpark | json | view | Paul Hargis | Parse and Analyze HTTP Web Server Logs (ported from Databricks) |
Hands-on Tour of Apache Spark in 5 min | PySpark | json | view | Robert Hryniewicz | |
Lab 101: Intro to Spark with Python | PySpark | json | view | Robert Hryniewicz | Intro to Spark Core & Spark SQL with Word Count |
Lab 102: Intro to Spark with Scala | Spark SQL | json | view | Robert Hryniewicz | Intro to Spark SQL using Airlines Dataset |
Lab 201: Intro to Machine Learning with Spark | Spark MLlib & ML | json | view | Robert Hryniewicz | Intro to Machine Learning |
HDFS Space Usage Visualization | HDFS / D3.js | json | view | Guilherme Braccialli | A D3.js Sunburst visualization to explore HDFS space utilization and other metadata info, more details |
|