diff --git a/README.md b/README.md index e075fe8..7646874 100644 --- a/README.md +++ b/README.md @@ -9,7 +9,6 @@ Here we outline the various tools, demos, and resources that can be used to acce - [Demo Notebooks](#notebook-demos) - [Other resources](#other-resources) - ## Access OBIS data There are several options available to download data from OBIS, some of which include: @@ -19,7 +18,7 @@ There are several options available to download data from OBIS, some of which in - [Full data exports](#full-data-exports) - [OBIS homepage search](https://obis.org/) or [advanced dataset search](https://obis.org/datasets) - [OBIS Mapper](https://mapper.obis.org/) - +- [speciesgrids](#speciesgrids) ### robis @@ -65,7 +64,7 @@ seamount_datasets <- datasets[ grepl(paste(search_terms, collapse = "|"), datasets$abstract, ignore.case = TRUE),] ``` -## Full data exports +### Full data exports [links to jupyter notebook for full data export?] @@ -75,15 +74,19 @@ A full data export of OBIS data is available for download as a Parquet file, [he - The exported file will be a single, flattened Occurrence table - The table includes all provided Event and Occurrence data, as well as 68 fields added by the OBIS Quality Control Pipeline, including taxonomic information obtained from WoRMS -## OBIS homepage +### OBIS homepage From the OBIS homepage, you can search for data in the search bar in the middle of the page. You can search by particular taxonomic groups, common names, dataset names, OBIS nodes, institute name, areas (e.g., Exclusive Economic Zone (EEZ)), or by the data provider’s country. See [here](https://manual.obis.org/access.html#obis-homepage-and-dataset-pages) for more details. -## OBIS Mapper +### OBIS Mapper The [OBIS Mapper](https://mapper.obis.org) lets you visualize and filter OBIS data by taxonomy, location, time, and data quality, with options to combine layers and download them as CSV. For more details, see the [OBIS manual](https://manual.obis.org/access.html#mapper). +### speciesgrids + +[speciesgrids](https://github.com/iobis/speciesgrids) is a Python package to build WoRMS aligned combined OBIS and GBIF species distribution datasets. The resulting dataset is available in a few resolutions on AWS S3. The dataset can be downloaded locally for best performance, or queried directly from the S3 bucket. For more details about downloading and using the dataset, see the [speciesgrids README](https://github.com/iobis/speciesgrids) or the [notebook](notebooks/Python/speciesgrids_demo.ipynb). + ## Notebook demos We have prepared several JupyterHub Notebooks that can be used for reference, see: https://github.com/iobis/hackathon/tree/master/notebooks. The notebooks cover several topics including OBIS data access, data cleaning, environmental information extraction, and data visualization. diff --git a/notebooks/Python/speciesgrids_demo.ipynb b/notebooks/Python/speciesgrids_demo.ipynb new file mode 100644 index 0000000..68bdd15 --- /dev/null +++ b/notebooks/Python/speciesgrids_demo.ipynb @@ -0,0 +1,292 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# speciesgrids demo\n", + "\n", + "This notebook demonstrates querying the speciesgrids data product using geopandas and duckdb. The examples use the `s3://obis-products/speciesgrids/h3_7/` remote datasource as default. To query a local copy instead, use the local file path.\n", + "\n", + "## Regional species list using duckdb\n", + "\n", + "In the example below we use duckdb to query the gridded product with H3 resolution 7 to obtain a regional species list. The [geometry for our region of interest](https://wktmap.com/?b40bc054) is encoded as WKT and used in the duckdb query." + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": {}, + "outputs": [ + { + "data": { + "application/vnd.jupyter.widget-view+json": { + "model_id": "4e208851c92f4f51b4aa4c3bddd86d27", + "version_major": 2, + "version_minor": 0 + }, + "text/plain": [ + "FloatProgress(value=0.0, layout=Layout(width='auto'), style=ProgressStyle(bar_color='black'))" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "data": { + "text/html": [ + "
\n", + " | species | \n", + "records | \n", + "min_year | \n", + "max_year | \n", + "source_obis | \n", + "source_gbif | \n", + "
---|---|---|---|---|---|---|
0 | \n", + "Larus argentatus | \n", + "1118392.0 | \n", + "1837.0 | \n", + "2024.0 | \n", + "True | \n", + "True | \n", + "
1 | \n", + "Larus fuscus | \n", + "84912.0 | \n", + "1964.0 | \n", + "2024.0 | \n", + "True | \n", + "True | \n", + "
2 | \n", + "Larus canus | \n", + "11298.0 | \n", + "1953.0 | \n", + "2024.0 | \n", + "True | \n", + "True | \n", + "
3 | \n", + "Larus ridibundus | \n", + "9528.0 | \n", + "1911.0 | \n", + "2024.0 | \n", + "True | \n", + "True | \n", + "
4 | \n", + "Phalacrocorax carbo | \n", + "8610.0 | \n", + "1910.0 | \n", + "2024.0 | \n", + "True | \n", + "True | \n", + "
... | \n", + "... | \n", + "... | \n", + "... | \n", + "... | \n", + "... | \n", + "... | \n", + "
1620 | \n", + "Wandonia haliotis | \n", + "1.0 | \n", + "2015.0 | \n", + "2015.0 | \n", + "False | \n", + "True | \n", + "
1621 | \n", + "Raja undulata | \n", + "1.0 | \n", + "2000.0 | \n", + "2000.0 | \n", + "False | \n", + "True | \n", + "
1622 | \n", + "Akashiwo sanguinea | \n", + "1.0 | \n", + "2021.0 | \n", + "2021.0 | \n", + "True | \n", + "False | \n", + "
1623 | \n", + "Pelagostrobilidium paraepacrum | \n", + "1.0 | \n", + "2014.0 | \n", + "2014.0 | \n", + "False | \n", + "True | \n", + "
1624 | \n", + "Slabberia halterata | \n", + "1.0 | \n", + "1972.0 | \n", + "1972.0 | \n", + "False | \n", + "True | \n", + "
1625 rows × 6 columns
\n", + "