Skip to content
This repository has been archived by the owner on Oct 24, 2024. It is now read-only.

Example datatree for use in tutorial documentation #100

Closed
6 tasks
Tracked by #61
TomNicholas opened this issue May 25, 2022 · 9 comments
Closed
6 tasks
Tracked by #61

Example datatree for use in tutorial documentation #100

TomNicholas opened this issue May 25, 2022 · 9 comments
Assignees
Labels
documentation Improvements or additions to documentation help wanted Extra attention is needed

Comments

@TomNicholas
Copy link
Member

TomNicholas commented May 25, 2022

What would help me enormously with writing documentation would be a killer example datatree, which I could open and use to demonstrate use of all types of methods. Just like we have the "air_temperature" example dataset used in the main xarray documentation.

To be as useful as possible, this example tree should hit a few criteria:

  • Nested - there needs to be some reason why you wouldn't just use a Dataset to organise this data. Multiple resolutions is a simple reason, but it also should be >1 level deep.
  • Common coordinates - it should have a least one common coordinate stored closer to the root of the tree. For example a reference normalisation value of some quantity, or perhaps some grid-related information that applies to the data in multiple sub-groups.
  • Heterogenous data - there is no restriction on the relationship between data in different nodes, so we should demonstrate this by storing data that is as different as possible (but still somehow related). I'm thinking maybe some demographic data vs geographical, or model data vs observational.
  • Small - though we would download this with pooch instead of uploading the data files in the repo, we still want this to be small enough that we don't cause problems when building or viewing our docs.
  • Multidimensional - the data stored in the leaves needs to have enough dimensions so that I can reduce/aggregate it and still have something interesting left to plot.
  • Recognisable - Ideally it would contain some relatable data. The existing Dataset example is nice because you can immediately see you are looking at a (low-resolution) map of North America. Maybe a satellite image of Manhattan Island or something?

A really good inspiration is this pseudo-structure provided in pydata/xarray#4118:

image

This would hit all of the criteria above, if it actually existed somewhere I could find!

What I would like is for people who have more familiarity with real geo-science data products to help me make this killer example tree, or at least point me towards data that I might use.

If we have multiple good suggestions I could make multiple different examples to use, but I think I would prefer one really good one to multiple quite good ones. Alternatively any extras could end up getting used for some future example notebooks though.

@jhamman @jbusecke @rabernat @alexamici @joshmoore

@TomNicholas TomNicholas mentioned this issue May 25, 2022
14 tasks
@TomNicholas TomNicholas changed the title A tutorial datatree, similar to xarray's "air_temperature" tutorial dataset would be useful. This could be as simple as the air_temperature dataset with some other variables added which live in different groups. Example datatree for use in tutorial documentation May 25, 2022
@joshmoore
Copy link

I defer to @thewtex though I assume that "small" might be an issue :)

@TomNicholas TomNicholas added documentation Improvements or additions to documentation help wanted Extra attention is needed labels May 26, 2022
@TomNicholas
Copy link
Member Author

@andersy005 @jhamman just a ping to say that if either of you can help me get some data like this I could really push forward with writing cool documentation for datatree!

@andersy005
Copy link
Member

@TomNicholas, i have this on my today's TODO list. I should have a few samples ready by the end of the day.

@joshmoore
Copy link

@andersy005
Copy link
Member

@TomNicholas, i now have two datasets (one from CMIP6 and another one from CESM2 Large Ensemble)

CESM2 Large Ensemble
DataTree('None', parent=None)
├── DataTree('ocn')
│   ├── DataTree('historical')
│   │   └── DataTree('monthly')
│   │       ├── DataTree('smbb')
│   │       │       Dimensions:     (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2)
│   │       │       Coordinates:
│   │       │         * member_id   (member_id) <U12 'r11i1231p1f2'
│   │       │         * time        (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00
│   │       │           time_bound  (time, d2) object dask.array<chunksize=(6, 2), meta=np.ndarray>
│   │       │         * z_t         (z_t) float32 500.0
│   │       │       Dimensions without coordinates: nlat, nlon, d2
│   │       │       Data variables:
│   │       │           O2          (member_id, time, z_t, nlat, nlon) float32 dask.array<chunksize=(1, 6, 1, 384, 320), meta=np.ndarray>
│   │       │       Attributes:
│   │       │           Conventions:             CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf...
│   │       │           calendar:                All years have exactly  365 days.
│   │       │           cell_methods:            cell_methods = time: mean ==> the variable value...
│   │       │           contents:                Diagnostic and Prognostic Variables
│   │       │           model_doi_url:           https://doi.org/10.5065/D67H1H0V
│   │       │           revision:                $Id$
│   │       │           source:                  CCSM POP2, the CCSM Ocean Component
│   │       │           time_period_freq:        month_1
│   │       │           intake_esm_dataset_key:  ocn/historical/monthly/smbb
│   │       └── DataTree('cmip6')
│   │               Dimensions:     (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2)
│   │               Coordinates:
│   │                 * member_id   (member_id) <U12 'r10i1181p1f1'
│   │                 * time        (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00
│   │                   time_bound  (time, d2) object dask.array<chunksize=(6, 2), meta=np.ndarray>
│   │                 * z_t         (z_t) float32 500.0
│   │               Dimensions without coordinates: nlat, nlon, d2
│   │               Data variables:
│   │                   O2          (member_id, time, z_t, nlat, nlon) float32 dask.array<chunksize=(1, 6, 1, 384, 320), meta=np.ndarray>
│   │               Attributes:
│   │                   Conventions:             CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf...
│   │                   calendar:                All years have exactly  365 days.
│   │                   cell_methods:            cell_methods = time: mean ==> the variable value...
│   │                   contents:                Diagnostic and Prognostic Variables
│   │                   model_doi_url:           https://doi.org/10.5065/D67H1H0V
│   │                   revision:                $Id$
│   │                   source:                  CCSM POP2, the CCSM Ocean Component
│   │                   start_time:              This dataset was created on 2020-07-18 at 07:26:...
│   │                   time_period_freq:        month_1
│   │                   intake_esm_dataset_key:  ocn/historical/monthly/cmip6
│   └── DataTree('ssp370')
│       └── DataTree('monthly')
│           ├── DataTree('smbb')
│           │       Dimensions:     (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2)
│           │       Coordinates:
│           │         * member_id   (member_id) <U12 'r11i1231p1f2'
│           │         * time        (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00
│           │           time_bound  (time, d2) object dask.array<chunksize=(6, 2), meta=np.ndarray>
│           │         * z_t         (z_t) float32 500.0
│           │       Dimensions without coordinates: nlat, nlon, d2
│           │       Data variables:
│           │           O2          (member_id, time, z_t, nlat, nlon) float32 dask.array<chunksize=(1, 6, 1, 384, 320), meta=np.ndarray>
│           │       Attributes:
│           │           Conventions:             CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf...
│           │           calendar:                All years have exactly  365 days.
│           │           cell_methods:            cell_methods = time: mean ==> the variable value...
│           │           contents:                Diagnostic and Prognostic Variables
│           │           model_doi_url:           https://doi.org/10.5065/D67H1H0V
│           │           revision:                $Id$
│           │           source:                  CCSM POP2, the CCSM Ocean Component
│           │           time_period_freq:        month_1
│           │           intake_esm_dataset_key:  ocn/ssp370/monthly/smbb
│           └── DataTree('cmip6')
│                   Dimensions:     (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2)
│                   Coordinates:
│                     * member_id   (member_id) <U12 'r10i1181p1f1'* time        (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00time_bound  (time, d2) object dask.array<chunksize=(6, 2), meta=np.ndarray>* z_t         (z_t) float32 500.0Dimensions without coordinates: nlat, nlon, d2Data variables:
│                       O2          (member_id, time, z_t, nlat, nlon) float32 dask.array<chunksize=(1, 6, 1, 384, 320), meta=np.ndarray>Attributes:
│                       Conventions:             CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf...
│                       calendar:                All years have exactly  365 days.
│                       cell_methods:            cell_methods = time: mean ==> the variable value...
│                       contents:                Diagnostic and Prognostic Variablesmodel_doi_url:           https://doi.org/10.5065/D67H1H0Vrevision:                $Id$
│                       source:                  CCSM POP2, the CCSM Ocean Componenttime_period_freq:        month_1intake_esm_dataset_key:  ocn/ssp370/monthly/cmip6
└── DataTree('atm')
    ├── DataTree('historical')
    │   └── DataTree('monthly')
    │       ├── DataTree('cmip6')
    │       │       Dimensions:    (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2)
    │       │       Coordinates:
    │       │         * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0
    │       │         * lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8
    │       │         * member_id  (member_id) <U12 'r10i1181p1f1'
    │       │         * time       (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00
    │       │           time_bnds  (time, nbnd) object dask.array<chunksize=(6, 2), meta=np.ndarray>
    │       │       Dimensions without coordinates: nbnd
    │       │       Data variables:
    │       │           PRECC      (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>
    │       │           TREFHT     (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>
    │       │       Attributes:
    │       │           time_period_freq:        month_1
    │       │           model_doi_url:           https://doi.org/10.5065/D67H1H0V
    │       │           Conventions:             CF-1.0
    │       │           NCO:                     netCDF Operators version 4.9.4 (Homepage = http:...
    │       │           logname:                 sunseon
    │       │           topography_file:         /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f...
    │       │           source:                  CAM
    │       │           intake_esm_dataset_key:  atm/historical/monthly/cmip6
    │       └── DataTree('smbb')
    │               Dimensions:    (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2)
    │               Coordinates:
    │                 * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0* lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8* member_id  (member_id) <U12 'r10i1191p1f2'* time       (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00time_bnds  (time, nbnd) object dask.array<chunksize=(6, 2), meta=np.ndarray>Dimensions without coordinates: nbndData variables:
    │                   PRECC      (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>TREFHT     (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>Attributes:
    │                   time_period_freq:        month_1model_doi_url:           https://doi.org/10.5065/D67H1H0VConventions:             CF-1.0logname:                 sunseontopography_file:         /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f...
    │                   source:                  CAMintake_esm_dataset_key:  atm/historical/monthly/smbb
    └── DataTree('ssp370')
        └── DataTree('monthly')
            ├── DataTree('cmip6')
            │       Dimensions:    (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2)
            │       Coordinates:
            │         * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0* lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8* member_id  (member_id) <U12 'r10i1181p1f1'* time       (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00time_bnds  (time, nbnd) object dask.array<chunksize=(6, 2), meta=np.ndarray>Dimensions without coordinates: nbndData variables:
            │           PRECC      (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>TREFHT     (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>Attributes:
            │           time_period_freq:        month_1model_doi_url:           https://doi.org/10.5065/D67H1H0VConventions:             CF-1.0host:                    mom1logname:                 sunseontopography_file:         /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f...
            │           source:                  CAMintake_esm_dataset_key:  atm/ssp370/monthly/cmip6
            └── DataTree('smbb')
                    Dimensions:    (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2)
                    Coordinates:
                      * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0
                      * lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8
                      * member_id  (member_id) <U12 'r10i1191p1f2'
                      * time       (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00
                        time_bnds  (time, nbnd) object dask.array<chunksize=(6, 2), meta=np.ndarray>
                    Dimensions without coordinates: nbnd
                    Data variables:
                        PRECC      (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>
                        TREFHT     (member_id, time, lat, lon) float32 dask.array<chunksize=(1, 6, 192, 288), meta=np.ndarray>
                    Attributes:
                        time_period_freq:        month_1
                        model_doi_url:           https://doi.org/10.5065/D67H1H0V
                        Conventions:             CF-1.0
                        logname:                 sunseon
                        topography_file:         /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f...
                        source:                  CAM
                        intake_esm_dataset_key:  atm/ssp370/monthly/smbb
CMIP6
DataTree('None', parent=None)
├── DataTree('ScenarioMIP')
│   ├── DataTree('MIROC')
│   │   └── DataTree('MIROC6')
│   │       └── DataTree('ssp370')
│   │           ├── DataTree('Lmon')
│   │           │   └── DataTree('gn')
│   │           │           Dimensions:    (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6)
│   │           │           Coordinates:
│   │           │             * lat        (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93
│   │           │               lat_bnds   (lat, bnds) float64 ...
│   │           │             * lon        (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6
│   │           │               lon_bnds   (lon, bnds) float64 ...
│   │           │             * time       (time) datetime64[ns] 2015-01-16T12:00:00 ... 2015-06-16
│   │           │               time_bnds  (time, bnds) datetime64[ns] ...
│   │           │             * member_id  (member_id) object 'r1i1p1f1'
│   │           │           Dimensions without coordinates: bnds
│   │           │           Data variables:
│   │           │               mrso       (member_id, time, lat, lon) float32 ...
│   │           │           Attributes: (12/48)
│   │           │               Conventions:             CF-1.7 CMIP-6.2
│   │           │               activity_id:             ScenarioMIP AerChemMIP
│   │           │               branch_method:           standard
│   │           │               branch_time_in_child:    60265.0
│   │           │               branch_time_in_parent:   60265.0
│   │           │               cmor_version:            3.4.0
│   │           │               ...                      ...
│   │           │               variable_id:             mrso
│   │           │               variant_label:           r1i1p1f1
│   │           │               status:                  2019-10-29;created;by nhn2@columbia.edu
│   │           │               netcdf_tracking_ids:     hdl:21.14100/3ba01dc3-ab7e-45d0-882a-66ed2768a642
│   │           │               version_id:              v20190627
│   │           │               intake_esm_dataset_key:  ScenarioMIP/MIROC/MIROC6/ssp370/Lmon/gn
│   │           ├── DataTree('Amon')
│   │           │   └── DataTree('gn')
│   │           │           Dimensions:    (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6)
│   │           │           Coordinates:
│   │           │             * lat        (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93
│   │           │               lat_bnds   (lat, bnds) float64 ...
│   │           │             * lon        (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6
│   │           │               lon_bnds   (lon, bnds) float64 ...
│   │           │             * time       (time) datetime64[ns] 2015-01-16T12:00:00 ... 2015-06-16
│   │           │               time_bnds  (time, bnds) datetime64[ns] ...
│   │           │             * member_id  (member_id) object 'r1i1p1f1'
│   │           │           Dimensions without coordinates: bnds
│   │           │           Data variables:
│   │           │               pr         (member_id, time, lat, lon) float32 ...
│   │           │           Attributes: (12/48)
│   │           │               Conventions:             CF-1.7 CMIP-6.2
│   │           │               activity_id:             ScenarioMIP AerChemMIP
│   │           │               branch_method:           standard
│   │           │               branch_time_in_child:    60265.0
│   │           │               branch_time_in_parent:   60265.0
│   │           │               cmor_version:            3.4.0
│   │           │               ...                      ...
│   │           │               variable_id:             pr
│   │           │               variant_label:           r1i1p1f1
│   │           │               status:                  2019-10-25;created;by nhn2@columbia.edu
│   │           │               netcdf_tracking_ids:     hdl:21.14100/c23c415d-adca-4e01-8e7c-11617bcfa2bb
│   │           │               version_id:              v20190627
│   │           │               intake_esm_dataset_key:  ScenarioMIP/MIROC/MIROC6/ssp370/Amon/gn
│   │           └── DataTree('Omon')
│   │               └── DataTree('gn')
│   │                       Dimensions:             (y: 256, x: 360, time: 6, lev: 1, bnds: 2,
│   │                                                member_id: 1, vertices: 4)
│   │                       Coordinates: (12/13)
│   │                           latitude            (y, x) float32 ...
│   │                         * lev                 (lev) float64 1.0
│   │                           lev_bnds            (lev, bnds) float64 ...
│   │                           longitude           (y, x) float32 ...
│   │                           sigma_bnds          (lev, bnds) float64 ...
│   │                         * time                (time) datetime64[ns] 2015-01-16T12:00:00 ... 2015-06-16
│   │                           ...                  ...
│   │                         * x                   (x) float64 0.5 1.5 2.5 3.5 ... 356.5 357.5 358.5 359.5
│   │                           x_bnds              (x, bnds) float64 ...
│   │                         * y                   (y) float64 -88.0 -85.75 -85.25 ... 148.6 150.5 152.4
│   │                           y_bnds              (y, bnds) float64 ...
│   │                           zlev_bnds           (lev, bnds) float64 ...
│   │                         * member_id           (member_id) object 'r1i1p1f1'
│   │                       Dimensions without coordinates: bnds, vertices
│   │                       Data variables:
│   │                           depth               (y, x) float32 ...
│   │                           depth_c             float64 ...
│   │                           eta                 (time, y, x) float32 ...
│   │                           nsigma              int32 ...
│   │                           sigma               (lev) float64 ...
│   │                           thetao              (member_id, time, lev, y, x) float32 ...
│   │                           vertices_latitude   (y, x, vertices) float32 ...
│   │                           vertices_longitude  (y, x, vertices) float32 ...
│   │                           zlev                (lev) float64 ...
│   │                       Attributes: (12/48)
│   │                           Conventions:             CF-1.7 CMIP-6.2
│   │                           activity_id:             ScenarioMIP AerChemMIP
│   │                           branch_method:           standard
│   │                           branch_time_in_child:    60265.0
│   │                           branch_time_in_parent:   60265.0
│   │                           cmor_version:            3.4.0
│   │                           ...                      ...
│   │                           variable_id:             thetao
│   │                           variant_label:           r1i1p1f1
│   │                           status:                  2019-11-18;created;by nhn2@columbia.edu
│   │                           netcdf_tracking_ids:     hdl:21.14100/99dda520-c9e9-4617-b4ca-0de0a2b9398...
│   │                           version_id:              v20190627
│   │                           intake_esm_dataset_key:  ScenarioMIP/MIROC/MIROC6/ssp370/Omon/gn
│   ├── DataTree('CCCma')
│   │   └── DataTree('CanESM5')
│   │       └── DataTree('ssp370')
│   │           ├── DataTree('Amon')
│   │           │   └── DataTree('gn')
│   │           │           Dimensions:    (lat: 64, bnds: 2, lon: 128, member_id: 1, time: 6)
│   │           │           Coordinates:
│   │           │             * lat        (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86
│   │           │               lat_bnds   (lat, bnds) float64 ...
│   │           │             * lon        (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2
│   │           │               lon_bnds   (lon, bnds) float64 ...
│   │           │             * time       (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00
│   │           │               time_bnds  (time, bnds) object ...
│   │           │             * member_id  (member_id) object 'r1i1p1f1'
│   │           │           Dimensions without coordinates: bnds
│   │           │           Data variables:
│   │           │               pr         (member_id, time, lat, lon) float32 ...
│   │           │           Attributes: (12/57)
│   │           │               CCCma_model_hash:            1f91f92cb6d607391f44831504025d32fc44faa1
│   │           │               CCCma_parent_runid:          rc3.1-his01
│   │           │               CCCma_pycmor_hash:           33c30511acc319a98240633965a04ca99c26427e
│   │           │               CCCma_runid:                 rc3.1-s7001
│   │           │               Conventions:                 CF-1.7 CMIP-6.2
│   │           │               YMDH_branch_time_in_child:   2015:01:01:00
│   │           │               ...                          ...
│   │           │               tracking_id:                 hdl:21.14100/8c4a1496-f308-493e-8ecc-a2e253e...
│   │           │               variable_id:                 pr
│   │           │               variant_label:               r1i1p1f1
│   │           │               version:                     v20190429
│   │           │               version_id:                  v20190429
│   │           │               intake_esm_dataset_key:      ScenarioMIP/CCCma/CanESM5/ssp370/Amon/gn
│   │           ├── DataTree('Lmon')
│   │           │   └── DataTree('gn')
│   │           │           Dimensions:    (member_id: 1, time: 6, lat: 64, lon: 128, bnds: 2)
│   │           │           Coordinates:
│   │           │             * lat        (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86
│   │           │               lat_bnds   (lat, bnds) float64 ...
│   │           │             * lon        (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2
│   │           │               lon_bnds   (lon, bnds) float64 ...
│   │           │             * time       (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00
│   │           │               time_bnds  (time, bnds) object ...
│   │           │             * member_id  (member_id) object 'r1i1p1f1'
│   │           │           Dimensions without coordinates: bnds
│   │           │           Data variables:
│   │           │               gpp        (member_id, time, lat, lon) float32 ...
│   │           │               mrso       (member_id, time, lat, lon) float32 ...
│   │           │           Attributes: (12/53)
│   │           │               source:                      CanESM5 (2019): \naerosol: interactive\natmo...
│   │           │               source_type:                 AOGCM
│   │           │               institution:                 Canadian Centre for Climate Modelling and An...
│   │           │               YMDH_branch_time_in_child:   2015:01:01:00
│   │           │               initialization_index:        1
│   │           │               product:                     model-output
│   │           │               ...                          ...
│   │           │               parent_time_units:           days since 1850-01-01 0:0:0.0
│   │           │               parent_variant_label:        r1i1p1f1
│   │           │               realization_index:           1
│   │           │               parent_activity_id:          CMIP
│   │           │               branch_time_in_parent:       60225.0
│   │           │               intake_esm_dataset_key:      ScenarioMIP/CCCma/CanESM5/ssp370/Lmon/gn
│   │           └── DataTree('Omon')
│   │               └── DataTree('gn')
│   │                       Dimensions:             (i: 360, j: 291, lev: 1, bnds: 2, member_id: 1,
│   │                                                time: 6, vertices: 4)
│   │                       Coordinates:
│   │                         * i                   (i) int32 0 1 2 3 4 5 6 ... 353 354 355 356 357 358 359
│   │                         * j                   (j) int32 0 1 2 3 4 5 6 ... 284 285 286 287 288 289 290
│   │                           latitude            (j, i) float64 ...
│   │                         * lev                 (lev) float64 3.047
│   │                           lev_bnds            (lev, bnds) float64 ...
│   │                           longitude           (j, i) float64 ...
│   │                         * time                (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:0...
│   │                           time_bnds           (time, bnds) object ...
│   │                         * member_id           (member_id) object 'r1i1p1f1'
│   │                       Dimensions without coordinates: bnds, vertices
│   │                       Data variables:
│   │                           no3                 (member_id, time, lev, j, i) float32 ...
│   │                           vertices_latitude   (j, i, vertices) float64 ...
│   │                           vertices_longitude  (j, i, vertices) float64 ...
│   │                           thetao              (member_id, time, lev, j, i) float32 ...
│   │                       Attributes: (12/52)
│   │                           source:                      CanESM5 (2019): \naerosol: interactive\natmo...
│   │                           source_type:                 AOGCM
│   │                           institution:                 Canadian Centre for Climate Modelling and An...
│   │                           YMDH_branch_time_in_child:   2015:01:01:00
│   │                           initialization_index:        1
│   │                           product:                     model-output
│   │                           ...                          ...
│   │                           parent_time_units:           days since 1850-01-01 0:0:0.0
│   │                           parent_variant_label:        r1i1p1f1
│   │                           realization_index:           1
│   │                           parent_activity_id:          CMIP
│   │                           branch_time_in_parent:       60225.0
│   │                           intake_esm_dataset_key:      ScenarioMIP/CCCma/CanESM5/ssp370/Omon/gn
│   └── DataTree('NCAR')
│       └── DataTree('CESM2-WACCM')
│           └── DataTree('ssp370')
│               ├── DataTree('Lmon')
│               │   └── DataTree('gn')
│               │           Dimensions:    (lat: 192, lon: 288, time: 6, member_id: 1, hist_interval: 2)
│               │           Coordinates:
│               │             * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0
│               │             * lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8
│               │             * time       (time) object 2015-01-15 11:45:00 ... 2015-05-15 12:00:00
│               │             * member_id  (member_id) object 'r1i1p1f1'
│               │               lat_bnds   (lat, hist_interval) float32 ...
│               │               lon_bnds   (lon, hist_interval) float32 ...
│               │               time_bnds  (time, hist_interval) object ...
│               │           Dimensions without coordinates: hist_interval
│               │           Data variables:
│               │               gpp        (member_id, time, lat, lon) float32 ...
│               │               mrso       (member_id, time, lat, lon) float32 ...
│               │           Attributes: (12/45)
│               │               source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
│               │               source_type:             AOGCM BGC CHEM AER
│               │               institution:             National Center for Atmospheric Research, Climat...
│               │               initialization_index:    1
│               │               product:                 model-output
│               │               branch_method:           standard
│               │               ...                      ...
│               │               parent_time_units:       days since 0001-01-01 00:00:00
│               │               parent_variant_label:    r1i1p1f1
│               │               realization_index:       1
│               │               parent_activity_id:      CMIP
│               │               branch_time_in_parent:   735110.0
│               │               intake_esm_dataset_key:  ScenarioMIP/NCAR/CESM2-WACCM/ssp370/Lmon/gn
│               ├── DataTree('Amon')
│               │   └── DataTree('gn')
│               │           Dimensions:    (member_id: 1, time: 6, plev: 1, lat: 192, lon: 288, nbnd: 2)
│               │           Coordinates:
│               │             * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0
│               │               lat_bnds   (lat, nbnd) float64 ...
│               │             * lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8
│               │               lon_bnds   (lon, nbnd) float64 ...
│               │             * plev       (plev) float64 1e+05
│               │             * time       (time) object 2015-01-15 12:00:00 ... 2015-06-15 00:00:00
│               │               time_bnds  (time, nbnd) object ...
│               │             * member_id  (member_id) object 'r1i1p1f1'
│               │           Dimensions without coordinates: nbnd
│               │           Data variables:
│               │               co2        (member_id, time, plev, lat, lon) float32 ...
│               │               pr         (member_id, time, lat, lon) float32 ...
│               │           Attributes: (12/45)
│               │               source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
│               │               source_type:             AOGCM BGC CHEM AER
│               │               institution:             National Center for Atmospheric Research, Climat...
│               │               initialization_index:    1
│               │               product:                 model-output
│               │               branch_method:           standard
│               │               ...                      ...
│               │               parent_time_units:       days since 0001-01-01 00:00:00
│               │               parent_variant_label:    r1i1p1f1
│               │               realization_index:       1
│               │               parent_activity_id:      CMIP
│               │               branch_time_in_parent:   735110.0
│               │               intake_esm_dataset_key:  ScenarioMIP/NCAR/CESM2-WACCM/ssp370/Amon/gn
│               └── DataTree('Omon')
│                   └── DataTree('gn')
│                           Dimensions:    (nlat: 384, nlon: 320, vertices: 4, lev: 1, d2: 2, member_id: 1,
│                                           time: 6)
│                           Coordinates:
│                               lat        (nlat, nlon) float64 ...
│                               lat_bnds   (nlat, nlon, vertices) float32 ...
│                             * lev        (lev) float64 500.0lev_bnds   (lev, d2) float32 ...
│                               lon        (nlat, nlon) float64 ...
│                               lon_bnds   (nlat, nlon, vertices) float32 ...
│                             * nlat       (nlat) int32 1 2 3 4 5 6 7 8 ... 377 378 379 380 381 382 383 384* nlon       (nlon) int32 1 2 3 4 5 6 7 8 ... 313 314 315 316 317 318 319 320* time       (time) object 2015-01-15 13:00:00.000007 ... 2015-06-15 00:00:00time_bnds  (time, d2) object ...
│                             * member_id  (member_id) object 'r1i1p1f1'Dimensions without coordinates: vertices, d2Data variables:
│                               no3        (member_id, time, lev, nlat, nlon) float32 ...
│                               thetao     (member_id, time, lev, nlat, nlon) float32 ...
│                           Attributes: (12/44)
│                               source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
│                               source_type:             AOGCM BGC CHEM AERinstitution:             National Center for Atmospheric Research, Climat...
│                               initialization_index:    1product:                 model-outputbranch_method:           standard
│                               ...                      ...
│                               parent_time_units:       days since 0001-01-01 00:00:00parent_variant_label:    r1i1p1f1realization_index:       1parent_activity_id:      CMIPbranch_time_in_parent:   735110.0intake_esm_dataset_key:  ScenarioMIP/NCAR/CESM2-WACCM/ssp370/Omon/gn
└── DataTree('CMIP')
    ├── DataTree('MIROC')
    │   └── DataTree('MIROC6')
    │       ├── DataTree('piControl')
    │       │   ├── DataTree('Amon')
    │       │   │   └── DataTree('gn')
    │       │   │           Dimensions:    (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6)
    │       │   │           Coordinates:
    │       │   │             * lat        (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93
    │       │   │               lat_bnds   (lat, bnds) float64 ...
    │       │   │             * lon        (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6
    │       │   │               lon_bnds   (lon, bnds) float64 ...
    │       │   │             * time       (time) object 3200-01-16 12:00:00 ... 3200-06-16 00:00:00
    │       │   │               time_bnds  (time, bnds) object ...
    │       │   │             * member_id  (member_id) object 'r1i1p1f1'
    │       │   │           Dimensions without coordinates: bnds
    │       │   │           Data variables:
    │       │   │               pr         (member_id, time, lat, lon) float32 ...
    │       │   │           Attributes: (12/48)
    │       │   │               Conventions:             CF-1.7 CMIP-6.2
    │       │   │               activity_id:             CMIP
    │       │   │               branch_method:           standard
    │       │   │               branch_time_in_child:    0.0
    │       │   │               branch_time_in_parent:   365242.0
    │       │   │               cmor_version:            3.3.2
    │       │   │               ...                      ...
    │       │   │               variable_id:             pr
    │       │   │               variant_label:           r1i1p1f1
    │       │   │               status:                  2019-10-25;created;by nhn2@columbia.edu
    │       │   │               netcdf_tracking_ids:     hdl:21.14100/9a510d79-f7ac-4cb4-9b47-6b6ec73ceb4...
    │       │   │               version_id:              v20181212
    │       │   │               intake_esm_dataset_key:  CMIP/MIROC/MIROC6/piControl/Amon/gn
    │       │   ├── DataTree('Lmon')
    │       │   │   └── DataTree('gn')
    │       │   │           Dimensions:    (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6)
    │       │   │           Coordinates:
    │       │   │             * lat        (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93
    │       │   │               lat_bnds   (lat, bnds) float64 ...
    │       │   │             * lon        (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6
    │       │   │               lon_bnds   (lon, bnds) float64 ...
    │       │   │             * time       (time) object 3200-01-16 12:00:00 ... 3200-06-16 00:00:00
    │       │   │               time_bnds  (time, bnds) object ...
    │       │   │             * member_id  (member_id) object 'r1i1p1f1'
    │       │   │           Dimensions without coordinates: bnds
    │       │   │           Data variables:
    │       │   │               mrso       (member_id, time, lat, lon) float32 ...
    │       │   │           Attributes: (12/48)
    │       │   │               Conventions:             CF-1.7 CMIP-6.2
    │       │   │               activity_id:             CMIP
    │       │   │               branch_method:           standard
    │       │   │               branch_time_in_child:    0.0
    │       │   │               branch_time_in_parent:   365242.0
    │       │   │               cmor_version:            3.3.2
    │       │   │               ...                      ...
    │       │   │               variable_id:             mrso
    │       │   │               variant_label:           r1i1p1f1
    │       │   │               status:                  2019-10-26;created;by nhn2@columbia.edu
    │       │   │               netcdf_tracking_ids:     hdl:21.14100/ce661632-766a-45e6-9f98-8caa7391d24...
    │       │   │               version_id:              v20190311
    │       │   │               intake_esm_dataset_key:  CMIP/MIROC/MIROC6/piControl/Lmon/gn
    │       │   └── DataTree('Omon')
    │       │       └── DataTree('gn')
    │       │               Dimensions:             (y: 256, x: 360, time: 6, lev: 1, bnds: 2,
    │       │                                        member_id: 1, vertices: 4)
    │       │               Coordinates: (12/13)
    │       │                   latitude            (y, x) float32 ...
    │       │                 * lev                 (lev) float64 1.0
    │       │                   lev_bnds            (lev, bnds) float64 ...
    │       │                   longitude           (y, x) float32 ...
    │       │                   sigma_bnds          (lev, bnds) float64 ...
    │       │                 * time                (time) object 3200-01-16 12:00:00 ... 3200-06-16 00:0...
    │       │                   ...                  ...
    │       │                 * x                   (x) float64 0.5 1.5 2.5 3.5 ... 356.5 357.5 358.5 359.5
    │       │                   x_bnds              (x, bnds) float64 ...
    │       │                 * y                   (y) float64 -88.0 -85.75 -85.25 ... 148.6 150.5 152.4
    │       │                   y_bnds              (y, bnds) float64 ...
    │       │                   zlev_bnds           (lev, bnds) float64 ...
    │       │                 * member_id           (member_id) object 'r1i1p1f1'
    │       │               Dimensions without coordinates: bnds, vertices
    │       │               Data variables:
    │       │                   depth               (y, x) float32 ...
    │       │                   depth_c             float64 ...
    │       │                   eta                 (time, y, x) float32 ...
    │       │                   nsigma              int32 ...
    │       │                   sigma               (lev) float64 ...
    │       │                   thetao              (member_id, time, lev, y, x) float32 ...
    │       │                   vertices_latitude   (y, x, vertices) float32 ...
    │       │                   vertices_longitude  (y, x, vertices) float32 ...
    │       │                   zlev                (lev) float64 ...
    │       │               Attributes: (12/48)
    │       │                   Conventions:             CF-1.7 CMIP-6.2
    │       │                   activity_id:             CMIP
    │       │                   branch_method:           standard
    │       │                   branch_time_in_child:    0.0
    │       │                   branch_time_in_parent:   365242.0
    │       │                   cmor_version:            3.3.2
    │       │                   ...                      ...
    │       │                   variable_id:             thetao
    │       │                   variant_label:           r1i1p1f1
    │       │                   status:                  2019-10-25;created;by nhn2@columbia.edu
    │       │                   netcdf_tracking_ids:     hdl:21.14100/3d093cd5-61c4-4cb6-824e-d776d71e535...
    │       │                   version_id:              v20190311
    │       │                   intake_esm_dataset_key:  CMIP/MIROC/MIROC6/piControl/Omon/gn
    │       └── DataTree('historical')
    │           ├── DataTree('Amon')
    │           │   └── DataTree('gn')
    │           │           Dimensions:    (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6)
    │           │           Coordinates:
    │           │             * lat        (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93
    │           │               lat_bnds   (lat, bnds) float64 ...
    │           │             * lon        (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6
    │           │               lon_bnds   (lon, bnds) float64 ...
    │           │             * time       (time) datetime64[ns] 1850-01-16T12:00:00 ... 1850-06-16
    │           │               time_bnds  (time, bnds) datetime64[ns] ...
    │           │             * member_id  (member_id) object 'r1i1p1f1'
    │           │           Dimensions without coordinates: bnds
    │           │           Data variables:
    │           │               pr         (member_id, time, lat, lon) float32 ...
    │           │           Attributes: (12/48)
    │           │               Conventions:             CF-1.7 CMIP-6.2
    │           │               activity_id:             CMIP
    │           │               branch_method:           standard
    │           │               branch_time_in_child:    0.0
    │           │               branch_time_in_parent:   0.0
    │           │               cmor_version:            3.3.2
    │           │               ...                      ...
    │           │               variable_id:             pr
    │           │               variant_label:           r1i1p1f1
    │           │               status:                  2019-10-25;created;by nhn2@columbia.edu
    │           │               netcdf_tracking_ids:     hdl:21.14100/61fa8b6b-e74c-4e86-9344-8ba946ee8a8...
    │           │               version_id:              v20181212
    │           │               intake_esm_dataset_key:  CMIP/MIROC/MIROC6/historical/Amon/gn
    │           ├── DataTree('Lmon')
    │           │   └── DataTree('gn')
    │           │           Dimensions:    (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6)
    │           │           Coordinates:
    │           │             * lat        (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93
    │           │               lat_bnds   (lat, bnds) float64 ...
    │           │             * lon        (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6
    │           │               lon_bnds   (lon, bnds) float64 ...
    │           │             * time       (time) datetime64[ns] 1850-01-16T12:00:00 ... 1850-06-16
    │           │               time_bnds  (time, bnds) datetime64[ns] ...
    │           │             * member_id  (member_id) object 'r1i1p1f1'
    │           │           Dimensions without coordinates: bnds
    │           │           Data variables:
    │           │               mrso       (member_id, time, lat, lon) float32 ...
    │           │           Attributes: (12/48)
    │           │               Conventions:             CF-1.7 CMIP-6.2
    │           │               activity_id:             CMIP
    │           │               branch_method:           standard
    │           │               branch_time_in_child:    0.0
    │           │               branch_time_in_parent:   0.0
    │           │               cmor_version:            3.3.2
    │           │               ...                      ...
    │           │               variable_id:             mrso
    │           │               variant_label:           r1i1p1f1
    │           │               status:                  2019-10-25;created;by nhn2@columbia.edu
    │           │               netcdf_tracking_ids:     hdl:21.14100/a702781b-b6d9-4f90-a65d-c649d59a224...
    │           │               version_id:              v20190311
    │           │               intake_esm_dataset_key:  CMIP/MIROC/MIROC6/historical/Lmon/gn
    │           └── DataTree('Omon')
    │               └── DataTree('gn')
    │                       Dimensions:             (y: 256, x: 360, time: 6, lev: 1, bnds: 2,
    │                                                member_id: 1, vertices: 4)
    │                       Coordinates: (12/13)
    │                           latitude            (y, x) float32 ...
    │                         * lev                 (lev) float64 1.0lev_bnds            (lev, bnds) float64 ...
    │                           longitude           (y, x) float32 ...
    │                           sigma_bnds          (lev, bnds) float64 ...
    │                         * time                (time) datetime64[ns] 1850-01-16T12:00:00 ... 1850-06-16
    │                           ...                  ...
    │                         * x                   (x) float64 0.5 1.5 2.5 3.5 ... 356.5 357.5 358.5 359.5x_bnds              (x, bnds) float64 ...
    │                         * y                   (y) float64 -88.0 -85.75 -85.25 ... 148.6 150.5 152.4y_bnds              (y, bnds) float64 ...
    │                           zlev_bnds           (lev, bnds) float64 ...
    │                         * member_id           (member_id) object 'r1i1p1f1'Dimensions without coordinates: bnds, verticesData variables:
    │                           depth               (y, x) float32 ...
    │                           depth_c             float64 ...
    │                           eta                 (time, y, x) float32 ...
    │                           nsigma              int32 ...
    │                           sigma               (lev) float64 ...
    │                           thetao              (member_id, time, lev, y, x) float32 ...
    │                           vertices_latitude   (y, x, vertices) float32 ...
    │                           vertices_longitude  (y, x, vertices) float32 ...
    │                           zlev                (lev) float64 ...
    │                       Attributes: (12/48)
    │                           Conventions:             CF-1.7 CMIP-6.2activity_id:             CMIPbranch_method:           standardbranch_time_in_child:    0.0branch_time_in_parent:   0.0cmor_version:            3.3.2
    │                           ...                      ...
    │                           variable_id:             thetaovariant_label:           r1i1p1f1status:                  2019-11-08;created;by nhn2@columbia.edunetcdf_tracking_ids:     hdl:21.14100/16598b35-19b4-49e3-98de-27b9e9444ad...
    │                           version_id:              v20190311intake_esm_dataset_key:  CMIP/MIROC/MIROC6/historical/Omon/gn
    ├── DataTree('CCCma')
    │   └── DataTree('CanESM5')
    │       ├── DataTree('piControl')
    │       │   ├── DataTree('Amon')
    │       │   │   └── DataTree('gn')
    │       │   │           Dimensions:    (lat: 64, bnds: 2, lon: 128, member_id: 1, time: 6)
    │       │   │           Coordinates:
    │       │   │             * lat        (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86
    │       │   │               lat_bnds   (lat, bnds) float64 ...
    │       │   │             * lon        (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2
    │       │   │               lon_bnds   (lon, bnds) float64 ...
    │       │   │             * time       (time) object 5201-01-16 12:00:00 ... 5201-06-16 00:00:00
    │       │   │               time_bnds  (time, bnds) object ...
    │       │   │             * member_id  (member_id) object 'r1i1p1f1'
    │       │   │           Dimensions without coordinates: bnds
    │       │   │           Data variables:
    │       │   │               pr         (member_id, time, lat, lon) float32 ...
    │       │   │           Attributes: (12/57)
    │       │   │               CCCma_model_hash:            24718c8346665b218729640ffe79d263b76456c4
    │       │   │               CCCma_parent_runid:          rc3-pictrl
    │       │   │               CCCma_pycmor_hash:           33c30511acc319a98240633965a04ca99c26427e
    │       │   │               CCCma_runid:                 rc3.1-pictrl
    │       │   │               Conventions:                 CF-1.7 CMIP-6.2
    │       │   │               YMDH_branch_time_in_child:   5201:01:01:00
    │       │   │               ...                          ...
    │       │   │               variant_label:               r1i1p1f1
    │       │   │               version:                     v20190429
    │       │   │               status:                      2019-10-25;created;by nhn2@columbia.edu
    │       │   │               netcdf_tracking_ids:         hdl:21.14100/e32e34a8-13cf-45b5-ba68-3d6b584...
    │       │   │               version_id:                  v20190429
    │       │   │               intake_esm_dataset_key:      CMIP/CCCma/CanESM5/piControl/Amon/gn
    │       │   ├── DataTree('Lmon')
    │       │   │   └── DataTree('gn')
    │       │   │           Dimensions:    (member_id: 1, time: 6, lat: 64, lon: 128, bnds: 2)
    │       │   │           Coordinates:
    │       │   │             * lat        (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86
    │       │   │               lat_bnds   (lat, bnds) float64 ...
    │       │   │             * lon        (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2
    │       │   │               lon_bnds   (lon, bnds) float64 ...
    │       │   │             * time       (time) object 5201-01-16 12:00:00 ... 5201-06-16 00:00:00
    │       │   │               time_bnds  (time, bnds) object ...
    │       │   │             * member_id  (member_id) object 'r1i1p1f1'
    │       │   │           Dimensions without coordinates: bnds
    │       │   │           Data variables:
    │       │   │               gpp        (member_id, time, lat, lon) float32 ...
    │       │   │               mrso       (member_id, time, lat, lon) float32 ...
    │       │   │           Attributes: (12/47)
    │       │   │               source_type:                 AOGCM
    │       │   │               YMDH_branch_time_in_child:   5201:01:01:00
    │       │   │               initialization_index:        1
    │       │   │               product:                     model-output
    │       │   │               branch_method:               Spin-up documentation
    │       │   │               mip_era:                     CMIP6
    │       │   │               ...                          ...
    │       │   │               parent_time_units:           days since 1850-01-01 0:0:0.0
    │       │   │               parent_variant_label:        r1i1p1f1
    │       │   │               realization_index:           1
    │       │   │               parent_activity_id:          CMIP
    │       │   │               branch_time_in_parent:       1223115.0
    │       │   │               intake_esm_dataset_key:      CMIP/CCCma/CanESM5/piControl/Lmon/gn
    │       │   └── DataTree('Omon')
    │       │       └── DataTree('gn')
    │       │               Dimensions:             (i: 360, j: 291, lev: 1, bnds: 2, member_id: 1,
    │       │                                        time: 6, vertices: 4)
    │       │               Coordinates:
    │       │                 * i                   (i) int32 0 1 2 3 4 5 6 ... 353 354 355 356 357 358 359
    │       │                 * j                   (j) int32 0 1 2 3 4 5 6 ... 284 285 286 287 288 289 290
    │       │                   latitude            (j, i) float64 ...
    │       │                 * lev                 (lev) float64 3.047
    │       │                   lev_bnds            (lev, bnds) float64 ...
    │       │                   longitude           (j, i) float64 ...
    │       │                 * time                (time) object 5201-01-16 12:00:00 ... 5201-06-16 00:0...
    │       │                   time_bnds           (time, bnds) object ...
    │       │                 * member_id           (member_id) object 'r1i1p1f1'
    │       │               Dimensions without coordinates: bnds, vertices
    │       │               Data variables:
    │       │                   no3                 (member_id, time, lev, j, i) float32 ...
    │       │                   vertices_latitude   (j, i, vertices) float64 ...
    │       │                   vertices_longitude  (j, i, vertices) float64 ...
    │       │                   thetao              (member_id, time, lev, j, i) float32 ...
    │       │               Attributes: (12/52)
    │       │                   source:                      CanESM5 (2019): \naerosol: interactive\natmo...
    │       │                   source_type:                 AOGCM
    │       │                   institution:                 Canadian Centre for Climate Modelling and An...
    │       │                   YMDH_branch_time_in_child:   5201:01:01:00
    │       │                   initialization_index:        1
    │       │                   product:                     model-output
    │       │                   ...                          ...
    │       │                   parent_time_units:           days since 1850-01-01 0:0:0.0
    │       │                   parent_variant_label:        r1i1p1f1
    │       │                   realization_index:           1
    │       │                   parent_activity_id:          CMIP
    │       │                   branch_time_in_parent:       1223115.0
    │       │                   intake_esm_dataset_key:      CMIP/CCCma/CanESM5/piControl/Omon/gn
    │       └── DataTree('historical')
    │           ├── DataTree('Amon')
    │           │   └── DataTree('gn')
    │           │           Dimensions:    (lat: 64, bnds: 2, lon: 128, member_id: 1, time: 6)
    │           │           Coordinates:
    │           │             * lat        (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86
    │           │               lat_bnds   (lat, bnds) float64 ...
    │           │             * lon        (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2
    │           │               lon_bnds   (lon, bnds) float64 ...
    │           │             * time       (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00
    │           │               time_bnds  (time, bnds) object ...
    │           │             * member_id  (member_id) object 'r1i1p1f1'
    │           │           Dimensions without coordinates: bnds
    │           │           Data variables:
    │           │               pr         (member_id, time, lat, lon) float32 ...
    │           │           Attributes: (12/57)
    │           │               CCCma_model_hash:            3dedf95315d603326fde4f5340dc0519d80d10c0
    │           │               CCCma_parent_runid:          rc3-pictrl
    │           │               CCCma_pycmor_hash:           33c30511acc319a98240633965a04ca99c26427e
    │           │               CCCma_runid:                 rc3.1-his01
    │           │               Conventions:                 CF-1.7 CMIP-6.2
    │           │               YMDH_branch_time_in_child:   1850:01:01:00
    │           │               ...                          ...
    │           │               variant_label:               r1i1p1f1
    │           │               version:                     v20190429
    │           │               status:                      2019-10-25;created;by nhn2@columbia.edu
    │           │               netcdf_tracking_ids:         hdl:21.14100/363e1ebe-46e7-43dc-9feb-a7a4a0c...
    │           │               version_id:                  v20190429
    │           │               intake_esm_dataset_key:      CMIP/CCCma/CanESM5/historical/Amon/gn
    │           ├── DataTree('Lmon')
    │           │   └── DataTree('gn')
    │           │           Dimensions:    (member_id: 1, time: 6, lat: 64, lon: 128, bnds: 2)
    │           │           Coordinates:
    │           │             * lat        (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86
    │           │               lat_bnds   (lat, bnds) float64 ...
    │           │             * lon        (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2
    │           │               lon_bnds   (lon, bnds) float64 ...
    │           │             * time       (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00
    │           │               time_bnds  (time, bnds) object ...
    │           │             * member_id  (member_id) object 'r1i1p1f1'
    │           │           Dimensions without coordinates: bnds
    │           │           Data variables:
    │           │               gpp        (member_id, time, lat, lon) float32 ...
    │           │               mrso       (member_id, time, lat, lon) float32 ...
    │           │           Attributes: (12/53)
    │           │               source:                      CanESM5 (2019): \naerosol: interactive\natmo...
    │           │               source_type:                 AOGCM
    │           │               institution:                 Canadian Centre for Climate Modelling and An...
    │           │               YMDH_branch_time_in_child:   1850:01:01:00
    │           │               initialization_index:        1
    │           │               product:                     model-output
    │           │               ...                          ...
    │           │               parent_time_units:           days since 1850-01-01 0:0:0.0
    │           │               parent_variant_label:        r1i1p1f1
    │           │               realization_index:           1
    │           │               parent_activity_id:          CMIP
    │           │               branch_time_in_parent:       1223115.0
    │           │               intake_esm_dataset_key:      CMIP/CCCma/CanESM5/historical/Lmon/gn
    │           └── DataTree('Omon')
    │               └── DataTree('gn')
    │                       Dimensions:             (i: 360, j: 291, lev: 1, bnds: 2, member_id: 1,
    │                                                time: 6, vertices: 4)
    │                       Coordinates:
    │                         * i                   (i) int32 0 1 2 3 4 5 6 ... 353 354 355 356 357 358 359* j                   (j) int32 0 1 2 3 4 5 6 ... 284 285 286 287 288 289 290latitude            (j, i) float64 ...
    │                         * lev                 (lev) float64 3.047lev_bnds            (lev, bnds) float64 ...
    │                           longitude           (j, i) float64 ...
    │                         * time                (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:0...
    │                           time_bnds           (time, bnds) object ...
    │                         * member_id           (member_id) object 'r1i1p1f1'Dimensions without coordinates: bnds, verticesData variables:
    │                           no3                 (member_id, time, lev, j, i) float32 ...
    │                           vertices_latitude   (j, i, vertices) float64 ...
    │                           vertices_longitude  (j, i, vertices) float64 ...
    │                           thetao              (member_id, time, lev, j, i) float32 ...
    │                       Attributes: (12/52)
    │                           source:                      CanESM5 (2019): \naerosol: interactive\natmo...
    │                           source_type:                 AOGCMinstitution:                 Canadian Centre for Climate Modelling and An...
    │                           YMDH_branch_time_in_child:   1850:01:01:00initialization_index:        1product:                     model-output
    │                           ...                          ...
    │                           parent_time_units:           days since 1850-01-01 0:0:0.0parent_variant_label:        r1i1p1f1realization_index:           1parent_activity_id:          CMIPbranch_time_in_parent:       1223115.0intake_esm_dataset_key:      CMIP/CCCma/CanESM5/historical/Omon/gn
    └── DataTree('NCAR')
        └── DataTree('CESM2-WACCM')
            ├── DataTree('historical')
            │   ├── DataTree('Amon')
            │   │   └── DataTree('gn')
            │   │           Dimensions:    (member_id: 1, time: 6, plev: 1, lat: 192, lon: 288, nbnd: 2)
            │   │           Coordinates:
            │   │             * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0
            │   │               lat_bnds   (lat, nbnd) float64 ...
            │   │             * lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8
            │   │               lon_bnds   (lon, nbnd) float64 ...
            │   │             * plev       (plev) float64 1e+05
            │   │             * time       (time) object 1850-01-15 12:00:00 ... 1850-06-15 00:00:00
            │   │               time_bnds  (time, nbnd) object ...
            │   │             * member_id  (member_id) object 'r1i1p1f1'
            │   │           Dimensions without coordinates: nbnd
            │   │           Data variables:
            │   │               co2        (member_id, time, plev, lat, lon) float32 ...
            │   │               pr         (member_id, time, lat, lon) float32 ...
            │   │           Attributes: (12/46)
            │   │               source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
            │   │               source_type:             AOGCM BGC CHEM AER
            │   │               institution:             National Center for Atmospheric Research, Climat...
            │   │               initialization_index:    1
            │   │               product:                 model-output
            │   │               branch_method:           standard
            │   │               ...                      ...
            │   │               parent_time_units:       days since 0001-01-01 00:00:00
            │   │               parent_variant_label:    r1i1p1f1
            │   │               realization_index:       1
            │   │               parent_activity_id:      CMIP
            │   │               branch_time_in_parent:   20075.0
            │   │               intake_esm_dataset_key:  CMIP/NCAR/CESM2-WACCM/historical/Amon/gn
            │   ├── DataTree('Lmon')
            │   │   └── DataTree('gn')
            │   │           Dimensions:    (member_id: 1, time: 6, lat: 192, lon: 288, hist_interval: 2)
            │   │           Coordinates:
            │   │             * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0
            │   │               lat_bnds   (lat, hist_interval) float32 ...
            │   │             * lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8
            │   │               lon_bnds   (lon, hist_interval) float32 ...
            │   │             * time       (time) object 1850-01-15 11:45:00.000013 ... 1850-06-15 00:00:00
            │   │               time_bnds  (time, hist_interval) object ...
            │   │             * member_id  (member_id) object 'r1i1p1f1'
            │   │           Dimensions without coordinates: hist_interval
            │   │           Data variables:
            │   │               gpp        (member_id, time, lat, lon) float32 ...
            │   │               mrso       (member_id, time, lat, lon) float32 ...
            │   │           Attributes: (12/46)
            │   │               source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
            │   │               source_type:             AOGCM BGC CHEM AER
            │   │               institution:             National Center for Atmospheric Research, Climat...
            │   │               initialization_index:    1
            │   │               product:                 model-output
            │   │               branch_method:           standard
            │   │               ...                      ...
            │   │               parent_time_units:       days since 0001-01-01 00:00:00
            │   │               parent_variant_label:    r1i1p1f1
            │   │               realization_index:       1
            │   │               parent_activity_id:      CMIP
            │   │               branch_time_in_parent:   20075.0
            │   │               intake_esm_dataset_key:  CMIP/NCAR/CESM2-WACCM/historical/Lmon/gn
            │   └── DataTree('Omon')
            │       └── DataTree('gn')
            │               Dimensions:    (nlat: 384, nlon: 320, vertices: 4, lev: 1, d2: 2, member_id: 1,
            │                               time: 6)
            │               Coordinates:
            │                   lat        (nlat, nlon) float64 ...
            │                   lat_bnds   (nlat, nlon, vertices) float32 ...
            │                 * lev        (lev) float64 500.0lev_bnds   (lev, d2) float32 ...
            │                   lon        (nlat, nlon) float64 ...
            │                   lon_bnds   (nlat, nlon, vertices) float32 ...
            │                 * nlat       (nlat) int32 1 2 3 4 5 6 7 8 ... 377 378 379 380 381 382 383 384* nlon       (nlon) int32 1 2 3 4 5 6 7 8 ... 313 314 315 316 317 318 319 320* time       (time) object 1850-01-15 13:00:00 ... 1850-06-15 00:00:00time_bnds  (time, d2) object ...
            │                 * member_id  (member_id) object 'r1i1p1f1'Dimensions without coordinates: vertices, d2Data variables:
            │                   no3        (member_id, time, lev, nlat, nlon) float32 ...
            │                   thetao     (member_id, time, lev, nlat, nlon) float32 ...
            │               Attributes: (12/45)
            │                   source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
            │                   source_type:             AOGCM BGC CHEM AERinstitution:             National Center for Atmospheric Research, Climat...
            │                   initialization_index:    1product:                 model-outputbranch_method:           standard
            │                   ...                      ...
            │                   parent_time_units:       days since 0001-01-01 00:00:00parent_variant_label:    r1i1p1f1realization_index:       1parent_activity_id:      CMIPbranch_time_in_parent:   20075.0intake_esm_dataset_key:  CMIP/NCAR/CESM2-WACCM/historical/Omon/gn
            └── DataTree('piControl')
                ├── DataTree('Lmon')
                │   └── DataTree('gn')
                │           Dimensions:    (lat: 192, lon: 288, time: 6, member_id: 1, hist_interval: 2)
                │           Coordinates:
                │             * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0* lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8* time       (time) object 0001-01-15 11:44:59.999999 ... 0001-03-15 12:00:...
                │             * member_id  (member_id) object 'r1i1p1f1'lat_bnds   (lat, hist_interval) float32 ...
                │               lon_bnds   (lon, hist_interval) float32 ...
                │               time_bnds  (time, hist_interval) object ...
                │           Dimensions without coordinates: hist_intervalData variables:
                │               gpp        (member_id, time, lat, lon) float32 ...
                │               mrso       (member_id, time, lat, lon) float32 ...
                │           Attributes: (12/45)
                │               source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
                │               source_type:             AOGCM BGC CHEM AERinstitution:             National Center for Atmospheric Research, Climat...
                │               initialization_index:    1product:                 model-outputbranch_method:           standard
                │               ...                      ...
                │               parent_time_units:       days since 0001-01-01 00:00:00parent_variant_label:    r1i1p1f1realization_index:       1parent_activity_id:      CMIPbranch_time_in_parent:   48545.0intake_esm_dataset_key:  CMIP/NCAR/CESM2-WACCM/piControl/Lmon/gn
                ├── DataTree('Amon')
                │   └── DataTree('gn')
                │           Dimensions:    (member_id: 1, time: 6, plev: 1, lat: 192, lon: 288, nbnd: 2)
                │           Coordinates:
                │             * lat        (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0lat_bnds   (lat, nbnd) float64 ...
                │             * lon        (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8lon_bnds   (lon, nbnd) float64 ...
                │             * plev       (plev) float64 1e+05* time       (time) object 0001-01-15 12:00:00 ... 0001-06-15 00:00:00time_bnds  (time, nbnd) object ...
                │             * member_id  (member_id) object 'r1i1p1f1'Dimensions without coordinates: nbndData variables:
                │               co2        (member_id, time, plev, lat, lon) float32 ...
                │               pr         (member_id, time, lat, lon) float32 ...
                │           Attributes: (12/47)
                │               source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
                │               source_type:             AOGCM BGC CHEM AERinstitution:             National Center for Atmospheric Research, Climat...
                │               initialization_index:    1product:                 model-outputbranch_method:           standard
                │               ...                      ...
                │               parent_time_units:       days since 0001-01-01 00:00:00parent_variant_label:    r1i1p1f1realization_index:       1parent_activity_id:      CMIPbranch_time_in_parent:   48545.0intake_esm_dataset_key:  CMIP/NCAR/CESM2-WACCM/piControl/Amon/gn
                └── DataTree('Omon')
                    └── DataTree('gn')
                            Dimensions:    (nlat: 384, nlon: 320, vertices: 4, lev: 1, d2: 2, member_id: 1,
                                            time: 6)
                            Coordinates:
                                lat        (nlat, nlon) float64 ...
                                lat_bnds   (nlat, nlon, vertices) float32 ...
                              * lev        (lev) float64 500.0
                                lev_bnds   (lev, d2) float32 ...
                                lon        (nlat, nlon) float64 ...
                                lon_bnds   (nlat, nlon, vertices) float32 ...
                              * nlat       (nlat) int32 1 2 3 4 5 6 7 8 ... 377 378 379 380 381 382 383 384
                              * nlon       (nlon) int32 1 2 3 4 5 6 7 8 ... 313 314 315 316 317 318 319 320
                              * time       (time) object 0001-01-15 13:00:00.999998 ... 0001-06-15 00:00:...
                                time_bnds  (time, d2) object ...
                              * member_id  (member_id) object 'r1i1p1f1'
                            Dimensions without coordinates: vertices, d2
                            Data variables:
                                no3        (member_id, time, lev, nlat, nlon) float32 ...
                                thetao     (member_id, time, lev, nlat, nlon) float32 ...
                            Attributes: (12/44)
                                source:                  CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ...
                                source_type:             AOGCM BGC CHEM AER
                                institution:             National Center for Atmospheric Research, Climat...
                                initialization_index:    1
                                product:                 model-output
                                branch_method:           standard
                                ...                      ...
                                parent_time_units:       days since 0001-01-01 00:00:00
                                parent_variant_label:    r1i1p1f1
                                realization_index:       1
                                parent_activity_id:      CMIP
                                branch_time_in_parent:   48545.0
                                intake_esm_dataset_key:  CMIP/NCAR/CESM2-WACCM/piControl/Omon/gn
❯ ls -ltrh datasets
total 141M
-rw-rw-r-- 1 andersy005 andersy005  22M Aug  3 12:08 cesm2-lens-sample.nc
-rw-rw-r-- 1 andersy005 andersy005 119M Aug  3 12:59 cmip6-sample.nc

Do you prefer to have these hosted in a cloud storage bucket? We (at CarbonPlan) might be able to host these (Cc @jhamman)

@andersy005
Copy link
Member

@andersy005 @jhamman just a ping to say that if either of you can help me get some data like this I could really push forward with writing cool documentation for datatree!

Ccing @mgrover1 / @scottyhq who may have some ideas about additional radar / satellite datasets we could use

@mgrover1
Copy link
Contributor

mgrover1 commented Aug 3, 2022

Here is a demo notebook using datatree with radar data - plenty of other examples in that repo too.

https://github.com/mgrover1/pyart-xarray-sandbox/blob/main/notebooks/test-hierachal-multivolume.ipynb

Another example dataset might be a grib file with fields on different levels (isobaric pressure, surface, height above ground)

@mgrover1
Copy link
Contributor

mgrover1 commented Aug 3, 2022

I would be willing to help write some of the docs for the precious two use cases!

@eni-awowale
Copy link

Closed and moved to pydata/xarray#9437

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
documentation Improvements or additions to documentation help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

5 participants