Skip to content

Commit

Permalink
Merge pull request #49 from fjaviersanchez/update_data_md
Browse files Browse the repository at this point in the history
Updated simulated data product descriptions
  • Loading branch information
tddesjardins authored Sep 24, 2024
2 parents d390758 + 64366cd commit 332b6b2
Showing 1 changed file with 71 additions and 12 deletions.
83 changes: 71 additions & 12 deletions markdown/simulated-data.md
Original file line number Diff line number Diff line change
@@ -1,22 +1,81 @@
# Simulated Data Products on the RSP

The S3 bucket (where reference files are stored on the RSP) contains two types of files: simulated Roman WFI images and the corresponding input/configuration files used to generate them. Simulated WFI images are divided into two categories: those created with Roman I-Sim, which include detector-level calibration features, and those created with the Space Telescope Image Product Simulator (STIPS), which simulate astronomical scenes. For the Roman I-Sim products, the provided configuration files include input catalogs and shell scripts used in their creation. However, for STIPS products, only the input catalog is provided.

## Roman I-Sim Simulated Images
The Roman I-Sim products are stored in Advanced Scientific Data Format (ASDF) files and adhere to the development version of the WFI science data product schema. This means that Roman I-Sim ASDF files organize data and metadata identically to how WFI data will be organized during science operations. Input catalogs are saved as Enhanced Character-Separated Value (ECSV) files and were generated using the parametric source generation offered within the tool. It is important to note that this source generation process does not utilize spectral energy distribution (SED) information to produce sources with realistic colors.

Roman I-Sim products are created using reference files from the Calibration Reference Data System (CRDS). Therefore, the instrumental effects in the simulations are only as accurate as the reference files used to create them. Many of the reference files in CRDS are currently estimated based on mission requirements or models of the observatory/WFI and do not yet reflect recent ground testing campaigns. Additionally, some reference files (e.g., photometric calibration) are obtained for a single, idealized detector.
There are currently three types of simulated WFI images available for use with the Roman Science Platform (RSP). These include data products from:

The Roman I-Sim simulated products include:
- A simulation of a dense region (~12.5 deg2) composed of 108 stars and 106 galaxies
- Two full-focal-plane exposures (18 detectors each) centered at (RA, Dec) = (0.50, 0.50) deg in the F129 (J) and F158 (H) optical elements.
- Two full-focal-plane exposures (18 detectors each) centered at (RA, Dec) = (0.47, 0.51) deg in the F129 (J) and F158 (H) optical elements.
- A simulation of a 'high-latitude' region (~0.5 deg2) composed of 3 x 104 stars and 104 galaxies.
- One detector (WFI01) in F106 (Y) with the WFI focal plane centered at (RA, Dec) = (0.0, 0.0) deg


* [Roman I-Sim](#romanisim), which generates images that include simulated detector-level calibration features;

* The [Space Telescope Image Product Simulator](#stips) (STIPS), which simulates astronomical observing scenes;

* [The Open Universe Roman](#openuniverse) simulated preview data, converted to the Advanced Scientific Data Format (ADSF). This includes single exposures and, when available, truth catalogs for mock Roman Wide Area Survey data (Roman WAS) and the Time Domain Survey (Roman TDS).



An overview of the simulation tools and data products can be found below. At present, the majority of these files are stored in a private AWS S3 bucket accessible only within the RSP. More details on accessing these products can be found **HERE**.



<!--The S3 bucket (where reference files are stored on the RSP) contains two types of files: simulated Roman WFI images and the corresponding input/configuration files used to generate them. Simulated WFI images are divided into two categories: those created with Roman I-Sim, which include detector-level calibration features, and those created with the Space Telescope Image Product Simulator (STIPS), which simulate astronomical scenes. For the Roman I-Sim products, the provided configuration files include input catalogs and shell scripts used in their creation. However, for STIPS products, only the input catalog is provided.-->



## <a name="romanisim"></a>Roman I-Sim Simulated Images

[Roman I-Sim](https://github.com/spacetelescope/romanisim) generates Roman WFI imaging products by combining an input source catalog with simulated instrument and detector effects. A detailed description of the features included in the simulations can be found in the Roman-I-Sim [package documentation](https://romanisim.readthedocs.io/en/latest/romanisim/overview.html).

The Roman I-Sim products have metadata and an organizational structure identical to what is expected for actual Roman data taken during operations. For example, the products are stored in Advanced Scientific Data Format (ASDF) files and adhere to the development version of the [WFI science data product schema](https://roman-docs.stsci.edu/data-handbook-home/wfi-data-format/data-levels-and-products) installed on the RSP. In the example [Science Workflows and Tutorials](LINK), the input catalogs are provided as Enhanced Character-Separated Value (ECSV) files and are generated using the parametric source generation offered within the tool. It is important to note that this source generation process does not utilize spectral energy distribution (SED) information to produce sources with realistic colors.

Roman I-Sim products are created using reference files from the [Calibration Reference Data System](https://roman-docs.stsci.edu/data-handbook-home/accessing-wfi-data/crds-for-reference-files) (CRDS). Therefore, the accuracy of the instrumental effects in the simulations _is only as good as the reference files used to create them_. Currently, many of the reference files in CRDS do not reflect knowledge gained from laboratory ground test campaigns, since the analysis of these data is ongoing. In the absence of measured ground test results, sample reference files are created based on a combination of mission requirements, observatory/WFI models, and simulations. Additionally, some reference files (e.g., for photometric calibration) are based on a single, idealized detector. Questions regarding calibration reference files should be sent to the [Roman Helpdesk at STScI](https://stsci.service-now.com/roman).

The Roman-I-Sim simulated products include:

- A simulation of a dense region (~12.5 sq deg) composed of 10<sup>8</sup> stars and 10<sup>6</sup> galaxies

- Two full-focal-plane exposures (18 detectors each) centered at (RA, Dec) = (0.50, 0.50) deg in the F129 (J) and F158 (H) optical elements.

- Two full-focal-plane exposures (18 detectors each) centered at (RA, Dec) = (0.47, 0.51) deg in the F129 (J) and F158 (H) optical elements.

- A simulation of a 'high-latitude' region (~0.5 sq deg) composed of 3 x 10<sup>4</sup> stars and 10<sup>4</sup> galaxies.

- One detector (WFI01) in F106 (Y) with the WFI focal plane centered at (RA, Dec) = (0.0, 0.0) deg.

## <a name="stips"></a>STIPS Astronomical Scenes

[STIPS](https://stips.readthedocs.io/en/latest/) is an image simulator designed to quickly generate post-pipeline astronomical scenes for any number of detectors, encompassing the entire WFI FOV.  STIPS is equipped to incorporate instrumental distortion (if available), along with calibration residuals originating from flatfields, dark currents, and cosmic rays. Furthermore, it includes an estimate of Poisson and readout noise in the simulations, though it doesn't cover instrument saturation or non-linearity effects.

STIPS is particularly useful when Pandeia, the Exposure Time Calculator for the WFI, does not provide a sufficiently large simulation area, i.e., more than 25 x 25 arcseconds. Therefore,  STIPS is necessary for full-detector or multiple-detector image simulations. STIPS obtains instrument and filter parameters for the WFI directly from Pandeia and approximates Point Spread Functions (PSFs) at any pixel location by interpolating over a grid of nine evenly-distributed, detector-specific PSFs generated with WebbPSF, the PSF modeling software for the WFI. The resulting flux measurements are expected to be within ~10% of those generated by Pandeia.

While STIPS can add error residuals (representing the remaining uncertainty after pipeline calibration), it does not start with Level 1 (L1) data products to propagate instrumental calibrations through the output images. Additionally, the output residuals are not fully validated against the actual pipeline calibrations of L1 data products. Therefore, STIPS is not the ideal choice if high instrumental accuracy is needed.

Simulated STIPS images are currently saved in a multi-extension FITS format, with each detector saved as a separate extension. Unlike the Roman I-Sim files described above, these files do not contain extensive WFI metadata. Input catalogs are saved as Comma-Separated Value (CSV) files.

## STIPS Astronomical Scenes
STIPS files are currently saved in a multi-extension FITS format, with each detector saved as a separate extension. These files do not contain extensive WFI metadata. Input catalogs are saved as Comma-Separated Value (CSV) files.
The STIPS simulated products include:

- A full-focal-plane simulation (18 detectors) of the globular cluster M13 using F087 (z), F106 (Y), F129 (J), F158 (H), and F184 (H/K) filters

## <a name="openuniverse"></a>Open Universe Roman simulation

The Open Universe 2024 project aims to simulate overlapping images from surveys mimicking those that will be carried out by the Nancy Grace Roman Space Telescope, and the Vera C. Rubin Observatory. These data are publicly available at: https://irsa.ipac.caltech.edu/data/theory/openuniverse2024/overview.html as a set of FITS images and Parquet catalogs. As a demonstration of the capabilities of ASDF and the platform, Roman-specific images from this project are available on the RSP. There are two main datasets currently converted:

- Roman High Latitude Wide Area Survey single exposures in 6 photometric bands (F184, F158, F129, F213, F146, F106).

- Roman High Latitude Time Domain Survey in 7 photometric bands (F184, F158, F129, F213, F062, F106, and F087).

The images in the RSP consist of Roman images overlapping with the LSST ELAIS-S1 Deep Drilling Field (DDF), simulating the High Latitude Time Domain Survey, the High Latitude Wide Area Survey, as well as a calibration deep field for the High Latitude Wide Area Survey.

All data have been converted to ASDF format, which include, when available, the truth catalog with simulated sources overlapping the exposures. For simplicity, the ASDF files containing the simulated data follow a custom schema.

The data is contained in the `roman` block, which contains the following sub-blocks


| Data block name | Description |
| ----------------- |--------------|
| data | 4096 × 4096 pixel images containing the accumulated total counts after the exposure|
| dq | 4096 × 4096 pixel images with the data quality flags|
| meta | Metadata block|
| catalogs | Block containing the truth catalogs of the sources whose positions lie in the image of interest |


0 comments on commit 332b6b2

Please sign in to comment.