We should have some utilities to validate and re-cast to the target schema given they exist in functional forms. #37
Labels
enhancement
New feature or request
Pending Community Use
Issues that should be solved after sufficient community uptake and use to dictate method
priority:medium
Medium priority; should be triaged for inclusion in near-term releases.
Usability
For the usability of the MEDS schema more generally by the community at a technical level.
E.g., see this function: https://github.com/mmcdermott/MEDS_transforms/blob/573816cbf3f6005a8fc25eb25424706ca0c97b6e/src/MEDS_transforms/extract/finalize_MEDS_metadata.py#L28
This is polars-specific, obviously, which we don't want to be, but having the ability to identify whether a `codes.parquet` or a `data/*.parquet` file meets a valid extended schema, and to convert it to the right pyarrow schema, is very useful (especially because there exist minor differences we should be cognizant of, like `large_string` vs. `string`, etc.).

Tagging @EthanSteinberg for your input.