Replies: 2 comments 1 reply
-
I think that there should be an output table for the crosswalk, since this is a table that other data users may want to use independently (for example, we access it in our eGRID project, and it would be great to have a standardized version available from pudl). Other ideas:
|
Beta Was this translation helpful? Give feedback.
1 reply
-
Ok @grgmiller I just tacked an output table onto that same PR! #1692 getting very close to done!! 🎉 That is, until the next crosswalk release... :) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Historically we have not given glue tables their own output table. This is because the output or "analysis" table generally represents the denormalized version of the tables in the pudl db and combines info from glue tables with other tables.
I don't really want to integrate CEMS ids into all EIA data or vice versa, so I think it makes most sense to just have an accessible crosswalk that you can merge with whatever data you want whenever you want. My question is, Should this be an output table?
I think this ties into larger conversations surrounding PUDL access. Are we going to encourage people to tap into the SQL db more often? Are we going to save and catalog (for intake) the normalized or denormalized versions of the tables? Are we going to treat the output tables as a caching mechanism more than a denormalization mechanism?
I want to make sure we aren't confusing users by giving them access to some tables this way and some tables that way.
Beta Was this translation helpful? Give feedback.
All reactions