Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Harmonize PUDL and OGE subplant_id pipeline #3691

Open
6 tasks
grgmiller opened this issue Jun 22, 2024 · 0 comments
Open
6 tasks

Harmonize PUDL and OGE subplant_id pipeline #3691

grgmiller opened this issue Jun 22, 2024 · 0 comments

Comments

@grgmiller
Copy link
Collaborator

grgmiller commented Jun 22, 2024

Overview

In OGE, we are currently attempting to eliminate our dependency on the pudl codebase, and instead entirely rely on the pudl output tables. However, over the past few years, our pipeline for generating subplant_ids has diverged from the one used in PUDL, and so to be able to rely directly on pudl's subplant_id outputs, I'm creating this issue to begin thinking through how we can incorporate the updates we've made into pudl. @catalyst-cooperative/com-dev

In OGE, our subplant identification code occurs in this file: https://github.com/singularity-energy/open-grid-emissions/blob/development/src/oge/subplant_identification.py. This module has 4 main functions:

Our most recent major changes to our subplant_id pipeline are contained in singularity-energy/open-grid-emissions#353, which attempted to keep the subplant_ids as static as possible. This had previously been an issue for us because we were creating these IDs indpendently for a single year at a time, but you may have had less issue with that since you were likely already generating these IDs based on all years of data.

Areas to investigate for harmonization

core_epa__assn_eia_epacamd
In oge.load_data.load_epa_eia_crosswalk(), we augment this table using two manual data sources:

We also augment this table to add all plant, generator pairs before passing into pudl.analysis.epacamd_eia.filter_crosswalk(). Looking through your code, it looks like you may be no longer even using the filter_crosswalk() function anywhere?

Success Criteria

We can stop importing pudl.analysis.epacamd_eia and pudl.etl.glue_assets into OGE, and the two subplant tables are consistent

Next steps

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: New
Development

No branches or pull requests

1 participant