Catalyst operational tasks not related to coding.
Data analysis tasks that involve actually using PUDL to figure things out, like calculating MCOE.
Distributing or working with PUDL data using Google BigQuery
Business development and outreach to users & stakeholders.
Things that are just plain broken.
Tasks related to CCAI grant for entity matching
Issues related to the Census DP1 dataset which we distribute as an SQLite DB
Scripts and other command line interfaces to PUDL.
Stuff that has to do with adapting PUDL to work in cloud computing context.
Issues related to working with / extracting data from CSV files
Issues related to our use of the Dagster orchestrator
Tasks related to cleaning & regularizing data during ETL.
data that we expect should exist seem to be missing or dropped in pudl tables
Interpolating or extrapolating data that we don't actually have.
Dtype conversions, standardization and implications of data types
When fresh data is integrated into PUDL from quarterly or annual updates
Issues related to checking whether data meets our quality expectations.
Frictionless data package input, output, metadata, manipulation
Issues related the accessing PUDL data via Datasette.
Managing the acquisition and organization of external raw data.
Data coming from FERC's old Visual FoxPro DBF database file format.
Issues related to the data build tool aka dbt
Pull requests that update a dependency file
A discussion about design issues and other project decisions.
Documentation for users and contributors.
Issues referring to duckdb, the embedded OLAP database
Issues related to Greg Miller's hourly eGRID project