Duplicate filename warnings while archiving FERC XBRL data #311

Open
zaneselvans opened this issue Mar 30, 2024 · 1 comment
Labels
ferc1 FERC Form 1 xbrl Anything having to do with our FERC XBRL data sources

Comments

@zaneselvans
Member

I tried running the FERC Form 1 archiver locally and saw a number of warnings about duplicate filenames in zipfiles. E.g.

UserWarning: Duplicate name: 'System_Energy_Resources,_Inc._form1_Q4_1702681857.xbrl'
UserWarning: Duplicate name: 'System_Energy_Resources,_Inc._form1_Q4_1702684358.xbrl'
UserWarning: Duplicate name: 'System_Energy_Resources,_Inc._form1_Q4_1702685900.xbrl'
UserWarning: Duplicate name: 'NextEra_Energy_Transmission_New_York,_Inc._form1_Q4_1708143331.xbrl'
UserWarning: Duplicate name: 'NorthWestern_Corporation_form1_Q4_1709182516.xbrl'

Do we expect there to be filename collisions?
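
For context, this warning text is what Python's standard-library `zipfile` module emits when a member name is written to an archive that already contains it. A minimal reproduction, assuming the archiver writes each filing into a zip by filename (the helper name `duplicate_name_warnings` and the placeholder file contents are hypothetical, not taken from the archiver code):

```python
import io
import warnings
import zipfile


def duplicate_name_warnings(names):
    """Write each name into an in-memory zip and return the warning messages raised."""
    buffer = io.BytesIO()
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning, even repeats of the same message.
        warnings.simplefilter("always")
        with zipfile.ZipFile(buffer, "w") as archive:
            for name in names:
                # Placeholder bytes; the real archiver would write the filing itself.
                archive.writestr(name, b"placeholder")
    return [str(w.message) for w in caught]


name = "System_Energy_Resources,_Inc._form1_Q4_1702681857.xbrl"
messages = duplicate_name_warnings([name, name])
print(messages)
```

Note that `zipfile` only warns here: both entries still end up in the archive under the same name.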

@zaneselvans zaneselvans added xbrl Anything having to do with our FERC XBRL data sources ferc1 FERC Form 1 labels Mar 30, 2024
@zschira
Member

zschira commented May 1, 2024

I noticed this while working on the FERC archivers recently. The main RSS feed contains only the most recent filings, while older filings can only be found in month-specific feeds. This leads to collisions when a recent filing is available in both a month-specific feed and the main feed. The two copies should be the exact same filing, so it shouldn't really be a problem, but I think it would be best to fix this and raise an error if we see unexpected duplicates.
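
One possible shape for that fix, as a rough sketch rather than the actual archiver code: merge the filings from all feeds by filename, silently drop a repeat whose content is identical, and raise if two filings share a name but differ. The function name `merge_feed_filings`, the feed names, and the `(filename, content)` structure are all assumptions made for illustration.

```python
import hashlib


def merge_feed_filings(feeds):
    """Merge filings from several feeds, keyed by filename.

    feeds: mapping of feed name -> list of (filename, content bytes).
    Identical repeats are expected (main feed vs. month-specific feed)
    and are dropped; a same-name filing with different content raises.
    """
    merged = {}
    for feed_name, filings in feeds.items():
        for filename, content in filings:
            digest = hashlib.sha256(content).hexdigest()
            if filename not in merged:
                merged[filename] = (digest, content)
            elif merged[filename][0] != digest:
                raise ValueError(
                    f"Unexpected duplicate {filename!r} in feed {feed_name!r}: "
                    "content differs from the copy already seen"
                )
    return {name: content for name, (_, content) in merged.items()}


# Hypothetical feeds: the same filing appears in both the main and a monthly feed.
feeds = {
    "main": [("utility_form1_Q4_1.xbrl", b"<xbrl/>")],
    "2024-02": [("utility_form1_Q4_1.xbrl", b"<xbrl/>")],
}
print(sorted(merge_feed_filings(feeds)))
```

Comparing a hash of the content rather than only the filename means the expected overlap between feeds passes quietly, while a genuine collision surfaces as an error instead of a warning.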
