Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Publish August 1st 2024 archives #398

Closed
21 tasks done
github-actions bot opened this issue Aug 1, 2024 · 7 comments
Closed
21 tasks done

Publish August 1st 2024 archives #398

github-actions bot opened this issue Aug 1, 2024 · 7 comments
Assignees
Labels
automation Issues relating to automated archiver runs zenodo

Comments

@github-actions
Copy link

github-actions bot commented Aug 1, 2024

Summary of results:

See the job run logs and results here.
Second run of CEMS and NREL ATB data here.

Review and publish archives

For each of the following archives, find the run status in the Github archiver run. If validation tests pass, manually review the archive and publish. If no changes detected, delete the draft. If changes are detected, manually review the archive following the guidelines in step 3 of README.md, then publish the new version. Then check the box here to confirm publication status, adding a note on the status (e.g., "v1 published", "no changes detected, draft deleted"):

Tasks

Preview Give feedback

Validation failures

For each run that failed because of validation test failures (seen in the GHA logs), add it to the tasklist. Download the run summary JSON by going into the "Upload run summaries" tab of the GHA run for each dataset, and follow the link. Investigate the validation failure.

If the validation failure is deemed ok after manual review (e.g., Q2 of 2024 data doubles the size of a file that only had Q1 data previously, but the new data looks as expected), go ahead and approve the archive and leave a note explaining your decision in the task list.

If the validation failure is blocking (e.g., file format incorrect, whole dataset changes size by 200%), make an issue to resolve it.

Tasks

Preview Give feedback
No tasks being tracked yet.

Other failures

For each run that failed because of another reason (e.g., underlying data changes, code failures), create an issue describing the failure and take necessary steps to resolve it.

Tasks

Preview Give feedback
@zaneselvans
Copy link
Member

It seems a little fishy to me that the ferc1 took 2 hours, but ferc2 only took 3 minutes, given that their archives should end up being about the same size, and almost all of the ferc2 files got updated.

@e-belfer
Copy link
Member

e-belfer commented Aug 1, 2024

Still working my way through the archives, I'll take a look.

@e-belfer
Copy link
Member

e-belfer commented Aug 2, 2024

Everything has been inspected and published.

@e-belfer e-belfer closed this as completed Aug 2, 2024
@github-project-automation github-project-automation bot moved this from In progress to Done in Catalyst Megaproject Aug 2, 2024
@zaneselvans
Copy link
Member

Would it be easy to automate checking for the kind of failed upload that CEMS experienced this time around? Like check that all the files in the datapackage area actually in the draft deposition and have the same checksum?

@e-belfer
Copy link
Member

e-belfer commented Aug 7, 2024

The datapackage and checksums are produced at the end from the files uploaded, so I'm not exactly sure what you're proposing? We already check file size against the last upload. This seems to be some kind of problem with the way that 502 errors are getting retried.

@zaneselvans
Copy link
Member

I was imagining that we could calculate the file size and/or checksums locally, and compare to the file sizes and/or checksums that are reported on Zenodo, and if they don't match, raise an error.

Are you saying that the filesizes & checksums that end up in the datapackage.json are being populated based on the information on Zenodo, rather than the local files?

@e-belfer
Copy link
Member

e-belfer commented Aug 9, 2024

Ah yes, that would be a pretty straightforward validation! I'll write up an issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
automation Issues relating to automated archiver runs zenodo
Projects
Archived in project
Development

No branches or pull requests

2 participants