Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SOP for pheval releases #341

Open
matentzn opened this issue Jun 13, 2024 · 0 comments
Open

SOP for pheval releases #341

matentzn opened this issue Jun 13, 2024 · 0 comments
Assignees

Comments

@matentzn
Copy link
Member

matentzn commented Jun 13, 2024

I would suggest the following SOP for PhEval releases

  1. We manually trigger a PhEval release by creating a tagged GitHub release
  2. With the triggered release, pheval is built and published on pypi
  3. At the same time, a Zenodo dump is created with the all the corpora nicely zipped (alternatively we can rely on https://github.com/monarch-initiative/pheval/archive/refs/tags/v2024-06-04.tar.gz etc which is created automatically, but then we dont have a doi)

For the pheval pipelines, we have a little config element called

pheval-version

This will

  1. install a specific version of pheval instead of the latest
  2. during corpus preparation download the archive corresponding to the version (either zenodo or github, see above)

In effect this will ensure

  1. That our corpus version is always compatible with our tool version, for example when the schema changes, or we use different gene ids, or any other changes related to the phenopackets that need to be taken into account by preprocessing
  2. That all our experiments are fully reproducible.
  3. That our pypi releases always have an associated tag in version control

@souzadevinicius @yaseminbridges @julesjacobsen

What do you think?

There is one thing I dont like about that:

All corpora go into one mega archive. It would be sort of nice if I only wanted to test one corpus, that the pipeline only downloads that one.

In theory this is easy:

The GitHub action that performs the release can zip the corpora individually and attach them to the release. In practice I dont know if the disc space limitations in GitHub actions will be exceeded. I would vote to try this..

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants