Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Develop HGVS evaluation framework #20

Open
reece opened this issue Feb 1, 2024 · 0 comments
Open

Develop HGVS evaluation framework #20

reece opened this issue Feb 1, 2024 · 0 comments
Labels
project proposal project proposal for interns and GSoC students

Comments

@reece
Copy link
Member

reece commented Feb 1, 2024

Difficulty Expected Duration Possible Mentors
Medium 175h or 350h depending on scope @reece

Summary

HGVS Nomenclature is the lingua franca for presenting sequence variants to humans in databases, scientific literature, and clinical reports. Unfortunately, some implementations of the standard are buggy and/or incomplete. The purpose of this project is to develop an framework for evaluating implementations of HGVS Nomenclature to help users understand the limitations of packages, to help package developers improve their packages, and ultimately to increase the fidelity of data sharing. As a byproduct of the proposed plan, an OpenAPI specification would be developed and, if adopted, enable easier migration between tools.

Community Benefits

The primary beneficiaries of this work would be the users of specific implementations who would be able to make well-informed decisions about gaps or issues. Since these users often work with clinicians, scientists, or adjacent professionals, it is expected that they would develop greater confidence that HGVS variant descriptions and manipulations were being performed consistently.

Required and Desired Skills

  • Python: very good-excellent
  • OpenAPI: some experience is ideal

Expected Results / Deliverables

  • An OpenAPI specification for HGVS Nomenclature variant descriptions and manipulation
  • A set of classified tests that exercise all aspects of HGVS Nomenclature
  • A framework for executing/evaluating packages, storing and publishing results, and comparing within and across implementations
  • Demonstration of the framework using at least one publicly available implementation (ideally more)
  • Package the project so that it can be used in CI/CD testing and by a public evaluation server
  • Develop a full-stack app to store and display evaluation results.

Benefits to Intern

This project is well-suited to individuals who have experience with REST interface design and testing methodology. Implementation will be in Python, and applicants should have strong Python skills. Exposure to HGVS and genetic variants is desirable but not required. This project has enormous opportunities to learn and to have significant impact in the scientific community. At a minimum, the intern would:

  • become familiar with HGVS Nomenclature and some of the challenges when writing standards
  • learn how to design a REST interface and use it to create a façade over different software packages
  • develop software architecture skills

This project could be simple, but also offers many avenues for extensions and learning. For example, the student might seek to build a web interface with authentication to store results.

How to apply

Students applying to this project should briefly describe their vision for this project, highlight their existing skills and the skills they would need to learn, and estimate an implementation timeline.

@reece reece added the project proposal project proposal for interns and GSoC students label Feb 1, 2024
@reece reece changed the title [Project Proposal]: Develop HGVS evaluation framework (WIP) Develop HGVS evaluation framework Feb 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
project proposal project proposal for interns and GSoC students
Projects
Status: No status
Development

No branches or pull requests

1 participant