Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Named alleles #33

Merged
merged 10 commits into from
Feb 2, 2024
Merged

Named alleles #33

merged 10 commits into from
Feb 2, 2024

Conversation

apriltuesday
Copy link
Collaborator

@apriltuesday apriltuesday commented Jan 24, 2024

Better diff of test output here

Example named-allele evidence:

{
  "datasourceId": "pharmgkb",
  "datasourceVersion": "2024-01-05",
  "datatypeId": "clinical_annotation",
  "studyId": "1448104189",
  "evidenceLevel": "2A",
  "literature": [
    "16338275",
    "17329992",
    "26389554",
    "26389554",
    "27286724",
    "17178267",
    "17502774",
    "17502774",
    "21589866",
    "28723731",
    "28723731",
    "32302325"
  ],
  "genotype": "*6",
  "genotypeAnnotationText": "Patients carrying the CYP2B6*6 allele in combination with a normal or decreased function allele may have decreased clearance of methadone as compared to patients with two normal function alleles. However, conflicting evidence has been reported. Other genetic and clinical factors may also affect methadone metabolism. This annotation only covers the pharmacokinetic relationship between CYP2B6 and methadone and does not include evidence about clinical outcomes.",
  "directionality": "Decreased function",
  "drugFromSource": "methadone",
  "drugFromSourceId": "CHEBI_6807",
  "pgxCategory": "metabolism/pk",
  "phenotypeText": "Heroin Dependence",
  "phenotypeFromSourceId": "EFO_0004240",
  "haplotypeId": "CYP2B6*6",
  "haplotypeFromSourceId": "PA165818762",
  "targetFromSourceId": "ENSG00000197408"
}

@apriltuesday apriltuesday marked this pull request as ready for review January 25, 2024 12:18
Copy link
Member

@tcezard tcezard left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes looks ok.
The current model (if kept as is) would have to thoroughly explain though because it is a bit confusing.
The new haplotypeId seems to make sense since the named alleles represent haplotypes but in some case it won't match. (haplotypeId: GSTT1 non-null/non-null). Maybe we should discuss with OT to have a more appropriate name
For the genotype, it is similarly ambiguous although more often wrong than right.

The options are:

  • Changing the name to something that reflect the heterogenicity of the data
  • Document the heterogenicity in the field's description.

@apriltuesday
Copy link
Collaborator Author

Updates to schema as per this thread

@apriltuesday apriltuesday requested a review from tcezard February 1, 2024 12:19
@apriltuesday apriltuesday merged commit bcd2fd5 into EBIvariation:main Feb 2, 2024
1 check passed
@apriltuesday apriltuesday deleted the named-alleles branch January 8, 2025 14:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants