Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GQ score clarification #68

Open
Derrup opened this issue Nov 4, 2020 · 3 comments
Open

GQ score clarification #68

Derrup opened this issue Nov 4, 2020 · 3 comments

Comments

@Derrup
Copy link

Derrup commented Nov 4, 2020

Hi, I have converted some GTC files to VCF and I have noticed that the GQ score is always between 0 and 17. I have done the conversion with both Beeline and IAAP but the result does not change. I was under the impression that this value should be equal to the GenCall score. Could you please clarify?
Is there a way to have the actual GenCall score in the GTC files?

@jjzieve
Copy link
Contributor

jjzieve commented Nov 4, 2020

It is the GenCall score just phred scaled https://en.wikipedia.org/wiki/Phred_quality_score

@XubCherif
Copy link

XubCherif commented Nov 23, 2020

Getting the score between 0 and 17 I would like to ask about the recommended threshold. Could you please advise?

@jjzieve
Copy link
Contributor

jjzieve commented Nov 23, 2020

This is where the calculation occurs: https://github.com/Illumina/GTCtoVCF/blob/develop/GencallFormat.py#L62
A score of 14 would be about a 0.96. The default Gencall cutoff to label as a "NoCall" is 0.15 (https://www.illumina.com/documents/products/technotes/technote_infinium_genotyping_data_analysis.pdf). So I'd say anything greater than or equal to 1 is a an O.K. call. But it depends on how stringent you want your results to be. The reason we moved away from reporting the gencall score directly is to stick to the VCF spec as close as possible as to mitigate bugs in secondary analysis pipelines that consume VCFs. In the spec we reference (https://samtools.github.io/hts-specs/VCFv4.1.pdf) it lists GQ as a phred-scaled integer instead of a float. If you'd like a separate field for the raw gencall score, it should be a pretty straightforward PR that I'd be happy to review.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants