Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Advice on my study and parameters for complete #19

Open
ptranvan opened this issue May 4, 2021 · 1 comment
Open

Advice on my study and parameters for complete #19

ptranvan opened this issue May 4, 2021 · 1 comment

Comments

@ptranvan
Copy link

ptranvan commented May 4, 2021

Hi,

I am working on a non model organism and I am looking to estimate recombination rate along my genome and different part of the genome (ex chromosome, within chromosome etc.)

I have RADseq data of a family: parents and multiple offsprings and did a simple snp calling using Stacks in order to obtain a vcf file.

Now I a bit struggled about the following step.

  1. are the parents needed or I can compute the recombination rate just with the offpsrings ?

  2. I sorted my vcf with vcf-sort and convert with vcftools to ldhat .sites and .locs using vcftools then I did a quick test for one chromosome:

convert -seq chr1.ldhat.sites -loc chr1.ldhat.locs

I got a result but I am not sure how ton interpret it:

Summary of output data

Segregating sites      =      786
Average PWD            =  339.222
Watterson theta        =  138.827
Tajima D statistic     =    4.717
Fu and Li D* statistic =    3.062
  1. I want to create a lookup table with complete: it asked me the theta, is it the one estimated above ? Watterson theta.

Am I doing right ?

Thanks

@ptranvan ptranvan changed the title LDhat from VCF file ? Advice on my study and error with pairwise May 5, 2021
@ptranvan ptranvan changed the title Advice on my study and error with pairwise Advice on my study May 5, 2021
@ptranvan ptranvan changed the title Advice on my study Advice on my study and parameters for complete May 5, 2021
@viannam
Copy link

viannam commented Sep 22, 2021

Hi @ptranvan,

Try to select a subset of your data and run the pairwise, or other programs of LDhat, that gives you the suggested theta per site. Will be 0.01 or 0.001 depending on your study, as you can see in the ik files examples.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants