Skip to content

R package for quality control of plink genetic datasets

License

Notifications You must be signed in to change notification settings

hzahroh/plinkQC

 
 

Repository files navigation

CRAN_Status_Badge Build Status License: MIT Downloads

plinkQC

plinkQC is a R/CRAN package for genotype quality control in genetic association studies. It makes PLINK basic statistics (e.g.missing genotyping rates per individual, allele frequencies per genetic marker) and relationship functions easily accessible from within R and allows for automatic evaluation of the results.

Full documentation is available at http://meyer-lab-cshl.github.io/plinkQC/.

plinkQC generates a per-individual and per-marker quality control report. A step-by-step guide on how to run these analyses can be found here.

Individuals and markers that fail the quality control can subsequently be removed with plinkQC to generate a new, clean dataset.

plinkQC facilitates an ancestry check for study individuals based on comparison to reference datasets. The processing of the reference datasets is documented in detail here.

Removal of individuals based on relationship status via plinkQC is optimised to retain as many individuals as possible in the study.

Installation

The current github version of plinkQC is: 0.3.3 and can be installed via

library(devtools)
install_github("meyer-lab-cshl/plinkQC")

The current CRAN version of plinkQC is: 0.3.2 and can be installed via

install.packages("plinkQC")

A log of version changes can be found here.

Citation

Meyer HV (2018) plinkQC: Genotype quality control in genetic association studies. doi:10.5281/zenodo.3373798

About

R package for quality control of plink genetic datasets

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages

  • R 95.8%
  • C++ 4.2%