
Analysis pipeline for Johnson et al 2021 (Hemp genotyping and genomic selection)


wallacelab/paper-johnson-hemp-gs


Analysis pipeline for Johnson et al. 2021

This is the analysis pipeline for Johnson et al. 2021, "Genomic and Chemical Diversity of Commercially Available Industrial Hemp Accessions." It contains all analysis scripts, starting from the step that turns aligned BAM files into genotypes. It also includes key intermediate files so that users can pick up partway through instead of rerunning the entire pipeline from scratch.

Setting up the Conda environment

This pipeline uses a Conda environment for reproducibility. To set up an identical environment:

  1. Install Anaconda (I recommend the Miniconda version)
  2. Follow the instructions in 0_CreateCondaEnvironment.sh to initialize an identical environment

Running the analysis

After setting up your Conda environment (above), you can run 0_RerunHempGenotyping.sh. The script assumes you have already aligned the FASTQ files to a genome and have the resulting BAM files. If you don't want to rerun the entire pipeline from scratch, you can start from any of the key intermediate files included in this repo.

Key files in this repository

  • Genotypes: 1g_hemp_filtered.sorted.vcf.gz
  • Keyfile linking sample number to accession name: 0_HDGS_keyfile.txt
  • HPLC results from each sample: 0_HDGS_HPLC_Results.csv
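
If you only want to inspect the key files listed above rather than rerun anything, a short script along these lines may be enough. This is a minimal sketch, not part of the pipeline itself: the helper names `summarize_vcf` and `read_table` are hypothetical, and it assumes the VCF follows the standard layout (nine fixed columns followed by one column per sample) and that the CSV has a header row. The delimiter and header of the keyfile are not documented here, so check the file before reading it the same way.

```python
import csv
import gzip


def summarize_vcf(path):
    """Return (sample_names, variant_count) for a gzip- or bgzip-compressed VCF."""
    samples = []
    n_variants = 0
    with gzip.open(path, "rt") as handle:
        for line in handle:
            if line.startswith("##"):
                continue  # meta-information lines
            if line.startswith("#CHROM"):
                # Columns after the 9 fixed VCF fields are the sample names
                samples = line.rstrip("\n").split("\t")[9:]
            elif line.strip():
                n_variants += 1  # every remaining non-blank line is one variant record
    return samples, n_variants


def read_table(path, delimiter=","):
    """Read a delimited text file into a list of dicts.

    Assumes the first row is a header (an assumption about these files).
    """
    with open(path, newline="") as handle:
        return list(csv.DictReader(handle, delimiter=delimiter))
```

Run from the repository root, `summarize_vcf("1g_hemp_filtered.sorted.vcf.gz")` would return the sample names and the number of variant records, and `read_table("0_HDGS_HPLC_Results.csv")` would return one dict per row of HPLC results. Only the standard library is used, so this works without the Conda environment.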
