Version reference database with parameter #189

nebfield · 2023-10-03T10:31:07Z

New internal parameter: --ref_format_version

Starting version at v0.1, previous database versions matched the workflow version, but we want to version them separately because references should rarely change

Information about the ancestry version

* Allow VCF inputs with variable chromosome fields, but filter the variants to canonical or user-specified chromosomes. * Fix test * Documentation edits for VCF import * typo * Clairification on sampleset naming conventions (also enforced in check_samplesheet within utils) * set up offline testing * fix singularity url * update samplesheets * fail when join goes bad * make meta key for vmiss match converted vcf * fix --only_projection * add vcf ancestry test * fix plink call * what's wrong with my yaml :( * arf * double arf * I hate yaml * fix test name * activate ancestry vcf singularity test * fix test name * add retry for score and combine (often runs out of RAM on larger datasets) * Bump utils to v0.4.2 (#185) * bump v0.4.1 -> v0.4.2 * remove whitespace * fix tag * Fix conda action on publish (#184) * fix finding mamba * test ancestry with conda too * fix mamba profile * remove matrix from ancestry * fix conda channel message * Fix plink2_score options (#181) * Add mean-imputation back to scoring when samples aren't using a reference panel * Ignore allele frequency calculation ONLY when we have a reference, else just calculate allele frequencies on scoring file variants (with --extract) * Only use --error-on-freq-calc when non-mean-imputation is applied. Use extract to consider only variants from the scoring file. * Better comment placement * update test to match behaviour * fix test * add token to avoid rate limit * check plink log more thoroughly --------- Co-authored-by: Benjamin Wingfield <[email protected]> * Note about score precision (Closes #162) * Handle new reference naming (#173) * More default memory for plink, needed for a bigger reference (gnomAD). * More generalized file pattern for relatedness files in a reference ("GRCh3#_*.king.cutoff.out.id") with a single meta ID. ToDo: rename the deg2 relatedness files in setup_resource.nf / bootstrap_ancestry.nf * Update report.qmd to handle other colours * give king cutoff files consistent names * fix test * add version check to extract_database * add version check to extract_database --------- Co-authored-by: Benjamin Wingfield <[email protected]> * Update changelog.rst * Version reference database with parameter (#189) * version reference db separately * Update RELEASE-CHECKLIST.md Information about the ancestry version --------- Co-authored-by: Sam Lambert <[email protected]> --------- Co-authored-by: smlmbrt <[email protected]>

version reference db separately

19eb9d3

nebfield changed the title ~~Update db version~~ Version reference database with parameter Oct 3, 2023

Update RELEASE-CHECKLIST.md

da4ef3c

Information about the ancestry version

nebfield merged commit 9cc3ef8 into dev Oct 3, 2023

nebfield deleted the db_version branch October 3, 2023 11:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Version reference database with parameter #189

Version reference database with parameter #189

nebfield commented Oct 3, 2023 •

edited

Loading

Version reference database with parameter #189

Version reference database with parameter #189

Conversation

nebfield commented Oct 3, 2023 • edited Loading

nebfield commented Oct 3, 2023 •

edited

Loading