augmenting available sequence for a specific mammalian protein with VGP/Tree of Life raw reads #7

avilella · 2022-05-20T10:00:23Z

Hi all,

I am trying to augment the available sequences for a handful of specific vertebrate/mammalian proteins and my idea was to use 'diamond blastx' to blast fastq data from the VGP / Tree of Life raw reads.

I've seen the darwintreeoflife.data repo, which says it's discontinued, and this seems to be the one containing more up to date information (up to current month). Is there a way to get a long list of http bam or cram URLs for all the vertebrates/mammalian genomes in the Tree of Life project? E.g.

Something equivalent to the "*data.tsv" files in darwintreeoflife.data but up-to-date with current freshly generated data. Thanks in advance.

find darwintreeoflife.data/ -name "*data.tsv" | sort -V | xargs cat | grep -e 'bam$' -e 'cram$'

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

augmenting available sequence for a specific mammalian protein with VGP/Tree of Life raw reads #7

augmenting available sequence for a specific mammalian protein with VGP/Tree of Life raw reads #7

avilella commented May 20, 2022

augmenting available sequence for a specific mammalian protein with VGP/Tree of Life raw reads #7

augmenting available sequence for a specific mammalian protein with VGP/Tree of Life raw reads #7

Comments

avilella commented May 20, 2022