-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CAMISIM: Generate the abundance profiles #19
Comments
CREATING DATAFRAME
pseudo code
|
|
Going with 70-80% Fungi, 20-30% Bacteria/Archaea and 0.5-2% Plasmids/Circular DNA/Virus #17 |
Example
The relative abundance between the endophytes are:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
With the chosen genomes, generate the abundance profiles.
Smartest seems to create a bash script that generates it for us.
Here is info from my comment on [#13]
HOW I UNDERSTAND THE ABUNDANCE CALCULATION
The abundance is calculated based on the total sum of genome sizes.
G1, G2, G3, ..., G10, Orchid
genomes. And i wantOrchid
to have an abundance of 50%.G1:G5
is 1000bp each,G6:G10
is 1500bp each andOrchid
is 12000bpabu = 1 / (number of genomes - 1)
Orchid
to 0.5abu_tot = abu_orchid + sum(abu_G1:abu_G10)
nrm_abu_orchid = abu_ochid / abu_tot
= 0.5 / 1.5 = 0.3333abundance.tsv
file0.0741 x 10 + 0.3333 ~ 1
Good info on these issues
The text was updated successfully, but these errors were encountered: