The here available summary table was used in Figure 1 in Rodríguez-Gijón and Nuy et al., 2022 (Frontiers in Microbiology). We include below information on how this table was created:
In this review, we have included 64,500 environmental MAGs available via two recently published
datasets, stratfreshDB and GEMs. StratfreshDB offers 12,000 MAGs (>40% completeness) from 41
stratified lakes and ponds assembled with Megahit (v1.1.13) and binned with Metabat (v2.12.1)
(Buck et al., 2021a). GEMs offers 52,000 MAGs (>50% completeness) from > 10,000 metagenomes
collected from diverse habitats on Earth (Nayfach et al., 2020). GEMs dataset was assembled
using metaSPAdes and binned with Metabat (v0.32.5). After dereplication using fastANI and
mOTUlizer (Jain et al., 2018; Buck et al., 2021b), our meta-analysis includes 17 834 mOTUs, with
one representative MAG each (completeness > 50% and contamination < 5%, assessed with CheckM
v1.1.3) (Parks et al., 2015). We complemented these environmental MAGs by adding 8 267
representative genomes (>90% complete) of isolates from GTDB (Parks et al., 2020; Figure 1A).
These genomes are marked in the GTDB databases (release 95) to originate from culture collections.
Please note that the table has 4 genomes of origin in "Air" environment which we did not include in our meta analysis.
- Rodríguez-Gijón A, Nuy JK, Mehrshad M, Buck M, Schulz F, Woyke T, et al. A genomic perspective across Earth’s microbiomes reveals that genome size in Archaea and Bacteria is linked to ecosystem type and trophic strategy. Frontiers in Microbiology. 2022 Jan 5;12(761869). https://doi.org/10.3389/fmicb.2021.761869.
- Buck M, Garcia SL, Fernandez L, Martin G, Martinez-Rodriguez GA, Saarenheimo J, et al. Comprehensive dataset of shotgun metagenomes from oxygen stratified freshwater lakes and ponds. Scientific Data. 2021 Dec;8(1):131. https://doi.org/10.1038/s41597-021-00910-1.
- Nayfach S, Roux S, Seshadri R, Udwary D, Varghese N, Schulz F, et al. A genomic catalog of Earth’s microbiomes. Nature Biotechnology. 2020 Apr;39(4):499–509. https://doi.org/10.1038/s41587-020-0718-6.