Skip to content

Latest commit

 

History

History
29 lines (16 loc) · 2.03 KB

File metadata and controls

29 lines (16 loc) · 2.03 KB

Virus-associated organosulfur metabolism in human and environmental systems

The published version of this manuscript can be found at Cell Reports.

August 2021
Kristopher Kieft
Karthik Anantharaman
University of Wisconsin-Madison

Explanation of files

Kieft_et_al_2021_virus_AMGs.accnos : All 4,103 IMG/VR, RefSeq and Genbank names or accession numbers of viral AMG protein sequences, respective to Kieft_et_al_2021_virus_AMGs.faa.

Kieft_et_al_2021_virus_AMGs.faa : All 4,103 viral AMG protein sequences.

Kieft_et_al_2021_virus_genomes_full_01.fna : First half of the 3,749 viral genomes from IMG/VR, RefSeq and Genbank that encode an AMG. Note: genome files were split into two due to the large file sizes.

Kieft_et_al_2021_virus_genomes_full_02.fna : Second half of the 3,749 viral genomes from IMG/VR, RefSeq and Genbank that encode an AMG. Note: genome files were split into two due to the large file sizes.

Kieft_et_al_2021_virus_genomes.accnos : All 3,749 IMG/VR, RefSeq and Genbank names or accession numbers of viral genome sequences, respective to Kieft_et_al_2021_virus_genomes_full_01.fna and Kieft_et_al_2019_virus_genomes_full_02.fna.

Kieft_et_al_2021_virus_genomes.faa : All 167,379 proteins encoded by IMG/VR viruses in which proteins were predicted by this study (excludes viral proteins from RefSeq, Genbank and other studies).

Kieft_et_al_2021_Supplemental_Data_S1 : A folder containing a FASTA file for each of the 39 AMGs. Each file is named with the three or four letter protein ID of the AMG followed by the respective KEGG orthology number. The combination of all protein sequences in this folder is identical to Kieft_et_al_2021_virus_AMGs.faa.

Kieft_et_al_2021_Supplemental_Data_S2 : A folder containing a FASTA file for each of the alignments respective to Figures S1 and S3 to S8 of the full manuscript.

Kieft_et_al_2021_Supplemental_Data_S3 : Full genome sequence, in FASTA format, of the cysC-encoding Lake Mendota virus.