AnantharamanLab/Kieft_et_al_2021_organosulfur_AMGs

Virus-associated organosulfur metabolism in human and environmental systems

The published version of this manuscript is available in Cell Reports.

August 2021
Kristopher Kieft
Karthik Anantharaman
University of Wisconsin-Madison

Explanation of files

Kieft_et_al_2021_virus_AMGs.accnos : IMG/VR, RefSeq and GenBank names or accession numbers of all 4,103 viral auxiliary metabolic gene (AMG) protein sequences, corresponding to the sequences in Kieft_et_al_2021_virus_AMGs.faa.

Kieft_et_al_2021_virus_AMGs.faa : All 4,103 viral AMG protein sequences.
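The correspondence between the .accnos list and the .faa file can be checked after download. The following is a minimal sketch, not part of the original repository. It assumes that the .accnos file holds one name per line and that each name equals the first whitespace-delimited token of a FASTA header; if the names contain spaces, the comparison would need to use full header lines instead. The function names are hypothetical.

```python
from pathlib import Path


def fasta_ids(path):
    """Return the first whitespace-delimited token of every FASTA header line."""
    ids = []
    with open(path) as handle:
        for line in handle:
            if line.startswith(">"):
                header = line[1:].strip()
                ids.append(header.split()[0] if header else "")
    return ids


def read_accnos(path):
    """Return the non-empty lines of an .accnos file (assumed one name per line)."""
    with open(path) as handle:
        return [line.strip() for line in handle if line.strip()]


def compare_ids(accnos_path, fasta_path):
    """Return (names only in the .accnos file, names only in the FASTA file)."""
    listed = set(read_accnos(accnos_path))
    present = set(fasta_ids(fasta_path))
    return sorted(listed - present), sorted(present - listed)


if __name__ == "__main__":
    accnos = Path("Kieft_et_al_2021_virus_AMGs.accnos")
    faa = Path("Kieft_et_al_2021_virus_AMGs.faa")
    # Only run the check when both files are in the working directory.
    if accnos.exists() and faa.exists():
        missing, extra = compare_ids(accnos, faa)
        print(f"{len(read_accnos(accnos))} names listed, {len(fasta_ids(faa))} sequences found")
        print(f"{len(missing)} names without a sequence, {len(extra)} sequences without a name")
```

If the files match as described, both reported differences should be zero and both counts should equal 4,103.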

Kieft_et_al_2021_virus_genomes_full_01.fna : First half of the 3,749 viral genomes from IMG/VR, RefSeq and GenBank that encode an AMG. Note: the genomes are split across two files because of their large size.

Kieft_et_al_2021_virus_genomes_full_02.fna : Second half of the 3,749 viral genomes from IMG/VR, RefSeq and GenBank that encode an AMG. Note: the genomes are split across two files because of their large size.

Kieft_et_al_2021_virus_genomes.accnos : IMG/VR, RefSeq and GenBank names or accession numbers of all 3,749 viral genome sequences, corresponding to the sequences in Kieft_et_al_2021_virus_genomes_full_01.fna and Kieft_et_al_2021_virus_genomes_full_02.fna.
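Because the genomes are split across two files, it can be convenient to rejoin them before analysis. The sketch below is one way to do this, not a script from this repository. The output filename Kieft_et_al_2021_virus_genomes_full.fna is a hypothetical name chosen for illustration, and the record count assumes that every sequence starts with a ">" header line.

```python
import os


def concatenate(parts, combined):
    """Write the files in `parts` to `combined`, in order, as raw bytes."""
    with open(combined, "wb") as out:
        for part in parts:
            last = b"\n"
            with open(part, "rb") as src:
                # Copy in 1 MiB chunks so large genome files are never held in memory.
                for chunk in iter(lambda: src.read(1 << 20), b""):
                    out.write(chunk)
                    last = chunk[-1:]
            # Guard against a missing final newline, which would otherwise
            # glue the next header onto the previous sequence line.
            if last != b"\n":
                out.write(b"\n")


def count_records(path):
    """Count FASTA records by counting lines that start with '>'."""
    with open(path, "rb") as handle:
        return sum(1 for line in handle if line.startswith(b">"))


if __name__ == "__main__":
    parts = [
        "Kieft_et_al_2021_virus_genomes_full_01.fna",
        "Kieft_et_al_2021_virus_genomes_full_02.fna",
    ]
    combined = "Kieft_et_al_2021_virus_genomes_full.fna"  # hypothetical output name
    if all(os.path.exists(p) for p in parts):
        concatenate(parts, combined)
        print(f"{count_records(combined)} genomes written to {combined}")
```

The combined file should contain 3,749 records if both halves were downloaded completely.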

Kieft_et_al_2021_virus_genomes.faa : All 167,379 proteins encoded by the IMG/VR viruses for which proteins were predicted in this study (excludes viral proteins from RefSeq, GenBank and other studies).

Kieft_et_al_2021_Supplemental_Data_S1 : A folder containing one FASTA file for each of the 39 AMGs. Each file is named with the three- or four-letter protein ID of the AMG followed by the corresponding KEGG orthology number. Taken together, the protein sequences in this folder are identical to those in Kieft_et_al_2021_virus_AMGs.faa.
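The statement that the per-AMG files add up to the combined AMG file can be verified with a short comparison. This is a sketch under stated assumptions rather than the authors' procedure: it treats every regular file in the folder as FASTA, compares records by full header line and unwrapped sequence, and ignores record order and line wrapping. If the same header appeared in more than one file, the later copy would silently replace the earlier one. The function names are hypothetical.

```python
from pathlib import Path


def read_fasta(path):
    """Map each full header line (without '>') to its unwrapped sequence."""
    records = {}
    name = None
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                name = line[1:]
                records[name] = []
            elif line and name is not None:
                records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


def read_folder(folder):
    """Merge the records of every regular file in `folder` (assumed all FASTA)."""
    merged = {}
    for path in sorted(Path(folder).iterdir()):
        if path.is_file():
            merged.update(read_fasta(path))
    return merged


def same_sequences(folder, combined):
    """True when the folder and the combined file hold the same header/sequence pairs."""
    return read_folder(folder) == read_fasta(combined)
```

For example, `same_sequences("Kieft_et_al_2021_Supplemental_Data_S1", "Kieft_et_al_2021_virus_AMGs.faa")` would be expected to return `True` if headers are written the same way in both places.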

Kieft_et_al_2021_Supplemental_Data_S2 : A folder containing one FASTA file per alignment, corresponding to Figures S1 and S3 to S8 of the full manuscript.

Kieft_et_al_2021_Supplemental_Data_S3 : Full genome sequence, in FASTA format, of the cysC-encoding Lake Mendota virus.
