From 6b2c5dd2fd0bbaa9c6c5af7971ca30191a1e7075 Mon Sep 17 00:00:00 2001 From: Jennifer Chang Date: Thu, 17 Oct 2024 00:54:46 -0700 Subject: [PATCH] Add global root to example data --- phylogenetic/example_data/metadata.tsv | 3 +- phylogenetic/example_data/sequences.fasta | 139 ++++++++++++++++++++++ 2 files changed, 141 insertions(+), 1 deletion(-) diff --git a/phylogenetic/example_data/metadata.tsv b/phylogenetic/example_data/metadata.tsv index e67893b..96b6858 100644 --- a/phylogenetic/example_data/metadata.tsv +++ b/phylogenetic/example_data/metadata.tsv @@ -67,4 +67,5 @@ HM488132 2000-XX-XX North America USA Connecticut CT Culiseta melanura Armstro HQ671707 1999-XX-XX North America USA Connecticut CT Culex pipiens Henn et al. https://www.ncbi.nlm.nih.gov/nuccore/HQ671707 10607 NY99 AF202541 XXXX-XX-XX North America USA New York NY Jia et al. https://www.ncbi.nlm.nih.gov/nuccore/AF202541 10945 NY99 AF206518 XXXX-XX-XX North America USA Connecticut Greenwich-Stanford Town Line CT Culex pipiens Anderson et al. https://www.ncbi.nlm.nih.gov/nuccore/AF206518 10975 NY99 -AF481864 XXXX-XX-XX Ciconiidae Malkinson et al. https://www.ncbi.nlm.nih.gov/nuccore/AF481864 11029 pre-NY \ No newline at end of file +AF481864 XXXX-XX-XX Ciconiidae Malkinson et al. https://www.ncbi.nlm.nih.gov/nuccore/AF481864 11029 pre-NY +AF260968 XXXX-XX-XX Africa Egypt Bowen et al. https://www.ncbi.nlm.nih.gov/nuccore/AF260968 11029 NY99 \ No newline at end of file diff --git a/phylogenetic/example_data/sequences.fasta b/phylogenetic/example_data/sequences.fasta index 59ed8e5..6ad2c61 100644 --- a/phylogenetic/example_data/sequences.fasta +++ b/phylogenetic/example_data/sequences.fasta @@ -12443,3 +12443,142 @@ GACTAGAGGTTAGAGGAGACCCCGCGGTTTAAAGTGCACGGCCCAGCCTGACTGAAGCTG TAGGTCAGGGGAAGGACTAGAGGTTAGTGGAGACCCCGTGCCACAAAACACCACAACAAA ACAGCATATTGACACCTGGGATAGACTAGGAGATCTTCTGCTCTGCACAACCAGCCACAC GGCACAGTGCGCCGACAATGGTGGCTGGTGGTGCGAGAACACAGGATCT +>AF260968 +AGTAGTTCGCCTGTGTGAGCTGACAAACTTAGTAGTGTTTGTGAGGATTAACAACAATTAACACGGTGCGAGCTGTTTCT +TAGCACGAAGATCTCGATGTCTAAGAAACCAGGAGGGCCCGGCAAGAGCCGGGCTGTCAATATGCTAAAACGCGGAATGC +CCCGCGTGTTGTCCTTGATTGGACTGAAGAGGGCAATGTTGAGCCTGATCGACGGCAAGGGACCAATACGATTTGTGTTG +GCTCTCTTGGCGTTCTTCAGGTTCACAGCAATTGCTCCGACCCGAGCAGTGCTGGATCGATGGAGAGGTGTGAACAAACA +AACAGCGATGAAACACCTTCTGAGTTTTAAGAAGGAACTAGGGACCTTGACCAGTGCTATCAATCGGCGGAGCTCAAAAC +AAAAGAAAAGAGGAGGAAAGACCGGAATTGCAGTCATGATTGGCTTGATCGCCAGCGTGGGAGCAGTTACCCTCTCTAAC +TTCCAAGGGAAGGTGATGATGACTGTAAATGCCACTGACGTCACAGACGTCATCACGATTCCAACAGCTGCTGGAAAGAA +TCTATGCATTGTCAGAGCAATGGACGTGGGGTACATGTGTGATGATACTATCACCTATGAATGTCCAGTGCTGTCGGCTG +GTAATGATCCAGAAGACATCGACTGTTGGTGCACAAAATCAGCAGTCTACGTCAGGTATGGAAGATGCACCAAGACACGC +CACTCAAGACGTAGCCGGAGGTCACTGACAGTGCAGACACATGGAGAAAGCACTCTAGCGAACAAGAAGGGGGCTTGGAT +GGACAGCACCAAGGCTACAAGGTATTTGGTAAAAACAGAATCATGGATCTTGAGGAACCCCGGATATGCCCTGGTGGCAG +CCGTCATTGGTTGGATGCTTGGAAGCAACACCATGCAGCGAGTTGTGTTCGTTGTGCTACTGCTCTTGGTGGCTCCAGCC +TACAGCTTTAACTGCCTTGGAATGAGCAACAGAGACTTCTTAGAGGGAGTGTCTGGAGCAACATGGGTGGATTTGGTTCT +CGAAGGCGACAGCTGTGTGACCATCATGTCTAAGGACAAGCCTACCATCGATGTGAAGATGATGAATATGGAGGCCGCCA +ACCTGGCAGAGGTCCGCAGTTATTGCTATCTGGCCACCGTCAGCGATCTCTCCACCAAAGCTGCGTGCCCGACTATGGGA +GAAGCTCACAATGACAAACGTGCTGACCCAGCTTTTGTGTGTAAACAAGGAGTAGTGGACAGGGGTTGGGGCAACGGCTG +TGGACTATTTGGTAAAGGAAGCATTGACACATGCGCCAAATTTGCCTGTTCTACCAAGGCAACAGGAAGAACCATTCTGA +AAGAGAACATCAAGTACGAAGTGGCTATCTTTGTCCATGGACCAACCACTGTGGAGTCGCATGGAAACTACCCCACACAG +ATTGGGGCCACTCAGGCAGGGAGATTCAGCATCACTCCTGCGGCGCCTTCATACACACTAAAACTTGGAGAGTATGGAGA +GGTGACGGTGGACTGTGAACCACGATCAGGGATTGACACCAATGCATACTACGTGATGACTGTCGGAACAAAGACGTTCT +TGGTCCATCGTGAGTGGTTTATGGACCTCAACCTCCCCTGGAGCAGTGCCGGAAGCACTGTGTGGAGGAACAGAGAGACG +TTGATGGAGTTTGAGGAACCACACGCCACGAAGCAGTCTGTGATAGCATTGGGCTCACAAGAGGGAGCTCTGCATCAAGC +TTTGGCTGGAGCCATTCCTGTGGAATTTTCAAGCAACACTGTCAAGTTGACATCGGGTCATTTGAAGTGTAGAGTGAAGA +TGGAAAAATTGCAGTTGAAGGGAACAACCTACGGCGTCTGTTCAAAGGCTTTCAAGTTTCTTGGAACTCCCGCAGACACA +GGCCACGGCACTGTAGTGTTGGAATTGCAGTACACTGGCACGGATGGACCTTGCAAAGTTCCCATCTCGTCAGTGGCTTC +ATTGAACGACCTAACGCCAGTGGGCAGGTTGGTCACTGTCAACCCCTTTGTTTCAGTAGCCACGGCCAATGCCAAGGTCC +TGATTGAATTGGAACCACCCTTTGGAGACTCATACATAGTGGTGGGCAGAGGAGAACAACAGATTAATCACCATTGGCAC +AAGTCTGGAAGCAGCATTGGCAAAGCCTTCACAACCACCCTCAAAGGGGCGCAGAGATTAGCCGCCCTAGGAGATACAGC +TTGGGACTTTGGATCAGTTGGAGGGGTGTTCACCTCAGTGGGGAAGGCTGTCCATCAAGTGTTTGGTGGAGCATTCCGCT +CACTGTTCGGAGGCATGTCTTGGATAACGCAAGGATTGCTGGGGGCTCTGCTGTTGTGGATGGGCATCAATGCTCGTGAC +AGGTCCATAGCTCTCACGTTTCTCGCAGTTGGAGGGGTTTTGCTCTTTCTCTCCGTGAACGTGCACGCTGACACTGGATG +TGCCATAGACATCAGCCGGCAGGAGCTGAGATGTGGAAGTGGAGTGTTCATACACAATGATGTGGAGGCTTGGATGGACC +GGTACAAGTACTACCCTGAAACGCCACAAGGCCTAGCCAAGATCATTCAAAAAGCCCACAAAGAAGGAGTGTGCGGTCTA +CGGTCGGTTTCCAGACTGGAGCACCAAATGTGGGAAGCGGTGAAGGACGAGCTAAACACTCTTTTGAAAGAGAATGGTGT +GGACCTCAGTGTTGTGGTTGAGAAACAGGAGGGAATGTACAAGTCAGCACCTAAACGTCTCACCGCTACCACGGAAAAAT +TGGAAATAGGCTGGAAGGCCTGGGGAAAGAGCATCCTATTCGCACCAGAATTGGCCAACAACACTTTTGTGGTTGATGGT +CCGGAGACCAAGGAATGCCCAACTCAGAATCGCGCTTGGAACAGCTTGGAAGTAGAGGATTTTGGATTTGGTCTCACCAG +TACCCGGATGTTCCTGAAGGTCAGAGAGAGCAACACAACTGAATGTGACTCAAAGATCATCGGAACGGCTGTCAAGAACA +ACTTGGCGATCCACAGTGACCTGTCCTATTGGATTGAAAGCAGGCTTAATGATACGTGGAAGCTTGAAAGGGCGGTCCTG +GGTGAAGTTAAATCATGCACTTGGCCTGAAACGCACACTTTGTGGGGTGAAGGAATCCTCGAGAGTGACTTGATAATACC +AGTCACACTGGCGGGACCACGAAGCAACCACAATCGGAGACCTGGGTACAAGACACAAAACCAGGGCCCATGGGACGAAG +GCCGGGTAGAGATTGATTTCGATTACTGCCCAGGAACGACGGTCACCCTGAGTGAGAGCTGCGGACACCGTGGACCTGCC +ACTCGCACCACCACAGAGAGCGGAAAGCTGATAACGGACTGGTGCTGCAGGAGCTGCACCTTACCACCATTGCGCTACCA +GACGGACAGCGGTTGTTGGTATGGTATGGAGATTAGACCACAGAGGCATGATGAAAAGACCCTTGTGCAGTCACAAGTGA +ATGCTTACAACGCTGATATGATTGATCCTTTTCAGCTGGGCCTTCTGGTCGTGTTCTTGGCCACCCAGGAGGTCCTTCGC +AAGAGGTGGACAGCCAAGATCAGCATGCCAGCTATACTGATTGCTCTGCTAGTCCTGGTGTTTGGGGGCATTACTTACAC +TGACGTGTTACGCTATGTCATCTTAGTGGGAGCAGCTTTCGCAGAATCCAATTCGGGAGGAGACGTGGTACACTTGGCGC +TCATGGCGACCTTCAAGATACAACCAGTGTTTATGGTGGCATCGTTTCTCAAAGCGAGATGGACCAACCAGGAGAATATC +TTGTTGATGTTGGCGGCTGTTTTCTTTCAAATGGCTTACCATGACGCTCGCCAAATTCTGCTTTGGGAGATCCCTGATGT +GTTGAATTCATTGGCAGTAGCTTGGATGATACTGAGAGCCATAACCTTTACAACAACATCAAACGTGGTTGTTCCGCTGC +TAGCTCTGTTAACACCCGGACTGAGATGCTTGAATCTGGATGTGTACAGGATCCTGCTATTGATGGTCGGAATAGGCAGC +TTGATCAGAGAGAAGAGAAGCGCAGCTGCAAAAAAGAAAGGAGCAAGTCTGTTATGCCTGGCTCTAGCCTCAACAGGACT +TTTCAACCCTATGATCCTCGCCGCTGGACTCATTGCATGTGATCCCAACCGTAAACGAGGATGGCCCGCAACTGAAGTGA +TGACTGCTGTCGGCCTGATGTTTGCCATTGTCGGAGGGCTGGCAGAGCTTGACATTGACTCCATGGCCATTCCAATGACC +ATCGCAGGGCTCATGTTTGCTGCCTTCGTGATATCTGGGAAATCAACAGATATGTGGATCGAGAGGACGGCGGACATCTC +CTGGGAAAGTGATGCGGAAATTACAGGCTCGAGCGAGAGAGTTGATGTGCGGCTTGATGATGACGGAAATTTCCAGCTCA +TGAATGATCCAGGAGCACCTTGGAAGATATGGATGCTCAGAATGGCTTGCCTCGCGATTAGTGCGTACACCCCTTGGGCA +ATCCTGCCCTCAGTAGTTGGATTTTGGATAACTCTCCAATACACAAAGAGAGGAGGTGTGCTGTGGGACACTCCCTCACC +AAAGGAGTACAAAAAAGGGGACACGACCACTGGCGTCTACAGGATCATGACTCGTGGGCTGCTCGGCAGTTATCAAGCAG +GAGCGGGCGTGATGGTTGAAGGGGTTTTCCACACCCTTTGGCATACAACAAAAGGAGCCGCTCTGATGAGCGGGGAAGGC +CGCCTGGACCCATACTGGGGTAGTGTCAAAGAGGATCGACTTTGCTACGGAGGACCCTGGAAATTGCAGCACAAGTGGAA +TGGGCAGGATGAGGTGCAAATGATTGTGGTGGAACCTGGCAAGAACGTTAAAAACGTCCAGACGAAACCAGGGGTGTTCA +AAACACCTGAAGGAGAAATTGGGGCCGTGACTCTGGACTTCCCCACTGGAACATCAGGCTCACCAATAGTGGACAAAAAC +GGTGATGTGATCGGGCTCTATGGCAATGGAGTCATAATGCCCAACGGCTCATACATAAGCGCGATAGTGCAGGGTGAAAG +GATGGATGAGCCGATCCCAGCCGGATTCGAACCTGAGATGCTGAGGAAAAAACAGATCACAGTTCTGGACCTTCATCCCG +GTGCTGGTAAAACAAGGAGGATACTGCCACAGATCATCAAAGAGGCCATAAATAGAAGATTGAGAACGGCCGTGCTAGCA +CCAACTAGGGTTGTAGCCGCTGAGATGGCTGAAGCCCTGAGAGGACTGCCCATCCGGTATCAGACATCTGCAGTGCCCAG +AGAACACAATGGAAATGAGATTGTTGATGTCATGTGCCATGCCACTCTCACTCACAGGCTGATGTCTCCTCACAGGGTGC +CGAACTACAATCTTTTCGTGATGGATGAGGCTCATTTTACCGACCCAGCTAGCATTGCAGCAAGGGGTTATATTTCCACA +AAAGTCGAGCTGGGGGAGGCGGCGGCAATATTCATGACAGCTACCCCACCAGGCACTTCAGACCCATTCCCAGAGTCCAA +TTCACCTATTTCTGACTTGCAGACTGAGATCCCAGATCGGGCCTGGAACTCTGGGTACGAATGGATTACAGAATACATTG +GGAAAACGGTTTGGTTTGTGCCCAGTGTGAAAATGGGGAATGAGATTGCCCTTTGTCTACAACGTGCCGGCAAAAAAGTA +GTCCAACTGAACAGAAAGTCGTATGAGACGGAGTACCCAAAGTGCAAGAACGATGATTGGGACTTTGTTATCACAACAGA +CATATCTGAAATGGGGGCTAACTTCAAGGCGAGCAGGGTGATTGACAGCAGGAAGAGTGTGAAACCAACCATCATCACGG +AAGGAGAAGGGAGGGTGATCCTGGGAGAACCATCCGCTGTGACAGCAGCTAGTGCAGCCCAAAGACGTGGACGCATCGGT +AGGAATCCATCGCAAGTTGGTGATGAGTACTGCTATGGGGGGCACACGAATGAAGACGACTCGAACTTCGCCCATTGGAC +TGAGGCACGAATCATGCTGGACAACATCAACATGCCAAACGGACTGATCGCTCAATTCTACCAACCAGAGCGTGAAAAGG +TATACACCATGGATGGAGAATACCGACTCAGAGGAGAAGAGAGGAAAAACTTTCTGGAATTATTGAGGACTGCAGATCTG +CCAGTTTGGCTGGCTTACAAGGTGGCAGCGGCTGGAGTGTCATACCACGATCGGAGATGGTGTTTTGATGGCCCTAGGAC +AAACACAATTCTAGAAGACAACAACGAAGTGGAAGTCATTACGAAGCTTGGTGAAAGAAAGATTCTGAGGCCGCGCTGGA +TTGACGCCAGGGTGTACTCGGATCATCAGGCATTAAAGGCGTTCAAGGACTTTGCTTCGGGAAAGCGTTCTCAGATAGGG +CTCATTGAGGTTCTGGGAAAGATGCCTGAGCACTTCATGGGGAAGACATGGGAAGCACTTGACACCATGTATGTTGTGGC +CACCGCAGAGAAAGGGGGAAGAGCTCACAGAATGGCCTTGGAGGAACTGCCAGATGCTCTCCAGACAATTGCCCTGATTG +CCTTATTGAGTGTGATGACCATGGGAGTATTCTTCCTCCTCATGCAGCGGAAGGGCATTGGAAAGATAGGTTTGGGAGGC +GTTGTCCTGGGAGTCGCAACCTTCTTTTGTTGGATGGCTGAAGTTCCAGGAACGAAGATCGCCGGAATGTTGCTGCTTTC +CCTTCTCTTGATGATTGTGCTAATCCCTGAGCCAGAGAAGCAACGTTCGCAGACAGACAACCAGCTAGCCGTGTTCCTGA +TTTGTGTGTTGACCCTCGTGAGCGCAGTGGCAGCCAACGAAATGGGTTGGCTGGACAAGACCAAGAATGATATAAGCAGT +TTGTTTGGGCAAAGAATTGAGGCCAAGGAGAATTTCAGTATGGGAGAGTTTCTCCTGGACTTGAGACCGGCAACAGCCTG +GTCACTGTATGCTGTGACCACAGCGGTTCTCACTCCACTGCTAAAGCATCTGATCACGTCAGATTACATCAACACTTCAT +TGACCTCAATCAATGTTCAAGCAAGTGCACTATTCACACTCGCGCGAGGCTTCCCCTTTGTCGATGTTGGAGTGTCGGCT +CTCCTGCTAGCAGCCGGATGCTGGGGACAAGTCACCCTCACCGTGACGGTGACAGCGGCAACACTCCTGTTCTGCCACTA +CGCCTACATGGTTCCCGGATGGCAGGCTGAGGCAATGCGCTCAGCCCAGCGGCGGACAGCGGCTGGAATCATGAAAAACG +CTGTAGTGGATGGCATCGTGGCCACGGACGTCCCAGAATTAGAGCGCACCACACCCATCATGCAGAAGAAAGTTGGGCAA +ATCATGCTGATCTTGGTGTCTCTAGCTGCAGTAGTAGTGAACCCGTCTGTGAAGACAGTGCGAGAAGCCGGAATTCTGAT +CACGGCAGCAGCGGTGACACTCTGGGAGAATGGAGCAAGCTCTGTTTGGAATGCAACAACTGCCATCGGACTCTGCCACA +TCATGCGTGGGGGTTGGTTGTCATGCTTATCCATAACATGGACACTCATAAAGAACATGGAAAAACCAGGACTAAAAAGA +GGTGGGGCAAAGGGACGCACCTTGGGAGAGGTTTGGAAAGAAAGACTCAACCAGATGACAAAAGAAGAGTTCACTAGGTA +CCGCAAAGAGGCCATCATCGAAGTCGATCGCTCAGCAGCAAAACACGCCAGGAAAGAAGGCAATGTCACTGGAGGGCATC +CAGTCTCTAGAGGCACAGCAAAGCTGAGATGGCTGGTCGAGCGGAGGTTTCTCGAACCGGTCGGAAAAGTGATTGACCTT +GGATGTGGAAGAGGCGGTTGGTGTTACTACATGGCAACCCAAAAAAGAGTCCAAGAGGTCAGAGGGTACACAAAGGGTGG +TCCCGGACATGAAGAGCCCCAACTGGTGCAAAGTTATGGATGGAACATTGTCACCATGAAGAGCGGAGTGGATGTGTTCT +ACAGACCTTCTGAGTGCTGCGATACCCTCCTTTGTGACATCGGAGAGTCTTCATCAAGTGCTGAGGTTGAAGAGCATAGG +ACGATCCGGGTCCTTGAAATGGTTGAGGACTGGCTGCACCGAGGGCCAAAGGAATTTTGTGTGAAGGTGCTCTGCCCCTA +TATGCCAAAAGTCATAGAAAAGATGGAGCTGCTCCAGCGCCGGTATGGGGGGGGACTGGTCAGAAACCCACTCTCGCGGA +ATTCCACGCACGAGATGTATTGGGTAAGTCGAGCTTCGGGCAATGTGGTACACTCAGTGAACATGACCAGCCAGGTGCTT +CTGGGAAGAATGGAGAAAAGGACCTGGAAGGGACCCCAATACGAGGAAGATGTGAACTTGGGAAGTGGAACCAGGGCGGT +GGGAAAACCCCTACTCAACTCAGACACTAGTAAAATCAAGAACAGGATTGAACGACTCAGGCGTGAGTACAGTTCGACGT +GGCACCACGATGAGAACCACCCATATAGAACCTGGAACTATCACGGCAGTTATGATGTGAAACCTACAGGCTCCGCCAGC +TCGCTGGTCAATGGAGTGGTTAGGCTCCTCTCAAAACCATGGGACACCATCACGAACGTTACCACCATGGCCATGACTGA +CACTACTCCCTTCGGACAGCAGCGGGTGTTTAAAGAGAAGGTGGACACGAAAGCTCCTGAACCGCCAGAAGGAGTGAAGT +ATGTGCTCAATGAAACCACCAACTGGTTGTGGGCGTTTCTGGCCAGAGAAAAACGTCCCAGAATGTGCTCTCGAGAGGAA +TTCATAAAAAAGGTCAATAGCAATGCAGCTCTGGGTGCCATGTTTGAAGAGCAGAACCAATGGAGGAGCGCCAGAGAAGC +AGTTGAGGATCCAAAATTTTGGGAGATGGTGGATGAGGAGCGCGAGGCACACCTGCGGGGGGAATGTCACACTTGCATCT +ACAACATGATGGGGAAGAGAGAGAAGAAACCTGGAGAGTTCGGAAAGGCTAAGGGAAGCAGAGCCATATGGTTCATGTGG +CTCGGAGCTCGCTTTCTGGAGTTCGAAGCTCTGGGCTTTCTTAACGAAGACCACTGGCTTGGAAGAAAGAACTCAGGAGG +CGGGGTCGAGGGCTTGGGCCTCCAAAAACTGGGTTATATTCTGCGTGAAGTTGGCACCCGACCTGGAGGCAAGATCTATG +CTGATGACACAGCTGGCTGGGACACCCGCATTACGAGAGCTGACCTGGAAAATGAAGCTAAGGTTCTTGAGTTGCTGGAT +GGGGAACATCGGCGTCTTGCTAGGGCCATCATTGAGCTCACCTATCGTCACAAAGTTGTGAAAGTGATGCGCCCGGCTGC +TGATGGAAGAACCGTCATGGATGTCATCTCCAGAGAAGATCAGAGGGGGAGTGGACAAGTTGTCACCTACGCTCTAAACA +CCTTCACCAACCTGGCCGTCCAGTTGGTGAGGATGATGGAAGGGGAAGGAGTGATTGGCCCAGATGATGTGGAGAAACTC +ACAAAGGGAAAAGGACCTAAAGTCAGGACCTGGCTGTTTGAGAATGGGGAGGAAAGACTCAGCCGCATGGCTGTCAGCGG +AGATGACTGTGTGGTAAAGCCCCTAGATGACCGCTTCGCCACCTCTCTCCACTTCCTCAACGCCATGTCAAAGGTTCGCA +AAGATATCCAGGAGTGGAAACCGTCAACTGGATGGTATGACTGGCAGCAGGTTCCATTCTGCTCGAACCATTTCACTGAA +TTAATCATGAAAGATGGAAGAACACTGGTGGTTCCATGCCGAGGACAGGACGAACTGGTAGGCAGAGCTCGCATTTCTCC +AGGGGCCGGATGGAACGTCCGTGACACTGCTTGTCTGGCTAAGTCTTATGCCCAGATGTGGCTGCTTCTGTACTTCCACA +GAAGAGACCTGCGGCTAATGGCCAACGCCATTTGCTCCGCTGTCCCTGTGAATTGGGTCCCTACCGGAAGAACCACGTGG +TCCATCCATGCCGGAGGGGAGTGGATGACAACAGAAGACATGCTGGAGGTCTGGAACCGTGTTTGGATAGAGGAGAATGA +ATGGATGGAAGACAAAACCCCAGTGGAGAAATGGAGTGACGTCCCATACTCAGGAAAACGGGAGGACATCTGGTGTGGCA +GCTTGATTGGCACAAGAACCCGAGCCACGTGGGCAGAAAACATCCAGGTAGCCATCAACCAAGTCAGAGCAATCATTGGA +GATGAGAAGTATGTGGATTACATGAGTTCATTAAAGAGATATGAAGACACGACTTTGGTTGAGGACACAGTACTGTAAAT +ACTTTATTAATTGTAAATAGACAATGTAAGCATGTGTAAAAGTATAGTTTTATAGTAGCATTTAGTGATGTTAGTGTAAA +TAGTTAAGAAAATTTTAAGGAGGAAGTCAGGCCGGAAAGTTTCCGCCACCGGAAGTTGAGTAGACGGTGCTGCCTGCGAC +TCAACCCCAGGAGGACTGGGTGAACAAAGCTGCGAAGTGATCCATGTAAGCCCTCAGAACCGTCTCGGAAGGAGGACCCC +ACATGTTGTAACTTCAAAGCCCAATGTCAGACCACGCTACGGCGTGCCACTCTGCGGAGAGTGCAGTCTGCGATAGTGCC +CCAGGAGGACTGGGTTAACAAAGGCAGATCAACGCCCCACGCGGCCCTAGCCCTGGTAATGGTGTTAACCAGGGCGAAAG +GACTAGAGGTTAGAGGAGACCCCGCGGTTTAAAGTGCACGGCCCAGCCTGACTGAAGCTGTAGGTCAGGGGAAGGACTAG +AGGTTAGTGGAGACCCCGTGCCACAAAACACCACAACAAAACAGCATATTGACACCTGGGATAGACTAGGAGATCTTCTG +CTCTGCACAACCAGCCACACGGCACAGTGCGCCGACAATGGTGGCTGGTGGTGCGAGAACACAGGATCT