Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| OR051903 | GII.6 | GII.P7 |
ORF1: 1..5094
ORF2: 5075..6718
ORF3: 6718..7497

LOCUS       OR051903                7522 bp    RNA     linear   VRL 06-OCT-2023
DEFINITION  Norovirus GII isolate GII/Hu/US/2013/GII.6[P7]/NIH29.1
            nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3)
            genes, complete cds.
ACCESSION   OR051903
VERSION     OR051903.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7522)
  AUTHORS   Chaimongkol,N., Dabilla,N., Tohma,K., Matsushima,Y., Behrle
            Yardley,A., Levenson,E.A., Johnson,J.A., Ahorrio,C., Oler,A.J.,
            Kim,D.Y., Souza,M., Sosnovtsev,S.V., Parra,G.I. and Green,K.Y.
  TITLE     Norovirus Evolves as One or More Distinct Clonal Populations in
            Immunocompromised Hosts
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7522)
  AUTHORS   Chaimongkol,N., Dabilla,N., Sosnovtsev,S.V. and Green,K.Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-MAY-2023) LID, NIAID, 50 South Drive/R6318,
            Bethesda, MD 20892, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: HIVE Hexagon/Heptagon v.
2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7522 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="GII/Hu/US/2013/GII.6[P7]/NIH29.1" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="Nov-2013" /note="genotype: GII.6" gene 1..5094 /gene="ORF1" CDS 1..5094 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="WHW95053.1" /translation="MKMASNDASAAFGNQNSVNDSINTAPSNKEEVGAFSNIKVGFKK ILGAVPKGTKAPSSDQHCPTVKIGTKTLTVPPEPPNGEDAVQFDVKSETVRGLPDLTT VQNEHENTPYTVPPLSEREHRPATEPLPGTILEMWDGEFYHYSVYVSGGKALGVHKPP AAISLATIELTPISLYWRPVYTPNYLVCPDTLKGLAGEKFPYTAFSNNCYNFCCWVLE LNDTWLSRRSISRTTGFFKPYQSWNRKPLPTVDDGKIKKVANAILCALGSLFSKPIKD LLGKLKPLNLLHLLASCDWTFAGIVETVILMAELFNIFWTPPDVSSFIASLIGDFELQ GPEDLAVELVPVVMGGIGMVLGFTAEKIGRMLSSAASTLRACKDLGNYALDILKLVMK WFFPKKEEKAEMETLRAIEDAVLDMEAIGNNHLTTLLKDKDSLTAFMKTLDLEEEKAR KLSTKSSSPDIVGTINAILARIAAARSLLHKAKEEMFSRIRPVVVMISGRPGIGKTHM ARHLAKSIANTMSGDQRVGLVPRNGVDHWDAYRGERVVLWDDYGMGNPVKDALTLQEL ADTCPVTLNCDRIENKGKMFDSDVIIITTNLVNPAPLDYVNFEACSRRVDFLVYAESP EIEKVKRDFPGQPDMWKDHFKSDFSHIKLTLAPQGGFDKNGNTPHGKGTMRSLTQGSL TARVAGLVHERRDEFQLQGTDLQTYNFDTNRVSAFRKLAADNKYGIMETMRVGTALKS VKTLEDLKVALRDVKFNECEIIYRNSKYRVSSNGKGSVSVDKIEDQTSQTANEVGAAL LRLRQARARYYVSCFQDLVYTLIQVAGASFVVNRISKRFCWERWVKPTETQETSESEK EVAQGRWEIEPKDTEPEGKKGKNKKGRGKKHTAFSSKGLSDEEYDEFKRIREERNGKY SIEEYLQDRDRYYEEVAVARATEEDFCEEEEAKIRQRIFRPTKKQRKEERGVLGLVTG SDIRKRRPDDFQPKGNLWADDTRSVDYNERLDFEAPPSVWSRIVPLGTGWGFWVSSNL LITTTHVLPKGIKELFGVEIKQIQIHKSGEFCRFRFPRPIRPDVTGLVLEEGAPEGTV CSILVKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDCG CPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGNESLGTYCGAPILGPG KAPKLSTKTKFWRSSPDALPPGTYEPAYLGGKDPRVEKGPSLQQVMRDQLKPFTEPRG KPPRPAVLEEAKKTVMNVLEQTIDPAKPWTYSQACASLDKTTSSGSPHHVRKNDHWNG ESFTGPLADQASKANLMYEQAKHVQPVYTAALKDELVKTDKIYKKIKKRLLWGSDLGT MIRCARAFGGLMDSMKASCIALPCRVGMNMNEDGPIIFDKHSKYRYHYDADYSRWDST QQRSILSAAMEVMVRFSAEPELAQVVAEDLLAPSQLDVGDFVILVQEGLPSGVPCTSQ 
WNSIAHWILTLSAMAEVSGLSPDVVQAHSCFSFYGDDEIVSTDINLDPMKLTQKLREY GLVPTRPDKTEGPLVITEDLTGLTFLRRSIARDPAGWFGKLDQDSILRQLYWTRGPNH ENPYESMVPHSQRATQLMALLGEASLHGPQFYKKVSKMVINEIKSGGLEFYVPRQEAM FRWMRFSDLSTWEGDRNLAPEGVNEDGVE" mat_peptide 1..1002 /gene="ORF1" /product="p48" mat_peptide 1003..2100 /gene="ORF1" /product="NTPase" mat_peptide 2101..2619 /gene="ORF1" /product="p22" mat_peptide 2620..3018 /gene="ORF1" /product="VPg" mat_peptide 3019..3561 /gene="ORF1" /product="Pro" mat_peptide 3562..5091 /gene="ORF1" /product="RdRp" gene 5075..6718 /gene="ORF2" CDS 5075..6718 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WHW95054.1" /translation="MKMASNDAAPSNDGAANLVPEANNEVMALEPVVGASIAAPVVGQ QNIIDPWIRENFVQAPQGEFTVSPRNSPGEMLLNLELGPELNPYLSHLSRMYNGYAGG MQVQVVLAGNAFTAGKIIFAAVPPHFPVENISAAQITMCPHVIVDVRQLEPVLLPLPD IRNRFFHYNQENTSRMRLVAMLYTPLRANSGEDVFTVSCRVLTRPAPDFEFTFLVPPT VESKTKPFTLPILTLGELSNSRFPAPIDMLYTDPNEGIVVQPQNGRCTLDGTLQGTTQ LVPTQICAFRGTLIGQTSRSSDSTDSAPRRRDHPLHVQLKNLDGTQYDPTDEVPAVLG AIDFKGTVFGVASQRDVSGQQVGATRAHEVHINTTDPRYTPKLGSILMHSESDDFVTG QPVRFTPIGMGDNDWHQWELPDYSGHLTLNMNLAPAVAPAFPGERILFFRSVVPSAGG YGSGQIDCLIPQEWVQHFYQEAAPSQSAVALIRYVNPDTGRNIFEAKLHREGFITVAN SGNNPIVVPPNGYFRFEAWVNQFYTLTPMGTGQGRRRNQ" gene 6718..7497 /gene="ORF3" CDS 6718..7497 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WHW95055.1" /translation="MASAFLAGLAGDVITNGVGSLINAGANAVNQKVEYDFNKQLQMA SFKHDKEMLQSQVLATKQLQQEMMNIRQGVLTAGGFSPADAARGAVNAPMTKILDWNG TRYWAPNSMKTTSYSGQFSSSPVHKSPAPSLHTASPKSRLQNDSASVYSFPSSVSSQS THSTVLSVGTGSSRSIPTSTATPTLSRTSDWVRGQNERLSPFMDGALQTAFVTPPSSR ASSNGTVSTVPKAVLDSWTPMFNTHRQPLFAHPRRRGESQV" ORIGIN 1 atgaagatgg cgtctaacga cgcttccgct gcctttggca accaaaactc tgtcaatgac 61 agtatcaaca ccgccccttc caataaagaa gaggttggtg cattctccaa catcaaagtt 121 ggttttaaga aaatactggg tgccgttccc aaggggacca aagcacccag tagtgatcag 181 cactgcccca cagttaagat cgggaccaaa acattaacag ttccccctga gcccccaaat 241 ggtgaagacg ccgtgcagtt tgatgtgaag 
tcggaaactg tgcgtgggct accagacttg 301 acaacggtgc agaatgaaca cgaaaacaca ccatacactg tccccccatt gagtgaaagg 361 gagcacagac cagccactga gccgcttcct ggtacaatac tggagatgtg ggatggtgaa 421 ttctaccatt actccgtata cgtcagcggt ggcaaagccc taggggttca caagccccct 481 gcagcgataa gcctcgcgac gatagagctc acacccatat ctctttattg gaggcccgtt 541 tacaccccaa actatctggt ctgtccagac acgttgaaag gcctcgctgg tgagaagttc 601 ccctacacgg ccttcagcaa caactgttac aacttctgtt gctgggtgct cgagctcaat 661 gacacgtggc ttagcaggag aagcatctcc aggaccaccg gcttcttcaa accatatcag 721 tcttggaata ggaaacccct tccaaccgtt gatgatggaa agattaaaaa ggtggccaac 781 gcgattcttt gcgcacttgg atcactgttt tcaaaaccaa tcaaggacct attgggcaaa 841 cttaaaccat tgaacttgct gcacttgctc gcatcatgtg actggacgtt tgcgggtata 901 gtagagacag ttatcctaat ggcggaactt ttcaacatct tttggacacc gccagatgtt 961 tccagcttca tagcctccct gataggcgat tttgaactgc aagggcctga ggacctggct 1021 gtggagcttg tccccgtggt catgggtggc ataggaatgg ttctcggctt cactgctgag 1081 aagataggcc gcatgctatc atccgctgca tcaacactgc gagcatgtaa agacttgggg 1141 aattatgccc ttgacatatt gaaactggtt atgaagtggt tcttcccaaa gaaagaagag 1201 aaagccgaga tggaaacctt aagggcgatc gaggatgctg tcctggatat ggaagctata 1261 ggaaacaacc atctcacaac cctccttaag gacaaggaca gcctcacggc tttcatgaag 1321 actctggacc tagaagagga gaaggcaagg aagctgtcca ccaagtcatc ctcgccagat 1381 atcgtcggca ctattaatgc catactggcc agaatagctg ctgccaggtc cctcttacac 1441 aaggccaaag aagaaatgtt tagcaggatt agacctgtgg ttgttatgat ctctggtaga 1501 cctggcatcg gaaaaaccca catggcaagg cacttggcta agagtatcgc caacaccatg 1561 agcggcgacc agagagtggg gctcgtccca cgcaacgggg tcgatcattg ggacgcctac 1621 aggggagaga gggtggtctt atgggatgat tatggcatgg gaaaccctgt caaagatgcc 1681 ctaacactgc aagagttggc tgacacctgc ccagtaaccc taaactgcga caggattgag 1741 aacaagggga agatgttcga cagtgatgtc atcataatca cgaccaactt agtcaaccca 1801 gcgcccctcg attatgtgaa cttcgaggca tgctccagaa gggtcgactt tctggtctac 1861 gcagagtcac cagagattga gaaggttaag agagatttcc ccggccaacc tgacatgtgg 1921 aaggaccatt ttaagtcaga cttttcccac ataaaactga ctctagcccc 
acagggcgga 1981 tttgacaaga atggcaacac cccccatggt aagggcacca tgcgatccct aacccagggg 2041 tccctgactg caagggttgc aggtcttgtt catgagagga gagacgagtt tcagctccag 2101 ggaaccgacc ttcaaacata caactttgac accaacaggg tctctgcatt taggaagctg 2161 gccgcagaca acaagtatgg gatcatggag acaatgagag tgggcacggc gcttaagagt 2221 gtcaagacat tagaagatct gaaagtcgcc ctgagggatg tgaaattcaa tgagtgtgag 2281 ataatttaca ggaactccaa ataccgagtc tcttccaatg gtaaaggttc tgtctctgtt 2341 gacaaaattg aggaccaaac atcccagact gcaaacgaag tgggtgcagc acttcttaga 2401 ctcaggcagg caagagccag atactatgtc agctgcttcc aagatcttgt ctacactctg 2461 atacaggtcg ccggagcatc attcgttgtc aacaggatct ctaaaagatt ctgctgggag 2521 agatgggtca agccaactga gacccaggaa acgagtgaat ccgaaaagga agtagcccaa 2581 ggcagatggg agattgaacc caaggacaca gaaccagaag gtaagaaggg caagaacaag 2641 aaaggaagag gcaagaaaca cacagctttc tccagcaaag gcctgagtga tgaggagtat 2701 gacgagttca agagaatcag agaagaaagg aatggaaaat actccattga ggagtacctg 2761 caagaccggg atcgctacta tgaggaagtg gcagttgccc gggcgacaga ggaggacttc 2821 tgtgaagagg aggaggccaa gataaggcaa agaatcttcc gccccacaaa gaaacagagg 2881 aaggaagaaa ggggggtgct tggtttggtc actggctcag acatcaggaa gaggagacca 2941 gacgattttc aaccgaaggg caatctgtgg gcagacgaca ccaggagtgt ggactacaac 3001 gagagacttg atttcgaggc tcccccgagt gtctggtcaa gaatagtccc attgggcact 3061 ggctggggtt tctgggtctc atccaacctc ctgatcacaa caacacacgt cctgcctaag 3121 ggaattaaag aactttttgg agttgaaatc aaacagattc aaattcataa gtctggagag 3181 ttctgcaggt ttaggttccc gagaccaatc agaccagatg tcacagggct cgtgttggag 3241 gaaggtgctc cagagggcac tgtctgctcc atactcgtga agagacccac aggtgagatg 3301 atccccctgg cagtgaggat gggcacgcat gcatccatga aaatacaggg caggaccgtc 3361 ggtggccaga tgggaatgct cctcacgggg gcaaatgcga agaacatgga tcttggcact 3421 ggtcctggtg actgcggttg cccttacatc tacaaacgtg gcaatgatat agttgtcgcg 3481 ggtgtccaca ccgcagcagc ccggggaggc aatactgtca tatgtgccac tcaagggcaa 3541 gatggggagg cagtccttga gggaaatgaa agccttggca cttactgtgg tgccccaatt 3601 ttgggtccag gcaaggcgcc caaacttagc acgaagacca aattctggcg ttcatcacca 
3661 gacgctttgc cgccaggcac ttatgagcct gcctacttgg ggggcaagga ccccagagtg 3721 gaaaaaggac catccctgca gcaagtcatg agggaccagc taaaaccttt cacagaaccc 3781 agaggcaaac cacctagacc tgcagtctta gaagaagcca aaaagacggt tatgaatgtt 3841 ctggaacaaa ccattgaccc tgctaagcca tggacctatt cccaagcatg cgcctctctg 3901 gacaagacca cttccagtgg tagcccccat catgtcagga aaaatgatca ctggaatggg 3961 gaatctttca ctggccccct cgcagaccaa gcatccaaag ccaacctcat gtacgaacag 4021 gccaaacatg tgcaacccgt gtacacggcc gcactcaaag atgagctagt caaaactgat 4081 aagatctaca aaaagataaa gaaaaggctc ttatgggggt cggatcttgg tacaatgatc 4141 aggtgtgcca gggcctttgg cggtctcatg gatagcatga aggcaagttg catagccctc 4201 ccatgtaggg taggaatgaa catgaatgaa gatggtccca tcatatttga caaacactct 4261 aagtataggt accactatga tgctgactat tccaggtggg actcaaccca gcaaaggagc 4321 atcctctcgg ccgctatgga agtgatggtg cggttctctg ccgaaccaga gctggcacaa 4381 gtggttgcag aggacctcct ggcacctagc cagctagatg ttggcgactt tgtcatctta 4441 gtccaggagg gtctgccatc aggggttcca tgcacatcac aatggaattc aatagcacat 4501 tggatcctga ccttgagtgc aatggcagaa gtgtcgggtc tctcaccaga tgttgtccaa 4561 gcccactcct gtttctcatt ctacggtgat gatgagatcg tcagcactga catcaacctc 4621 gatcccatga agttgacaca gaaactcaga gagtatggtc tggtccccac tcgacctgat 4681 aaaactgagg gccccctcgt gataactgaa gacctcaccg gcctaacgtt cctgcgcagg 4741 tcaattgcac gggacccagc tgggtggttt ggaaagttgg accaagactc aatcctcaga 4801 cagttgtact ggacaagggg ccccaatcat gagaacccat atgagagcat ggtccctcat 4861 tcccaacggg ccacacagct catggccctt ctcggcgagg cttccctgca tggcccccag 4921 ttttacaaga aggttagcaa gatggtcatc aatgaaatca aaagtggtgg tctggaattc 4981 tatgtgccca gacaagaggc tatgttcaga tggatgagat tctctgacct cagcacatgg 5041 gagggcgatc gcaatcttgc tcccgagggt gtgaatgaag atggcgtcga atgacgctgc 5101 tccatcgaat gatggtgctg ccaacctcgt accagaggcc aacaatgagg tcatggcact 5161 tgaaccggtg gtgggagcct caatcgcagc tcctgtcgtc ggccaacaga acataattga 5221 cccctggatt agagaaaatt ttgtccaagc accacagggc gagtttaccg tctcaccgag 5281 gaactcgcct ggtgagatgc tattaaatct tgaactaggc ccagaactta acccctatct 5341 
aagtcacctg tcccgtatgt ataatgggta tgctggtggc atgcaggttc aggtggtcct 5401 agctgggaat gcgttcacag ctgggaaaat catctttgcc gccgtaccac cacattttcc 5461 cgtagagaac atcagtgcag ctcaaataac tatgtgtccc catgtaattg ttgatgtgag 5521 acagcttgaa ccagtacttc tgcccctccc tgacataagg aataggttct ttcattacaa 5581 tcaggagaac acctcccgga tgagacttgt ggccatgctt tatacccccc tgagggccaa 5641 ctctggtgaa gatgtgttta ctgtttcctg tagggtcttg acccgtcctg cccctgattt 5701 tgagtttact ttcttggtac caccgactgt tgaatcgaag actaaacctt tcacactgcc 5761 catattaact cttggtgagc tatctaactc cagattccca gccccaatag atatgttgta 5821 cactgaccca aatgagggga ttgtggtcca accacaaaat ggtaggtgca ctcttgatgg 5881 cactctgcaa ggcaccacac aactggtccc cacccaaatt tgtgctttca ggggcacact 5941 aattggccaa acatcaagat cttcagactc aaccgactca gcccctcgga gaagggatca 6001 cccactccat gtgcaattaa agaaccttga tggcacgcag tacgacccaa ctgatgaagt 6061 gccagcagtc cttggtgcca ttgacttcaa ggggactgtc tttggggtgg ctagtcagag 6121 agatgtgtca gggcaacaag tgggagcaac tcgagctcat gaagtgcaca tcaacacaac 6181 cgatcctagg tacacaccaa aactgggatc cattctcatg cactcagagt cggacgactt 6241 tgtgactgga cagccggtcc gcttcacacc cataggaatg ggtgacaacg actggcacca 6301 gtgggagctg cccgactact ctggacacct aaccctgaac atgaaccttg ccccagcagt 6361 cgctcctgct tttccgggcg agaggattct tttcttcagg tcagtagtcc cgtctgctgg 6421 tggctatggg tctggacaaa tagactgcct cataccacag gagtgggttc agcatttcta 6481 ccaggaagct gcaccatccc aatctgccgt ggcactcatc aggtatgtca atcctgacac 6541 aggcagaaac atctttgagg ctaaattgca cagggaaggt ttcatcaccg tggcaaattc 6601 tggcaacaac cccatcgttg tcccccctaa tgggtatttt aggtttgagg cttgggtgaa 6661 tcaattttac actttgaccc ccatgggaac tggtcagggg cgtaggagga atcaataatg 6721 gccagtgctt ttcttgcagg tcttgctggc gacgtcataa caaatggcgt tggatctcta 6781 ataaatgctg gagctaatgc agttaatcag aaagttgaat atgattttaa taaacagctt 6841 caaatggcat cattcaaaca tgataaagag atgttgcaat cgcaagtgtt ggcaaccaag 6901 cagttgcagc aggagatgat gaacataagg cagggggtgt tgaccgctgg cggcttctcc 6961 cccgcggatg ctgctagagg ggccgtcaat gccccaatga caaagattct ggactggaat 7021 ggcaccaggt 
attgggcacc aaacagcatg aaaaccacaa gttattcagg acagttttct 7081 agcagccctg ttcataagtc tcctgcccct tctttgcaca ctgcttcacc aaagagtaga 7141 ttgcaaaatg attctgctag tgtatatagt ttcccttcct ctgtctcttc acaatcaact 7201 cattcaacag tgttgtcagt aggaaccggt tcttccaggt ccatccccac atccacagcg 7261 acccctacct tgtctagaac cagcgactgg gttaggggac agaatgagag gctcagcccg 7321 tttatggatg gtgctctcca aactgccttt gtcacacctc catcgagcag agcttcttca 7381 aatggtacgg tctcaaccgt tcccaaagct gttttggact cctggacccc tatgtttaac 7441 acacataggc agcctctctt cgcccatcca cgtaggcgag gggagtcaca agtttagtga 7501 aaagagtgat taggatttct cc //
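The ORF coordinates listed for OR051903 (ORF1: 1..5094, ORF2: 5075..6718, ORF3: 6718..7497) imply that adjacent reading frames overlap. The following is a minimal sketch of how those lengths, overlaps, and frame offsets can be checked from the coordinates alone. The function names (`orf_length`, `overlap`, `frame`) are hypothetical helpers introduced for illustration, not part of the typing tool, and the coordinates are assumed to be 1-based and inclusive, as in GenBank feature locations.

```python
# ORF coordinates as listed in the record (assumed 1-based, inclusive).
ORFS = {
    "ORF1": (1, 5094),
    "ORF2": (5075, 6718),
    "ORF3": (6718, 7497),
}


def orf_length(start, end):
    """Length in nucleotides of an inclusive 1-based interval."""
    return end - start + 1


def overlap(a, b):
    """Number of nucleotides shared by two inclusive intervals."""
    shared_start = max(a[0], b[0])
    shared_end = min(a[1], b[1])
    return max(0, shared_end - shared_start + 1)


def frame(start):
    """Reading-frame offset (0, 1, or 2) relative to genome position 1."""
    return (start - 1) % 3


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        length = orf_length(start, end)
        # A complete CDS includes its stop codon, so codons = length / 3.
        print(name, "length:", length, "codons:", length // 3,
              "frame offset:", frame(start))
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]))
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]))
```

Under these assumptions the coordinates give a 20-nucleotide overlap between ORF1 and ORF2 and a single shared nucleotide between ORF2 and ORF3, where position 6718 closes the ORF2 stop codon and opens the ORF3 start codon.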