Typing tool
|
Complete norovirus genomes
OR069404 | GII.3 | ||
---|---|---|---|
GII.P21 |
ORF1: 1..5100 ORF2: 5081..6727 ORF3: 6727..7491LOCUS OR069404 7543 bp RNA linear VRL 11-OCT-2023 DEFINITION Norovirus GII isolate GII/Hu/US/2015/GII.3[P21]/NIH38.3 nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds. ACCESSION OR069404 VERSION OR069404.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7543) AUTHORS Chaimongkol,N., Dabilla,N., Tohma,K., Matsushima,Y., Behrle Yardley,A., Levenson,E.A., Johnson,J.A., Ahorrio,C., Oler,A.J., Kim,D.Y., Souza,M., Sosnovtsev,S.V., Parra,G.I. and Green,K.Y. TITLE Norovirus Evolves as One or More Distinct Clonal Populations in Immunocompromised Hosts JOURNAL Unpublished REFERENCE 2 (bases 1 to 7543) AUTHORS Chaimongkol,N., Dabilla,N., Sosnovtsev,S.V. and Green,K.Y. TITLE Direct Submission JOURNAL Submitted (26-MAY-2023) LID, NIAID, 50 South Drive/R6318, Bethesda, MD 20892, USA COMMENT ##Assembly-Data-START## Assembly Method :: HIVE Hexagon/Heptagon v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7543 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="GII/Hu/US/2015/GII.3[P21]/NIH38.3" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="Apr-2015" /note="genotype: GII.3" gene 1..5100 /gene="ORF1" CDS 1..5100 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="WID03728.1" /translation="MKMASNDASAAAAANSNNDNAKTSSDGVFSNMAVTFKRALGARP KQPPPKDKPPKPPRPPTPELVKTIPPPPPNGEDEPVISYNVKEGVSGLPELSTVTQPV ENSTAFSVPPLSQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAIS LAKVELTPLSLYWRPVYTPQYLISPDTLRKLHGETFPYIAFDSNCYTFCCWVLDLNDS WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKVKKVANIVLCAFASVFTRPIKDIIGK LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED LVVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KKDEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDTLATYMRTLDMEEEKARKLST KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARDL AKKIASSLAGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPEVEK AKRDFPGQPDMWKDTFRPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTAGSLVARA SGLLHERLDEYELQGPTPTTFNFDRNKVLAFRQLAAENKYGLVDTMRVGSQLKNVKTM TELKQALKNISVKKCQLVYGGGTYTLESDGKGNVHVEKVKNANVQTNNELSGALHHLR CARIRYYVKCVQEALYSILQIAGAAFITTRIVKRMNIQNLWSKPQVEDLEEAGNEEGC PKPRNDEEFVISSDDIKTEGKKGKNKAGRGKKHTAFSSKGLSDEEYDEFKRIREERNG KYSIEEYLHDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERCSLGLV TGSEIRKRNPDDFKPKGKLWADDDRSVDYAEKLSFEAPPSIWSRIVNFGSGWGFWVSP SLFITSTHVIPQGAQEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVSGMILEEGAPEG TVATILIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKNMDLGTTPGD CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILG PGNAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP RGKPPKPGVLEAAKKTIINVLEQTIDPPQKWTYAQACASLDKTTSSGHPHHMRKNECW NGETFTGKLADQASKANLMYEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL STMIRCARAFGGLMDELKAHCVSLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD STQQRAVLAAALEIMVKFSPEPQLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCT SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTSKLK EYGLKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGP NHEDPSETMIPHSQRPIQLMSLLGEAALHGPSFYSKISKLVISELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPSYVNEDGVE" mat_peptide 1..990 /gene="ORF1" /product="p48" mat_peptide 991..2088 /gene="ORF1" /product="NTPase" mat_peptide 2089..2625 /gene="ORF1" /product="p22" mat_peptide 2626..3024 /gene="ORF1" /product="VPg" mat_peptide 3025..3567 /gene="ORF1" /product="Pro" mat_peptide 3568..5097 /gene="ORF1" /product="RdRp" gene 5081..6727 /gene="ORF2" CDS 5081..6727 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WID03729.1" /translation="MKMASNDAAPSSDGAAGLVPEINNEAMALEPVAGAAIAAPLTGQ HNIIDPWIMNNFVQAPGGEFTVSPRNSPGEVLLKLELGPEINPYLAHLARMYNGYAGG FEVQVILAGNAFTAGKIIFAAVPPNFPIDNLSPAQITMCPHVIVDVRQLEPINLPMPD VRNTFFHYNQGADSRLCLVAMLYTPLRANNSGDDVFTVSCRVLTRPSPDFSFNFLVPP TVESKMKPFTIPILTISEMSNSRFPVPIDSLHTSPTENIVVQCQNGRVTLDGELMGTT QLLPSQICAFRGVLTGSTSRASDQTDTATPRQFNHRWHIQLDNLNGTPYDPAEDVPGP LGTPDFRGKIFGVASQRNPDSTTRAHEAKVNTAAGRFTPKLGSLEMSTESDDFDQNQP TRFTPVGIGVDNEMDFQQWSLPNYSGQLTHNMNLAPAVAPNFPGEQLLFFRSQLPSSG GRSSGILDCLVPQEWVQHFYQESAPAQTQVALVRYVNPDTGRVLFEAKLHKLGFLTIA KNGDSPITVPPNGYFRFESWVNPFYTLAPMGTGNGRRRIQ" gene 6727..7491 /gene="ORF3" CDS 6727..7491 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WID03730.1" /translation="MAGAFIAGLAGDMLTNTVGSLVNAGANAINQTVEFENNKYLQNA SFNHDKEMLIAQVEATKKLQADMMAIKQGVLTAGGFSPTDAARGAINAPMTKVLDWNG TRLWAPNATSTTSMSGGFTHQAVRRTTPDFKMNQAPKSTPSSGSSVRSNTTQVTSLSS HSSGSSRSSGSTVVSSLPSSNRTRDWVNQQNFNLEPHMPGSLRTAFVTPPSSTASSSG TVSTVPKNVLDSWTSAFNTRRQPLFAHLRRRGESNV" ORIGIN 1 atgaagatgg cgtctaacga cgcttccgct gccgctgctg ccaacagcaa caacgacaac 61 gcaaaaactt caagtgacgg agtattttct aatatggctg tcacttttaa acgagccctc 121 ggggcgcggc ctaaacagcc gcccccgaag gacaaaccac caaaaccccc aagaccgccc 181 acaccagagc tggttaagac aatcccccct cccccaccca acggggagga cgagccagtc 241 atttcttaca acgtcaaaga gggtgtttct ggtttgcctg aactctccac agtaacccaa 301 ccggtggaga actctacggc attcagtgtc ccccctctca gtcagaggga gaacagagat 361 gcaaaggaac ccttgactgg aaccattctg gaaatgtggg acggggagat ctaccactat 421 ggcttgtatg tggaacgagg attagtgctc ggtgtacaca aaccaccagc ggccatcagc 481 cttgctaagg ttgagctgac ccctctgtcc ttgtattgga ggccagtgta caccccacag 541 tacctcattt cccctgacac tctcagaaag ttgcacgggg agacgttccc ttacatagcc 601 tttgacagca actgctacac cttctgctgt tgggttcttg acttgaacga ttcatggctg 661 agcagaagga tgatacagag aacaacaggc ttcttccgac cctaccaaga ctggaacagg 721 aaacccctcc ccacgatgga tgactccaaa gtgaagaagg tggccaatat tgtcctctgc 781 gctttcgcct cagtgttcac tagaccaatt aaggatatta ttggaaagtt gaaaccccta 841 aacatcctca acatcctagc cacatgtgat tggacttttg caggcatagt ggaatcactg 901 atcctcttgg ctgaactatt tggagttttc tggacacccc cagatgtgtc tgcaatgatc 961 gctcccttac tgggtgatta cgaactacag gggcccgagg acctcgttgt ggaactcgta 1021 cccgtggtaa tgggagggat aggtttggtg ctgggattca ccaaagagaa aatcggtaaa 1081 atgctttcat ctgctgcttc aaccctgaga gcgtgtaaag atcttggtgc atatgggctg 1141 gagatcctca agttggtcat gaaatggttt ttcccaaaga aagacgaggc aaacgaattg 1201 gctatggtga gatccatcga ggatgcagta ctagacctcg aggccattga gaacaaccat 1261 atgacaactc tactcaagga caaggacacc cttgcaacct acatgaggac tcttgacatg 1321 gaggaggaga aggcgaggaa gctgtccacc aagtctgcct caccggacat cgtgggcacg 1381 atcaactccc tactggcaag gattgcagcc gcccggtctc tagtacacag ggctaaggag 1441 gaattgtcaa gcagaccccg accggttgtt gtgatgattt cagggagacc aggtataggg 1501 aaaacccacc tagctagaga cctggcaaag aagatcgcct cctcactcgc gggtgaccag 1561 agggtgggtc tcatcccacg caacggagtt gaccactggg acgcgtacaa aggtgaaaga 1621 gtcgtcctct gggacgacta cgggatgagc aaccccatcc acgacgctct cagactccag 1681 gagctcgctg acacctgccc cctcacactc aactgtgaca ggattgagaa caaaggcaaa 1741 gtctttgaca gtgacgccat aataataacc accaacttag ccaacccagc gccactggat 1801 tatgtcaatt ttgaagcttg ctcaagacgt atagacttcc tcgtctatgc tgatgcccct 1861 gaagttgaga aggccaaacg agacttccca ggccagccag atatgtggaa agacaccttc 1921 agacctgact tctcacacat aaaactgtca ctagccccac aaggaggttt tgacaagaat 1981 ggcaacaccc ctcatggaaa gggtgtcatg aagaccctca ctgctggttc tctcgttgcc 2041 cgagcatcag ggctcctcca tgagagacta gacgagtatg agctgcaggg cccaaccccc 2101 acaacattca acttcgaccg caacaaggtg cttgctttta gacaactcgc tgctgagaac 2161 aagtacggcc ttgttgacac aatgagggtc ggatcacagc tcaagaatgt caaaaccatg 2221 acagaactca aacaggctct caagaacatc tcagtcaaga aatgccagct tgtgtatggt 2281 gggggcacgt acacacttga atctgatggc aaaggcaacg tgcacgtcga gaaggtgaag 2341 aacgccaatg tgcaaactaa caacgagctc tccggggctc tgcaccacct caggtgtgcc 2401 aggatcaggt actatgtcaa gtgtgtccag gaagctctct actccatctt gcaaattgcc 2461 ggggctgcat tcatcaccac gcgcattgta aagcgcatga acatacaaaa cctctggtcc 2521 aaaccacaag tggaagacct ggaggaggct ggcaatgagg agggttgccc aaaacctaga 2581 aatgatgagg aatttgtcat ctcctctgat gacatcaaaa ccgagggcaa gaaaggaaag 2641 aacaaagctg gccgtggcaa gaagcacaca gccttttcca gcaaaggtct cagtgatgag 2701 gagtacgatg agtttaagag aattagggaa gagagaaacg gcaagtactc catagaggaa 2761 tacctacatg acagggacaa gtactatgag gaagtggcca tagccagagc aaccgaggaa 2821 gacttctgtg aagaagagga ggctaaaatc cggcaaagga tttttagacc aacaaggaaa 2881 caacgcaaag aggaaaggtg ctctctcggt ttggtcacag gttctgagat caggaagaga 2941 aacccagatg acttcaagcc caaagggaaa ctgtgggccg atgatgacag gagtgttgac 3001 tacgctgaga aactcagttt cgaggctcca ccaagcattt ggtcacgaat agtcaacttt 3061 ggttcagggt ggggtttttg ggtctcgccc agtctcttta taacatctac ccatgtcatc 3121 ccccaaggcg cacaggagtt ctttggagtg cccatcaaac agatacaaat tcataaatca 3181 ggtgagttct gccggctcag gttcccaaaa ccaatcagga cagacgtttc gggcatgatc 3241 ttggaggaag gtgctccaga aggcactgtt gccacaattc tcatcaagag accaaccggg 3301 gaactcatgc ccttggcggc cagaatgggc actcacgcga ctatgaaaat ccagggtcgc 3361 actgttggtg gacagatggg catgctactc acagggtcca acgccaagaa catggatttg 3421 ggcacgactc ctggcgactg tgggtgtccc tacatataca agagagggaa cgactacgtg 3481 gtcattggag tccacaccgc cgccgctcgt ggagggaaca ccgtcatctg tgcaacccag 3541 ggaagcgagg gtgaggccac gctagaaggt ggtgacaaca aaggaactta ctgtggagca 3601 ccaatattgg gccctggaaa tgcccccaaa ctcagcacca aaaccaaatt ctggagatct 3661 tccaccaccc ccctaccacc cggaacctat gaaccagctt atctgggtgg taaggacccc 3721 agagtgaaag gtggcccctc actgcaacag gttatgaggg accaactaaa gcctttcact 3781 gagcccagag gcaagccacc caaaccgggt gtactagaag ctgccaagaa gaccataatc 3841 aatgtgcttg aacagacaat agacccaccc caaaaatgga catacgcaca ggcgtgtgca 3901 tcattggaca aaacaacttc cagcggtcac cctcaccaca tgcggaagaa tgagtgctgg 3961 aacggggaaa ctttcacagg gaaactggca gaccaagcat caaaggctaa cctaatgtat 4021 gaagaaggaa agaacatgac cccagtatac acaggagctt tgaaggatga gctagtcaag 4081 actgataaga tctacgggaa aatcaaaaag aggctcctgt gggggtcaga cctatcgacc 4141 atgatacggt gtgcacgagc ctttggtggg ctgatggacg agctcaaggc ccactgcgtc 4201 tcactaccgg tcagggttgg tatgaacatg aatgaggatg gacccataat atttgagaaa 4261 cactccaggt acaaatacca ctatgatgca gactactccc gctgggactc aacgcaacag 4321 agggcagtgc tggctgcagc tttggaaata atggtcaaat tttcaccaga accccagcta 4381 gcccaagtag ttgcagaaga cctcttgtcc cccagtgtga tggacgtggg tgacttcaag 4441 atatcaatca acgaggggtt gccctctggt gtgccttgta cctcacaatg gaactccatt 4501 gcccactggc tcctaacact gtgtgcactg tctgaagtca cagatctgtc ccctgacatc 4561 atccaggcaa actccctatt ctccttttat ggtgatgatg aaatagtgag cacagacatc 4621 aaactggacc cagaaaaatt gacatcaaaa ctgaaggaat acgggctaaa accaacccgc 4681 cctgataaaa cagaaggacc cctaattata tctgaagact tgaatggcct aaccttcttg 4741 cggagaacgg taacccgtga tccggctggg tggtttggca aactggatca aagttcaata 4801 ctcagacaga tgtactggac cagaggacca aaccatgagg acccttctga gacaatgata 4861 ccacactccc agagacccat acaactgatg tcactactag gtgaagctgc actgcatggc 4921 ccatcattct acagcaaaat cagcaaactg gtcatctcag aattgaaaga gggtggaatg 4981 gacttttacg tgcccaggca agaaccaatg tttaggtgga tgagattctc agatttgagc 5041 acgtgggagg gcgatcgcaa tctggctccc agttatgtga atgaagatgg cgtcgaatga 5101 cgctgctcca tctagtgatg gtgccgccgg cctcgtccca gagatcaaca atgaggcaat 5161 ggcgctagag ccagtggcgg gtgcagcgat agcagcacct ctcactggcc agcataatat 5221 aattgatccc tggattatga ataattttgt gcaagcacct ggtggtgagt ttacagtgtc 5281 ccctagaaat tcccctggtg aagtgctcct taaattggaa ttgggtccag aaataaaccc 5341 ctatttggcc catcttgcca gaatgtacaa tggttatgca ggtggatttg aagtgcaggt 5401 gatccttgct ggaaatgcgt ttacagcagg gaaaataatc tttgcagctg tacccccaaa 5461 ctttccaatt gacaatttga gcccagcaca gatcacaatg tgcccacatg tgattgtgga 5521 tgtcagacag ctggaaccaa tcaaccttcc aatgcctgat gttcgcaaca ccttctttca 5581 ttacaatcaa ggagctgatt cgagattgtg cttagttgca atgttataca cacctcttag 5641 ggcaaataat tctggtgatg atgtctttac tgtgtcttgt agagtgctga ctaggcctag 5701 ccctgatttc tcattcaact ttcttgtgcc acccactgta gagtcaaaga tgaaaccctt 5761 tactatcccc attctgacta tctctgaaat gtctaactct aggttcccag tgccgattga 5821 ctcactgcac accagcccaa ctgagaatat cgttgtccag tgccaaaatg ggcgcgtcac 5881 ccttgatggt gagttgatgg gcaccaccca gcttctgccc agtcaaatct gtgctttcag 5941 aggcgtgctc actggatcaa caagcagggc cagtgaccaa accgacacag caacccccag 6001 gcagttcaat caccgttggc acatacaatt ggataatcta aatggaaccc cctatgaccc 6061 tgcagaggac gtcccaggcc ccctaggaac accagacttt cggggcaaga tcttcggcgt 6121 ggccagccag aggaaccctg acagcacaac tagagcacat gaagcaaagg tgaacacagc 6181 agctggtcgt ttcaccccaa aattaggttc actagagatg tccactgaat cagatgactt 6241 tgaccaaaat cagccaacaa gattcacccc agtcggcatt ggggttgaca atgaaatgga 6301 ctttcaacag tggtctttac ctaactattc tggtcagctc acccacaaca tgaacctggc 6361 accagctgtt gctcccaact tccctggtga gcagctcctc ttcttccgct cacagttacc 6421 atcctctggt gggcggtcca gcgggatcct agattgcctg gtcccccaag agtgggttca 6481 acatttctac caggagtcgg cccccgctca aacccaagtg gccctggtta gatatgtcaa 6541 ccctgacact ggtagagtgc tgtttgaggc caagttacac aaactaggct tcttgaccat 6601 agctaagaat ggtgattctc caataactgt ccctccaaat ggttacttta ggtttgagtc 6661 ttgggtgaac ccattttaca cacttgcccc catgggaact gggaacgggc gtagaaggat 6721 tcaataatgg ctggagcttt tatagcaggg ttagctggtg acatgttaac aaatactgta 6781 gggtctctgg ttaatgcagg ggctaatgcc atcaatcaaa cagttgaatt tgaaaataac 6841 aaatacttac aaaatgcttc ctttaatcat gataaagaaa tgttgatagc acaagttgag 6901 gcaacaaaga aactacaggc tgatatgatg gccatcaagc agggggtttt gaccgctggc 6961 ggcttctccc ctactgatgc agcccgcggg gcaattaatg cccccatgac aaaagtccta 7021 gattggaatg gaacaaggct ctgggcacca aatgccacct ccacaacctc gatgtcgggt 7081 ggcttcacac atcaggcagt gcgcaggacc acgccagact tcaaaatgaa ccaggctccc 7141 aaatccacac ctagcagtgg gtcttcagtg aggtcaaaca caacccaagt gactagtctg 7201 agctcacact catccgggtc gtcccgatcc agtgggtcta cagtagtcag ctcattacca 7261 tcatctaaca ggactagaga ttgggttaac caacagaact tcaatttgga accacacatg 7321 cctgggtctc tcaggacggc ttttgtcact ccaccatcta gcacagcttc tagttcaggt 7381 acggtctcaa ccgtgcccaa aaatgttttg gactcctgga catctgcgtt taacacgcgc 7441 agacagccgc tattcgcaca ccttcgcaga aggggggagt caaatgttta gtgaaaagat 7501 tgctttaaat ttgatttaaa ttaaatttga tttggaatct ttt //