Complete norovirus genomes
| Accession | Genotype | P-type |
|---|---|---|
| MK764019 | GII.4 Sydney | GII.P31 |

ORF1: 1..5055
ORF2: 5036..6658
ORF3: 6658..7464

LOCUS       MK764019                7511 bp    RNA     linear   VRL 13-APR-2019
DEFINITION  Norovirus GII isolate Hu/US/2015/GII.Pe-GII.4 Sydney/Arlington0551
            nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2)
            and VP2 (ORF3) genes, complete cds.
ACCESSION   MK764019
VERSION     MK764019.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7511)
  AUTHORS   Barclay,L., Cannon,J.L., Wikswo,M.E., Phillips,A., Browne,H.,
            Montmayeur,A.M., Tatusov,R.L., Burke,R.M., Hall,A.J. and Vinje,J.
  TITLE     Emergence and characterization of multiple viruses associated with
            a novel GII.P16 polymerase in the United States, 2015-2018
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7511)
  AUTHORS   Barclay,L., Cannon,J.L., Dickinson,M.K., Montmayeur,A.M.,
            Tatusov,R.L., Chhabra,P. and Vinje,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-APR-2019) Division of Viral Diseases, Centers for
            Disease Control and Prevention, 1600 Clifton Road NE, Atlanta, GA
            30329, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method :: Geneious v. R10; SPAdes v.
3.6 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7511 /organism="Norovirus GII" /mol_type="genomic RNA" /strain="Arlington0551" /isolate="Hu/US/2015/GII.Pe-GII.4 Sydney/Arlington0551" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="19-Mar-2015" /note="genotype: GII.Pe-GII.4 Sydney" gene <1..5055 /gene="ORF1" CDS <1..5055 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QBX92736.1" /translation="SNNDIAKSSSDGVFSNMAVTFKRALGARPKQPPPKEIPPRPPRP PTPELVKKIPPPPPNGEDELVVSYSARDGVSGLPELTTVRQPEETNTAFSVPPLNQRE SRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELTPLSLFWRP VYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFR PYQDWNKKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDW TFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGL VLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELAMVRSIE DAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINALL ARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVG LIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKV FDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNA FSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQG PALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKC QIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKCVQEAL YSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMINKDGCPKPKDDEEFVVSSDD IKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYY EEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKP KGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAK EFFGVPIKQIQIHKSGEFCRLRFPKSIRTDVTGMILEEGAPEGTVVTLLIKRPTGELM PLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVV IGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWR SSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKK TIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASK ANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDLATMIRCARAFGGLMD ELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIM 
VKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCA LSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGP LVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQR PIQLMSLLGEAALHGPVFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWE GDRNLAPSFVNEDGVE" mat_peptide <1..945 /gene="ORF1" /product="p48" mat_peptide 946..2043 /gene="ORF1" /product="NTPase" mat_peptide 2044..2580 /gene="ORF1" /product="p22" mat_peptide 2581..2979 /gene="ORF1" /product="VPg" mat_peptide 2980..3522 /gene="ORF1" /product="Pro" mat_peptide 3523..5052 /gene="ORF1" /product="RdRp" gene 5036..6658 /gene="ORF2" CDS 5036..6658 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QBX92737.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV" gene 6658..7464 /gene="ORF3" CDS 6658..7464 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QBX92738.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAIPARGSSSKSS NSSVATSVYSNQTTSTRLGSTAGSGASVSSLPSTARTRSWVEDQNRSLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 agcaacaacg acatcgcaaa atcttcaagt gacggtgtgt tttctaacat ggctgtcact 61 tttaagcggg ccctcggggc gcggcctaaa cagccgcccc cgaaggagat accacccaga 121 cccccgcgac cacccacacc agaactggtc aaaaagatcc ctcctccccc acccaacggg 181 gaggatgaac tagtggtctc ttacagcgcc agagatggcg tttccggact 
gcctgagctc 241 accactgtca ggcaaccgga agaaaccaac acggcgttca gtgtcccccc actcaaccaa 301 agggagagca gggacgccaa ggagccacta actggaacaa ttattgaaat gtgggatgga 361 gaaatctacc attacggcct gtacgtggaa cgaggtctta tacttggtgt gcataagcca 421 ccggcagcta ttagccttgc caaggtcgag ctaacaccgc tctctttgtt ctggagacct 481 gtatacaccc cccagtatct catctctcca gacactctta ggaggttgca tggagagtca 541 ttcccctaca ctgcatttga caacaactgc tatgcctttt gttgttgggt attagaccta 601 aacgactcat ggctaagcag gagaatgatt cagagaacaa caggtttctt caggccgtac 661 caggattgga acaagaaacc cctccccact atggatgact ccaaattaaa gaaggtagcc 721 aacatattct tgtgcacttt gtcttcacta ttcaccagac ccattaagga cataataggg 781 aagttgaaac ctcttaacat ccttaacatt ctggctacgt gtgattggac cttcgcaggc 841 atagtggaat ctttaatact cttagcagaa ctctttggag ttttctggac acccccagat 901 gtgtctgcga tgatcgcccc cttgctaggt gattatgagc tgcaaggacc tgaggacctt 961 gcagtagaac tggtccctat agtgatgggg gggataggtt tggtgctagg atttaccaaa 1021 gagaaaattg gaaagatgct atcatccgct gcatccactt taagagcttg taaagacctt 1081 ggtgcatacg gactggaaat cttaaaattg gtcatgaagt ggttcttccc aaagaaagag 1141 gaagcaaatg aactggctat ggtgagatcc atcgaggatg cagtgctaga cctcgaggca 1201 attgaaaaca accacatgac caccctactc aaagacaaag acagcttggc aacctacatg 1261 agaacccttg accttgagga ggagaaagcc agaaaactct caaccaaatc tgcttcaccc 1321 gatattgtgg gcacaatcaa cgctcttctg gcaagaatcg ctgctgcacg ctccctagtg 1381 catcgggcga aagaagagct ctccagcagg ccgagacctg tcgttgtgat gatatcggga 1441 agaccaggga tagggaaaac tcaccttgcc agggagctgg ccaagaaaat cgcggcctcc 1501 ctcacagggg accagcgtgt gggtcttatc ccacgcaatg gtgtcgatca ctgggacgca 1561 tacaagggcg aaagagttgt cctatgggac gactatggaa tgagcaaccc catccatgat 1621 gccctcaggt tgcaggaact tgctgacact tgccccctaa cgctaaattg tgacagaatt 1681 gagaacaaag ggaaagtctt tgacagtgat gccataatta tcaccaccaa tctggccaac 1741 ccagcaccac tggattatgt caattttgaa gcatgctcga gacgtattga cttcctcgtg 1801 tacgcagaag cccctgaggt ggagaaggca aagcgtgact tcccaggtca acctgacatg 1861 tggaagaacg ctttcagtcc tgacttctca cacataaaac tgtcattggc tccacagggt 1921 
ggttttgaca agaacggtaa caccccgcat ggaaaagggg tcatgaagac cctcaccact 1981 ggctccctca tcgcccgagc atcagggtta ctccatgaga ggctagatga atatgaactg 2041 caaggcccag ccctcaccac tttcaacttt gaccgcaaca agatacttgc ttttagacag 2101 cttgctgctg aaaacaagta tgggctgatg gacacaatga gagttggaaa acagctcaag 2161 gatgtcaaga ccatgtcaga cctcaaacaa gcactcaaga acatcgcgat caagaagtgc 2221 cagatagtgt acaatggtgg cacctacaca cttgaggccg atggcaaggg tagtgtgaaa 2281 gttgacaaag tgcaaagtgc cactgtgcag accaacaacg aactagccgg cgccctacac 2341 cacctgaggt gcgctagaat cagatactat gttaagtgcg tccaggaggc gctgtattcc 2401 atcatccaaa tcgctggggc tgcattcgtc accacgcgca tcgctaaacg catgaatata 2461 cagaatctct ggtccaagcc acaggtggaa gacacagaag agatgatcaa caaagatggt 2521 tgcccaaaac ccaaagatga tgaagagttt gtcgtctcat ccgacgacat caaaactgag 2581 ggcaagaaag ggaagaacaa gtccggccgt ggcaagaagc acacagcctt ttcaagcaaa 2641 gggctcagtg atgaggagta cgatgagtac aagagaatca gagaagaaag gaatggtaag 2701 tactccatag aagaatacct tcaggacaga gacaggtact acgaggaggt ggccattgcc 2761 agggcaaccg aagaggactt ctgtgaagaa gaagaggcca aaatccggca gagaattttc 2821 agaccaacaa ggaaacaacg caaagaagag agggcctctc tcggcttggt cacaggctct 2881 gaaatcagga agagaaaccc agaagacttc aaacccaagg ggaagctgtg ggctgatgat 2941 gacagaagtg ttgactacaa tgagaaactc aactttgagg ccccaccaag catttggtcg 3001 cggatagtca attttggttc aggctggggc ttctgggtct cccccagtct gtttataaca 3061 tcaacccatg tcatacccca aggtgcaaaa gagttcttcg gagtccctat caagcaaatc 3121 caaatacaca aatcaggtga attctgccgg ttgaggttcc ctaagtcaat cagaactgat 3181 gtgacgggca tgattctaga agaaggtgcg cccgagggga ccgtggtcac actgctcatc 3241 aagagaccaa ctggagagct catgcctctg gcagccagaa tggggaccca tgcaaccatg 3301 aaaattcagg ggcgcacagt tggagggcaa atgggtatgc tcctgacagg atccaacgcc 3361 aagagtatgg acctaggcac aacaccaggc gactgcggct gcccctacat ctacaagagg 3421 gggaatgact acgtggtcat aggagtccat acggccgccg cccgtggagg aaacaccgtc 3481 atatgtgcca cccaggggag tgagggagaa gccacacttg aaggaggtga cagtaaaggg 3541 acatactgtg gcgcaccaat cttgggccca gggagcgctc cgaagctcag taccaagact 3601 aagttttgga 
gatcatccac aacaccactc ccacctggca cctacgaacc agcctacctc 3661 ggtggcaaag accccagagt caaaggtggc ccttcattgc aacaagttat gagggaccag 3721 ctaaaaccat tcacagaacc cagaggcaaa ccaccaaggc caaatgtgtt ggaagctgcc 3781 aagaaaacca tcatcaatgt ccttgagcaa acaattgatc caccccagaa atggtcattt 3841 gcgcaagctt gcgcgtccct tgacaaaacc acttccagcg gccacccgca ccacatgcgg 3901 aaaaacgatt gttggaatgg ggagtccttt acaggaaaat tggctgatca agcctccaag 3961 gccaacctaa tgtttgaaga gggaaagaac atgactccag tctacacagg tgcacttaaa 4021 gatgagttgg ttaagaccga caaagtttat ggtaagatca agaagaggct tctgtggggt 4081 tcagatctgg cgaccatgat acggtgcgcc cgagcttttg gaggccttat ggatgaactc 4141 aaggcgcact gtgtcacact tcctgtcaga gttggcatga acatgaatga ggatggcccc 4201 atcatctttg agaagcactc cagatataga tatcactatg acgctgatta ttcccggtgg 4261 gactcaacac aacaaaggga tgtgctagca gcagcactag aaatcatggt taagttctcc 4321 ccagaaccac acctggccca ggtagttgca gaagacctcc tttcccctag cgtaatggat 4381 gtaggtgact ttcaaatatc aataagtgag ggtcttccct ctggggtgcc ttgtacctcc 4441 cagtggaatt ccatcgccca ctggctcctc actctttgtg cactctctga agtcacggac 4501 ctgtcccctg acatcattca ggccaactcc cttttctcct tctatggtga tgatgagatt 4561 gtgagcacag atataaagtt ggacccagag aagctgacag caaaactcaa ggagtacggg 4621 ctgaaaccaa cccgccccga caaaactgaa ggaccccttg ttatctctga agacctggat 4681 ggcctaacat ttctccggag aactgtgacc cgtgacccag ctggttggtt tggaaaattg 4741 gaacaaagtt caattcttag acaaatgtac tggaccaggg gtcccaacca tgaagatcca 4801 ttcgaaacaa tgataccgca ctcccaaaga cccatacaat tgatgtcctt gctgggcgag 4861 gctgcactcc acggcccggt attttatagc aaaattagca aattagtcat tgcagagttg 4921 aaggaaggtg gcatggattt ttacgtgccc agacaagagc caatgttcag gtggatgaga 4981 ttctcagatc tgagcacgtg ggagggcgat cgcaatctgg ctcccagttt tgtgaatgaa 5041 gatggcgtcg agtgacgcca acccatctga tgggtccgca gccaacctcg tcccagaggt 5101 caacaatgag gttatggctc tggagcccgt tgttggtgcc gctattgcgg cacctgtagc 5161 gggccaacaa aatgtaattg acccctggat tagaaacaat tttgtacaag cccctggtgg 5221 agagtttaca gtatccccta gaaacgctcc aggtgaaata ctatggagcg cgcccttggg 5281 ccctgatcta aatccctacc 
tatcccattt ggccagaatg tacaatggtt atgcaggtgg 5341 ttttgaagtg caggtaattc tcgcggggaa cgcgttcacc gccgggaagg tcatatttgc 5401 agcagtccca ccaaattttc caactgaagg cttgagccct agccaggtca ctatgttccc 5461 ccatatagta gtagatgtta ggcaactaga acctgtactg attcccttac ccgatgttag 5521 gaacaatttc tatcattaca atcaatcaaa tgaccccacc attaagttga tagcaatgtt 5581 gtatacacca cttagggcta ataatgctgg ggatgatgtc ttcacagttt cttgccgagt 5641 tctcacgaga ccatcccccg attttgactt catatttcta gtgccaccca cagttgagtc 5701 aagaactaaa ccattctctg tcccagtttt aactgttgag gagatgacca attcaagatt 5761 ccccattcct ttggaaaagc tgttcactgg ccccagcagt gcctttgttg tccaaccaca 5821 aaacggcagg tgcacgactg atggcgtgct cctaggcacc acccaactgt ctcctgtcaa 5881 catctgcacc ttcagaggag atgtcaccca catcacaggc agtcacaact acacaatgaa 5941 tttggcttct caaaattgga ataattatga cccaacagaa gaaatcccag cccctctagg 6001 aactccagac tttgtgggga agattcaagg tgtgctcacc caaaccacaa ggacagatgg 6061 ctcaacacgc ggccacaaag ccacagtgta cactgggagc gccgactttg ctccaaaact 6121 gggtagagtt caatttgaaa ctgacacaaa ccatgatttt gaagctaacc aaaacacaaa 6181 gttcacccca gtcggtgtca tccaagatgg tagtaccacc caccgaaatg aaccccaaca 6241 gtgggtgctc ccaagttact caggcagaaa tactcataat gtgcatctgg cccccgctgt 6301 agcccccact tttccgggtg agcaacttct cttcttcaga tccaccatgc ccggctgcag 6361 cgggtacccc aacatggatt tggattgtct gctcccccag gaatgggtgc agtacttcta 6421 ccaagaggca gccccagcac aatctgatgt ggctctgcta agatttgtga atccagacac 6481 aggtagggtt ttgtttgagt gtaagctcca taaatcaggc tatgttacag tggctcacac 6541 tggccaacat gatttggtca ttccccccaa tggttatttt agatttgatt cctgggtcaa 6601 ccagttctac acgcttgccc ccatgggaaa tggaacgggg cgtagacgtg cagtataatg 6661 gctggagctt tctttgctgg attggcatct gatgtccttg gctctggact tggttccctt 6721 atcaatgctg gggctggggc catcaaccaa aaagttgagt ttgaaaacaa cagaaaattg 6781 caacaagcat ccttccaatt tagcagcaat ctacaacagg cttcctttca acatgacaaa 6841 gagatgctcc aagcacaaat tgaggccacc aaaaagttac aacaggaaat gatgagagtt 6901 aagcaggcaa tgctcctaga gggtgggttc tctgagacgg atgcagcccg tggggcaatc 6961 aacgccccca tgacaaaagc tctggattgg 
agcgggacaa ggtactgggc tcccgatgct 7021 agaactacaa catacaatgc aggccgcttt tctacccctc aaccatcggg ggcactgcca 7081 ggaagagcta atcttaggga tgctatccct gctcggggtt cctccagtaa atcttctaac 7141 tcttctgtag ctacttctgt gtactcaaat caaactactt caacgagact tggttctaca 7201 gctggttctg gcgccagtgt ctcgagcctc ccgtcaactg caaggactag gagttgggtt 7261 gaggatcaaa ataggagttt gtcacctttc atgagggggg cccacaatat atcgttcgtc 7321 accccaccat ctagcagatc ctctagccaa ggcacagtct caaccgtgcc caaagagatt 7381 ttggactcct ggactggcgc tttcaacacg cgcaggcagc cactcttcgc tcacattcgt 7441 aagcgagggg agtcacgggc gtaatgtgaa aagacaaaat tgattatctt tctctttctt 7501 tagtgtcttt t //
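
The ORF coordinates listed at the top of this record (ORF1: 1..5055, ORF2: 5036..6658, ORF3: 6658..7464) imply that adjacent reading frames overlap. The following is a minimal sketch, not part of the typing tool itself, that parses such a coordinate line and reports each ORF's length, its reading frame relative to the first base of the record, and the overlap between neighbouring ORFs. The names `Orf`, `parse_orfs` and `overlap` are hypothetical helpers introduced only for this illustration, and coordinates are assumed to be 1-based and inclusive, as in GenBank feature locations.

```python
import re
from typing import List, NamedTuple


class Orf(NamedTuple):
    """An open reading frame with 1-based, inclusive coordinates."""
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1

    @property
    def frame(self) -> int:
        # Reading frame (0, 1 or 2) relative to the first base of the record.
        return (self.start - 1) % 3


def parse_orfs(text: str) -> List[Orf]:
    """Extract 'ORFn: start..end' entries from a coordinate line."""
    pattern = r"(ORF\d+):\s*(\d+)\.\.(\d+)"
    return [Orf(n, int(s), int(e)) for n, s, e in re.findall(pattern, text)]


def overlap(a: Orf, b: Orf) -> int:
    """Number of nucleotides shared by two ORFs (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Coordinate line copied from the record summary above.
orfs = parse_orfs("ORF1: 1..5055 ORF2: 5036..6658 ORF3: 6658..7464")

if __name__ == "__main__":
    for orf in orfs:
        print(f"{orf.name}: {orf.length} nt, frame {orf.frame}")
    for left, right in zip(orfs, orfs[1:]):
        print(f"{left.name}/{right.name} overlap: {overlap(left, right)} nt")
```

With these coordinates the sketch reports a 20 nt overlap between ORF1 and ORF2 and a 1 nt overlap between ORF2 and ORF3, with ORF2 in a different frame from ORF1 and ORF3. Note that ORF1 is annotated as partial at its 5' end (`<1..5055`), so its length here is that of the sequenced portion only.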