Complete norovirus genomes
| Accession | Genotype | P-type |
|---|---|---|
| MK764019 | GII.4 Sydney | GII.P31 |

ORF1: 1..5055
ORF2: 5036..6658
ORF3: 6658..7464
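
The ORF coordinates above are 1-based and inclusive, in GenBank location style. As a minimal sketch, the lengths, relative reading frames, and overlaps between adjacent ORFs can be derived from them as shown below. The helper names `orf_length` and `overlap` are illustrative only and are not part of the typing tool. ORF1 is annotated in the record as 5'-partial (`<1..5055`), so its frame is relative to the first base of this record, not to the true genome start.

```python
# ORF coordinates copied from the summary above (1-based, inclusive).
ORFS = {
    "ORF1": (1, 5055),
    "ORF2": (5036, 6658),
    "ORF3": (6658, 7464),
}

def orf_length(start, end):
    """Length in nucleotides of an inclusive 1-based range."""
    return end - start + 1

def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

for name, (start, end) in ORFS.items():
    n = orf_length(start, end)
    # Frame is the offset of the ORF start from record position 1, modulo 3.
    print(name, n, "nt,", n // 3, "codons, frame", (start - 1) % 3)

print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")  # 20 nt
print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")  # 1 nt
```

Each codon count includes the stop codon, which is why the `mat_peptide` for RdRp in the record ends at 5052 while the ORF1 CDS ends at 5055.
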
LOCUS MK764019 7511 bp RNA linear VRL 13-APR-2019
DEFINITION Norovirus GII isolate Hu/US/2015/GII.Pe-GII.4 Sydney/Arlington0551
nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2)
and VP2 (ORF3) genes, complete cds.
ACCESSION MK764019
VERSION MK764019.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7511)
AUTHORS Barclay,L., Cannon,J.L., Wikswo,M.E., Phillips,A., Browne,H.,
Montmayeur,A.M., Tatusov,R.L., Burke,R.M., Hall,A.J. and Vinje,J.
TITLE Emergence and characterization of multiple viruses associated with
a novel GII.P16 polymerase in the United States, 2015-2018
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7511)
AUTHORS Barclay,L., Cannon,J.L., Dickinson,M.K., Montmayeur,A.M.,
Tatusov,R.L., Chhabra,P. and Vinje,J.
TITLE Direct Submission
JOURNAL Submitted (05-APR-2019) Division of Viral Diseases, Centers for
Disease Control and Prevention, 1600 Clifton Road NE, Atlanta, GA
30329, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Geneious v. R10; SPAdes v. 3.6
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7511
/organism="Norovirus GII"
/mol_type="genomic RNA"
/strain="Arlington0551"
/isolate="Hu/US/2015/GII.Pe-GII.4 Sydney/Arlington0551"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="USA"
/collection_date="19-Mar-2015"
/note="genotype: GII.Pe-GII.4 Sydney"
gene <1..5055
/gene="ORF1"
CDS <1..5055
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="QBX92736.1"
/translation="SNNDIAKSSSDGVFSNMAVTFKRALGARPKQPPPKEIPPRPPRP
PTPELVKKIPPPPPNGEDELVVSYSARDGVSGLPELTTVRQPEETNTAFSVPPLNQRE
SRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELTPLSLFWRP
VYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFR
PYQDWNKKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDW
TFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGL
VLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELAMVRSIE
DAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINALL
ARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVG
LIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKV
FDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNA
FSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQG
PALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKC
QIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKCVQEAL
YSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMINKDGCPKPKDDEEFVVSSDD
IKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYY
EEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKP
KGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAK
EFFGVPIKQIQIHKSGEFCRLRFPKSIRTDVTGMILEEGAPEGTVVTLLIKRPTGELM
PLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVV
IGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWR
SSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKK
TIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASK
ANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDLATMIRCARAFGGLMD
ELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIM
VKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCA
LSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGP
LVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQR
PIQLMSLLGEAALHGPVFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWE
GDRNLAPSFVNEDGVE"
mat_peptide <1..945
/gene="ORF1"
/product="p48"
mat_peptide 946..2043
/gene="ORF1"
/product="NTPase"
mat_peptide 2044..2580
/gene="ORF1"
/product="p22"
mat_peptide 2581..2979
/gene="ORF1"
/product="VPg"
mat_peptide 2980..3522
/gene="ORF1"
/product="Pro"
mat_peptide 3523..5052
/gene="ORF1"
/product="RdRp"
gene 5036..6658
/gene="ORF2"
CDS 5036..6658
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QBX92737.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
gene 6658..7464
/gene="ORF3"
CDS 6658..7464
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QBX92738.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAIPARGSSSKSS
NSSVATSVYSNQTTSTRLGSTAGSGASVSSLPSTARTRSWVEDQNRSLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN
1 agcaacaacg acatcgcaaa atcttcaagt gacggtgtgt tttctaacat ggctgtcact
61 tttaagcggg ccctcggggc gcggcctaaa cagccgcccc cgaaggagat accacccaga
121 cccccgcgac cacccacacc agaactggtc aaaaagatcc ctcctccccc acccaacggg
181 gaggatgaac tagtggtctc ttacagcgcc agagatggcg tttccggact gcctgagctc
241 accactgtca ggcaaccgga agaaaccaac acggcgttca gtgtcccccc actcaaccaa
301 agggagagca gggacgccaa ggagccacta actggaacaa ttattgaaat gtgggatgga
361 gaaatctacc attacggcct gtacgtggaa cgaggtctta tacttggtgt gcataagcca
421 ccggcagcta ttagccttgc caaggtcgag ctaacaccgc tctctttgtt ctggagacct
481 gtatacaccc cccagtatct catctctcca gacactctta ggaggttgca tggagagtca
541 ttcccctaca ctgcatttga caacaactgc tatgcctttt gttgttgggt attagaccta
601 aacgactcat ggctaagcag gagaatgatt cagagaacaa caggtttctt caggccgtac
661 caggattgga acaagaaacc cctccccact atggatgact ccaaattaaa gaaggtagcc
721 aacatattct tgtgcacttt gtcttcacta ttcaccagac ccattaagga cataataggg
781 aagttgaaac ctcttaacat ccttaacatt ctggctacgt gtgattggac cttcgcaggc
841 atagtggaat ctttaatact cttagcagaa ctctttggag ttttctggac acccccagat
901 gtgtctgcga tgatcgcccc cttgctaggt gattatgagc tgcaaggacc tgaggacctt
961 gcagtagaac tggtccctat agtgatgggg gggataggtt tggtgctagg atttaccaaa
1021 gagaaaattg gaaagatgct atcatccgct gcatccactt taagagcttg taaagacctt
1081 ggtgcatacg gactggaaat cttaaaattg gtcatgaagt ggttcttccc aaagaaagag
1141 gaagcaaatg aactggctat ggtgagatcc atcgaggatg cagtgctaga cctcgaggca
1201 attgaaaaca accacatgac caccctactc aaagacaaag acagcttggc aacctacatg
1261 agaacccttg accttgagga ggagaaagcc agaaaactct caaccaaatc tgcttcaccc
1321 gatattgtgg gcacaatcaa cgctcttctg gcaagaatcg ctgctgcacg ctccctagtg
1381 catcgggcga aagaagagct ctccagcagg ccgagacctg tcgttgtgat gatatcggga
1441 agaccaggga tagggaaaac tcaccttgcc agggagctgg ccaagaaaat cgcggcctcc
1501 ctcacagggg accagcgtgt gggtcttatc ccacgcaatg gtgtcgatca ctgggacgca
1561 tacaagggcg aaagagttgt cctatgggac gactatggaa tgagcaaccc catccatgat
1621 gccctcaggt tgcaggaact tgctgacact tgccccctaa cgctaaattg tgacagaatt
1681 gagaacaaag ggaaagtctt tgacagtgat gccataatta tcaccaccaa tctggccaac
1741 ccagcaccac tggattatgt caattttgaa gcatgctcga gacgtattga cttcctcgtg
1801 tacgcagaag cccctgaggt ggagaaggca aagcgtgact tcccaggtca acctgacatg
1861 tggaagaacg ctttcagtcc tgacttctca cacataaaac tgtcattggc tccacagggt
1921 ggttttgaca agaacggtaa caccccgcat ggaaaagggg tcatgaagac cctcaccact
1981 ggctccctca tcgcccgagc atcagggtta ctccatgaga ggctagatga atatgaactg
2041 caaggcccag ccctcaccac tttcaacttt gaccgcaaca agatacttgc ttttagacag
2101 cttgctgctg aaaacaagta tgggctgatg gacacaatga gagttggaaa acagctcaag
2161 gatgtcaaga ccatgtcaga cctcaaacaa gcactcaaga acatcgcgat caagaagtgc
2221 cagatagtgt acaatggtgg cacctacaca cttgaggccg atggcaaggg tagtgtgaaa
2281 gttgacaaag tgcaaagtgc cactgtgcag accaacaacg aactagccgg cgccctacac
2341 cacctgaggt gcgctagaat cagatactat gttaagtgcg tccaggaggc gctgtattcc
2401 atcatccaaa tcgctggggc tgcattcgtc accacgcgca tcgctaaacg catgaatata
2461 cagaatctct ggtccaagcc acaggtggaa gacacagaag agatgatcaa caaagatggt
2521 tgcccaaaac ccaaagatga tgaagagttt gtcgtctcat ccgacgacat caaaactgag
2581 ggcaagaaag ggaagaacaa gtccggccgt ggcaagaagc acacagcctt ttcaagcaaa
2641 gggctcagtg atgaggagta cgatgagtac aagagaatca gagaagaaag gaatggtaag
2701 tactccatag aagaatacct tcaggacaga gacaggtact acgaggaggt ggccattgcc
2761 agggcaaccg aagaggactt ctgtgaagaa gaagaggcca aaatccggca gagaattttc
2821 agaccaacaa ggaaacaacg caaagaagag agggcctctc tcggcttggt cacaggctct
2881 gaaatcagga agagaaaccc agaagacttc aaacccaagg ggaagctgtg ggctgatgat
2941 gacagaagtg ttgactacaa tgagaaactc aactttgagg ccccaccaag catttggtcg
3001 cggatagtca attttggttc aggctggggc ttctgggtct cccccagtct gtttataaca
3061 tcaacccatg tcatacccca aggtgcaaaa gagttcttcg gagtccctat caagcaaatc
3121 caaatacaca aatcaggtga attctgccgg ttgaggttcc ctaagtcaat cagaactgat
3181 gtgacgggca tgattctaga agaaggtgcg cccgagggga ccgtggtcac actgctcatc
3241 aagagaccaa ctggagagct catgcctctg gcagccagaa tggggaccca tgcaaccatg
3301 aaaattcagg ggcgcacagt tggagggcaa atgggtatgc tcctgacagg atccaacgcc
3361 aagagtatgg acctaggcac aacaccaggc gactgcggct gcccctacat ctacaagagg
3421 gggaatgact acgtggtcat aggagtccat acggccgccg cccgtggagg aaacaccgtc
3481 atatgtgcca cccaggggag tgagggagaa gccacacttg aaggaggtga cagtaaaggg
3541 acatactgtg gcgcaccaat cttgggccca gggagcgctc cgaagctcag taccaagact
3601 aagttttgga gatcatccac aacaccactc ccacctggca cctacgaacc agcctacctc
3661 ggtggcaaag accccagagt caaaggtggc ccttcattgc aacaagttat gagggaccag
3721 ctaaaaccat tcacagaacc cagaggcaaa ccaccaaggc caaatgtgtt ggaagctgcc
3781 aagaaaacca tcatcaatgt ccttgagcaa acaattgatc caccccagaa atggtcattt
3841 gcgcaagctt gcgcgtccct tgacaaaacc acttccagcg gccacccgca ccacatgcgg
3901 aaaaacgatt gttggaatgg ggagtccttt acaggaaaat tggctgatca agcctccaag
3961 gccaacctaa tgtttgaaga gggaaagaac atgactccag tctacacagg tgcacttaaa
4021 gatgagttgg ttaagaccga caaagtttat ggtaagatca agaagaggct tctgtggggt
4081 tcagatctgg cgaccatgat acggtgcgcc cgagcttttg gaggccttat ggatgaactc
4141 aaggcgcact gtgtcacact tcctgtcaga gttggcatga acatgaatga ggatggcccc
4201 atcatctttg agaagcactc cagatataga tatcactatg acgctgatta ttcccggtgg
4261 gactcaacac aacaaaggga tgtgctagca gcagcactag aaatcatggt taagttctcc
4321 ccagaaccac acctggccca ggtagttgca gaagacctcc tttcccctag cgtaatggat
4381 gtaggtgact ttcaaatatc aataagtgag ggtcttccct ctggggtgcc ttgtacctcc
4441 cagtggaatt ccatcgccca ctggctcctc actctttgtg cactctctga agtcacggac
4501 ctgtcccctg acatcattca ggccaactcc cttttctcct tctatggtga tgatgagatt
4561 gtgagcacag atataaagtt ggacccagag aagctgacag caaaactcaa ggagtacggg
4621 ctgaaaccaa cccgccccga caaaactgaa ggaccccttg ttatctctga agacctggat
4681 ggcctaacat ttctccggag aactgtgacc cgtgacccag ctggttggtt tggaaaattg
4741 gaacaaagtt caattcttag acaaatgtac tggaccaggg gtcccaacca tgaagatcca
4801 ttcgaaacaa tgataccgca ctcccaaaga cccatacaat tgatgtcctt gctgggcgag
4861 gctgcactcc acggcccggt attttatagc aaaattagca aattagtcat tgcagagttg
4921 aaggaaggtg gcatggattt ttacgtgccc agacaagagc caatgttcag gtggatgaga
4981 ttctcagatc tgagcacgtg ggagggcgat cgcaatctgg ctcccagttt tgtgaatgaa
5041 gatggcgtcg agtgacgcca acccatctga tgggtccgca gccaacctcg tcccagaggt
5101 caacaatgag gttatggctc tggagcccgt tgttggtgcc gctattgcgg cacctgtagc
5161 gggccaacaa aatgtaattg acccctggat tagaaacaat tttgtacaag cccctggtgg
5221 agagtttaca gtatccccta gaaacgctcc aggtgaaata ctatggagcg cgcccttggg
5281 ccctgatcta aatccctacc tatcccattt ggccagaatg tacaatggtt atgcaggtgg
5341 ttttgaagtg caggtaattc tcgcggggaa cgcgttcacc gccgggaagg tcatatttgc
5401 agcagtccca ccaaattttc caactgaagg cttgagccct agccaggtca ctatgttccc
5461 ccatatagta gtagatgtta ggcaactaga acctgtactg attcccttac ccgatgttag
5521 gaacaatttc tatcattaca atcaatcaaa tgaccccacc attaagttga tagcaatgtt
5581 gtatacacca cttagggcta ataatgctgg ggatgatgtc ttcacagttt cttgccgagt
5641 tctcacgaga ccatcccccg attttgactt catatttcta gtgccaccca cagttgagtc
5701 aagaactaaa ccattctctg tcccagtttt aactgttgag gagatgacca attcaagatt
5761 ccccattcct ttggaaaagc tgttcactgg ccccagcagt gcctttgttg tccaaccaca
5821 aaacggcagg tgcacgactg atggcgtgct cctaggcacc acccaactgt ctcctgtcaa
5881 catctgcacc ttcagaggag atgtcaccca catcacaggc agtcacaact acacaatgaa
5941 tttggcttct caaaattgga ataattatga cccaacagaa gaaatcccag cccctctagg
6001 aactccagac tttgtgggga agattcaagg tgtgctcacc caaaccacaa ggacagatgg
6061 ctcaacacgc ggccacaaag ccacagtgta cactgggagc gccgactttg ctccaaaact
6121 gggtagagtt caatttgaaa ctgacacaaa ccatgatttt gaagctaacc aaaacacaaa
6181 gttcacccca gtcggtgtca tccaagatgg tagtaccacc caccgaaatg aaccccaaca
6241 gtgggtgctc ccaagttact caggcagaaa tactcataat gtgcatctgg cccccgctgt
6301 agcccccact tttccgggtg agcaacttct cttcttcaga tccaccatgc ccggctgcag
6361 cgggtacccc aacatggatt tggattgtct gctcccccag gaatgggtgc agtacttcta
6421 ccaagaggca gccccagcac aatctgatgt ggctctgcta agatttgtga atccagacac
6481 aggtagggtt ttgtttgagt gtaagctcca taaatcaggc tatgttacag tggctcacac
6541 tggccaacat gatttggtca ttccccccaa tggttatttt agatttgatt cctgggtcaa
6601 ccagttctac acgcttgccc ccatgggaaa tggaacgggg cgtagacgtg cagtataatg
6661 gctggagctt tctttgctgg attggcatct gatgtccttg gctctggact tggttccctt
6721 atcaatgctg gggctggggc catcaaccaa aaagttgagt ttgaaaacaa cagaaaattg
6781 caacaagcat ccttccaatt tagcagcaat ctacaacagg cttcctttca acatgacaaa
6841 gagatgctcc aagcacaaat tgaggccacc aaaaagttac aacaggaaat gatgagagtt
6901 aagcaggcaa tgctcctaga gggtgggttc tctgagacgg atgcagcccg tggggcaatc
6961 aacgccccca tgacaaaagc tctggattgg agcgggacaa ggtactgggc tcccgatgct
7021 agaactacaa catacaatgc aggccgcttt tctacccctc aaccatcggg ggcactgcca
7081 ggaagagcta atcttaggga tgctatccct gctcggggtt cctccagtaa atcttctaac
7141 tcttctgtag ctacttctgt gtactcaaat caaactactt caacgagact tggttctaca
7201 gctggttctg gcgccagtgt ctcgagcctc ccgtcaactg caaggactag gagttgggtt
7261 gaggatcaaa ataggagttt gtcacctttc atgagggggg cccacaatat atcgttcgtc
7321 accccaccat ctagcagatc ctctagccaa ggcacagtct caaccgtgcc caaagagatt
7381 ttggactcct ggactggcgc tttcaacacg cgcaggcagc cactcttcgc tcacattcgt
7441 aagcgagggg agtcacgggc gtaatgtgaa aagacaaaat tgattatctt tctctttctt
7501 tagtgtcttt t
//
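
The ORIGIN block stores the sequence in groups of ten bases, each line prefixed by the position of its first base. The following sketch, which assumes plain Python with no external libraries, shows one way to strip the position column and translate the result with the standard genetic code. The function names `parse_origin` and `translate` are hypothetical, and only the first ORIGIN line of this record is used as input to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "tcag"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def parse_origin(lines):
    """Join ORIGIN lines into one lowercase sequence, dropping the position column."""
    chunks = []
    for line in lines:
        parts = line.split()
        if not parts or not parts[0].isdigit():
            continue  # skip 'ORIGIN', '//' and blank lines
        chunks.extend(parts[1:])
    return "".join(chunks).lower()

def translate(seq):
    """Translate complete codons from the start of seq; 'X' marks unknown codons."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )

# First sequence line of the record above, with its surrounding markers.
origin_lines = [
    "ORIGIN",
    "1 agcaacaacg acatcgcaaa atcttcaagt gacggtgtgt tttctaacat ggctgtcact",
    "//",
]
sequence = parse_origin(origin_lines)
print(len(sequence))        # 60
print(translate(sequence))  # SNNDIAKSSSDGVFSNMAVT
```

The translated peptide matches the first 20 residues of the ORF1 `/translation` qualifier, consistent with `/codon_start=1` on the partial CDS.
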