![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| PP658553 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P31 |
ORF1: 5..5104
ORF2: 5085..6707
ORF3: 6707..7513
LOCUS PP658553 7559 bp RNA linear VRL 01-JUN-2024
DEFINITION Norovirus GII isolate PBH23188-STN, complete genome.
ACCESSION PP658553
VERSION PP658553.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7559)
AUTHORS Doungngern,P., Pittayawong-anont,C., Kraipatanapong,S.,
Waikhruea,K., Menkoon,K., Rattanatumhi,K., Supataragul,A.,
Thippamom,N., Khunnawutmanotham,W., Hirunpatrawong,P.,
Wacharapluesadee,S., Sereewit,J., Sobolik,E.B., Roychoudhury,P.,
Greninger,A.L. and Putcharoen,O.
TITLE Direct Submission
JOURNAL Submitted (13-APR-2024) Virology, University of Washington, 850
Republican St, Seattle, WA 98109, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Geneious v. 2023; metaspades v. 2024
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7559
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="PBH23188-STN"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Thailand"
/collection_date="2023-04-11"
/note="genotype: GII"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="WZF77160.1"
/translation="MKMASNDASAAAVANSNNDIAKSSIHGVLSNMAVTFKRALGARP
KQPPPKETPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE
ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLVLGVHKPPAAIS
LAKVELSPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKQEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL
AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM
SDLKQALKNIAIKKCQIVYSGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKGGC
LKPEDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVSGMILEEGAPEG
TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL
ATMVRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 5..994
/gene="ORF1"
/product="p48"
mat_peptide 995..2092
/gene="ORF1"
/product="NTPase"
mat_peptide 2093..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6707
/gene="ORF2"
CDS 5085..6707
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="WZF77161.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHIAGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFVTDTDRDFEANQNTKFTPVGVIQDG
GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6707..7513
/gene="ORF3"
CDS 6707..7513
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="WZF77162.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKVLDWNGTRHWAPDARTTTYNAGRFSTPQPSGALPGRANFRDAVPARGSPSKSS
NSSTATPVYSNQTVSTRLGSTAGSGTSVSSFPPTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN
1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgccaaca gcaacaacga
61 catcgcaaaa tcttccattc acggtgtgct ttctaacatg gctgtcactt ttaagcgggc
121 cctcggggcg cggcctaagc agccgccccc gaaggaaaca ccacccagac ccccgcgacc
181 acccacacca gaattagtca aaaagatccc tcctccccca cccaacgggg aggatgaact
241 agtggtctct tacagcgcca aagatggcgt ttccggacta cctgagctca ctactgtcag
301 acaaccggaa gaaaccaaca cggcgttcag tgttccccca ctcaaccaaa gggagagcag
361 ggacgccaag gagccactaa ctggaacaat cattgagatg tgggatggag aaatctacca
421 ttacggcctg tacgtggaac gaggtcttgt acttggtgtg cacaaaccac cggcagccat
481 tagccttgcc aaggtcgagc tatcaccgct ctccttgttc tggagacctg tatacacccc
541 ccagtatctc atctctccag acactcttag gagattacat ggagagtcat tcccctacac
601 tgcatttgac aacaattgct acgccttttg ttgttgggta ttagacctaa acgactcgtg
661 gctaagcagg agaatgattc agagaacaac aggcttcttc aggccgtacc aggattggaa
721 caggaaaccc ctccccacta tggatgattc caaattaaag aaggtagcca acatattctt
781 gtgcactttg tcttcactat tcaccagacc cattaaggac ataataggga agttgaaacc
841 tcttaacatc cttaacattc tggccacatg tgattggacc ttcgcaggca tagtggaatc
901 cttaatactc ttggcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat
961 gatcgccccc ttgctaggtg attatgaact gcaagggcct gaggaccttg cagtggaact
1021 ggtcccaata gtgatggggg ggataggttt ggtgctagga tttaccaaag agaaaatcgg
1081 aaagatgcta tcatccgctg catccacttt gagagcttgt aaagaccttg gtgcatacgg
1141 actggaaatc ttaaaattgg tcatgaagtg gttcttccca aagaaacagg aagcaaatga
1201 actggctatg gtgagatcca tcgaggatgc agtactagac ctcgaggcaa ttgaaaacaa
1261 ccacatgacc accttactca aagacaaaga cagcttggca acctacatga gaaccctcga
1321 ccttgaggag gagaaagcca gaaaactctc aaccaaatct gcttcacccg atattgtggg
1381 cacaatcaac tctcttctgg caagaatcgc tgctgcacgc tccctagtgc atcgggcgaa
1441 agaagagctc tccagcaggc cgagacctgt cgttgtgatg atatcgggaa gaccagggat
1501 agggaaaact caccttgcca gggagctggc caagaagatc gcggcctccc tcacagggga
1561 ccagcgtgtg ggccttatcc cacgcaatgg tgtcgaccac tgggacgcat acaagggcga
1621 aagagttgtt ctatgggacg actatggaat gagcaacccc atccatgatg ccctcaggtt
1681 gcaggagctt gctgacactt gccccctcac gctaaattgt gacagaattg agaacaaagg
1741 gaaagtcttt gacagtgatg ccataattat caccaccaat ctggccaacc cagcaccact
1801 ggattatgtc aactttgaag cgtgctcgag acgcattgac ttcctcgtgt acgcagaagc
1861 ccctgaggtg gagaaggcaa agcgcgactt cccaggtcaa cctgacatgt ggaagaacgc
1921 tttcagtcct gacttctcac acataaaact gtcattggct ccacagggtg gttttgacaa
1981 gaacggcaac accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat
2041 cgcccgagca tcagggttac tccatgagag gctagatgag tatgaactgc aaggcccagc
2101 cctcaccact ttcaactttg accgaaacaa gatacttgcc tttagacagc ttgctgctga
2161 aaacaagtat gggctgatgg acacaatgag agttggaaaa cagctcaagg atgtcaagac
2221 catgtcagac ctcaaacaag cactcaagaa catcgcaatc aagaagtgcc agatagtgta
2281 cagtggtggc acctacacac ttgaggctga tggcaagggt agtgtgaaag ttgacaaagt
2341 gcaaagtgcc actgtgcaga ccaacaatga actagccggt gccctacacc acctaaggtg
2401 cgctagaatc agatactatg tcaagtgcgt ccaggaggca ctgtattcca tcatccaaat
2461 cgctggggct gcattcgtca ccacgcgcat cgctaagcgc atgaatatac agaatctctg
2521 gtccaagcca caggtggaag acacagaaga gatggccaac aaaggtggtt gcctaaaacc
2581 cgaagatgat gaagagtttg tcgtctcatc cgacgacatc aagactgagg gcaagaaagg
2641 gaaaaacaaa tccggccgtg gcaagaagca cacagccttt tcaagtaaag gactcagtga
2701 tgaggagtac gatgagtaca agagaatcag agaagaaagg aatggtaagt actctataga
2761 agagtacctt caggacagag acaggtacta cgaggaggtg gccattgcca gggcaaccga
2821 agaggacttc tgtgaagaag aagaggccaa aatccggcag agaattttta gaccaacaag
2881 gaaacaacgc aaagaagaga gggcctctct cggcttggtc acaggctctg aaatcaggaa
2941 gagaaaccca gaagacttca aacccaaggg gaagctgtgg gctgatgatg acagaagtgt
3001 tgactacaat gagaaactca actttgaggc cccaccaagc atctggtcgc ggatagtcaa
3061 ctttggttca ggctggggct tctgggtctc ccccagtctg ttcataacat caacccatgt
3121 cataccccaa ggtgcaaaag agttcttcgg agtccccatc aagcaaatcc agatacacaa
3181 gtcaggtgaa ttctgccggt tgagattccc aaagccaatc agaactgatg tgtcgggcat
3241 gattctagaa gaaggtgcgc ccgaggggac cgtagccaca ctgcttatca agagaccaac
3301 tggagagctc atgcctctgg cagccagaat ggggacccat gcaaccatga aaattcaggg
3361 gcgcacagtt ggagggcaaa tgggtatgct cctgacagga tccaacgcca agagtatgga
3421 cctaggcaca acaccaggcg actgcggctg cccctacatc tacaagaggg ggaatgacta
3481 cgtggtcata ggagtccata cggccgctgc ccgtggagga aacactgtca tatgtgccac
3541 ccaggggagt gagggagaag ccacacttga aggaggtgac agtaaaggga catactgtgg
3601 cgcaccaatc ttgggcccag ggagtgctcc gaagctcagt accaagacta agttttggag
3661 atcgtccaca acaccactcc cacctggcac ctacgaacca gcctacctcg gtggcaaaga
3721 ccctagagtc aaaggtggcc cttcattgca acaagttatg agggaccagc tgaagccatt
3781 cacagaaccc agaggcaaac caccaagacc aaatgtgttg gaagctgcca agaaaaccat
3841 catcaatgtc cttgagcaaa caattgatcc accccaaaaa tggtcatttg cgcaagcttg
3901 cgcgtccctt gacaaaacca cctccagcgg ccacccgcat cacatgcgga aaaacgactg
3961 ttggaatggg gagtccttca cgggaaaatt ggctgatcaa gcctccaagg ccaacctaat
4021 gtttgaagag ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttggt
4081 aaagaccgat aaagtttatg gcaaggtcaa gaagaggctt ctgtggggtt cagatctggc
4141 gaccatggta cggtgcgccc gagcttttgg aggccttatg gatgaactca aggcacactg
4201 tgtcacactc cctgtcagag ttggtatgaa catgaatgag gatggcccca tcatctttga
4261 gaagcactcc agatacagat atcactatga tgctgattat tcccggtggg actcaacaca
4321 acaaagggat gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca
4381 cctggcccag atagttgcag aagacctcct ttcgcctagc gtgatggatg taggtgactt
4441 tcaaatatca ataagtgagg gtctcccatc tggggtacct tgtacctccc agtggaattc
4501 catcgcccac tggctcctca ctctgtgtgc actctctgaa gtcacggacc tgtcccccga
4561 tatcattcag gccaattccc tcttctcctt ctatggtgat gatgagattg taagcacaga
4621 cataaagttg gacccagaga agctgacagc aaagcttaag gagtacgggc tgaaaccaac
4681 ccgccccgac aaaactgaag gaccccttgt tatctctgaa gacctggatg gcctgacatt
4741 cctccggaga actgtgaccc gtgatccagc tggttggttt ggaaaattgg aacaaagttc
4801 aattctcagg caaatgtact ggaccagggg ccccaaccat gaagatccat ttgaaacgat
4861 gataccacac tcccaaagac ccatacaatt gatgtctttg ctgggcgagg ctgcactcca
4921 cggcccggca ttctatagca aaattagcaa actagtcatt gcagagttga aggaaggtgg
4981 catggatttt tacgtaccca gacaagagcc aatgttcaga tggatgagat tctcagatct
5041 gagtacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga
5101 gtgacgccaa cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg
5161 ttatggctct ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa
5221 atgtaattga cccctggatt aggaataatt ttgtacaagc ccctggtgga gagtttacag
5281 tatcccctag aaacgctcca ggtgaaatac tatggagcgc gcccttgggc cctgatctaa
5341 atccctacct atcccatttg gccaggatgt acaatggtta tgcaggtggt tttgaagtgc
5401 aggtaattct cgcggggaac gcgttcaccg ctgggaaggt catatttgca gcagtcccac
5461 caaattttcc aactgaaggc ttgagcccca gccaggtcac tatgttcccc catatagtag
5521 tagatgttag gcaactagaa cctgtgttga ttcccttacc cgatgttagg aataatttct
5581 atcattataa tcagtcaaat gatcccacca ttaagttgat agctatgttg tatacaccac
5641 ttagggctaa taatgctggg gatgacgtct tcacagtttc ttgccgagtt ctcacaagac
5701 catcccccga ttttgatttc atatttttag tgccacccac agttgagtca agaactaaac
5761 cattctctgt cccagtttta actgttgagg agatgaccaa ttcaagattc cccattcctt
5821 tggaaaagtt gttcacgggt cccagcagtg cctttgttgt ccaaccacaa aacggtaggt
5881 gcacgactga tggcgtgctt ctaggcacca cccaattgtc tcctgtcaac atctgcacct
5941 tcagaggaga tgtcacccat atcgcaggta gtcataacta cacaatgaat ttggcttctc
6001 aaaattggag caattatgac ccaacagaag aaatcccagc ccctctagga actccagatt
6061 ttgtgggaaa gattcaaggc gtgctcaccc aaaccacaag gacagatggc tcaacacgcg
6121 gccacaaagc cacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc
6181 aatttgtaac tgacacagac cgtgattttg aagctaacca aaacacaaag ttcaccccag
6241 ttggtgtcat ccaagatggt ggcaccaccc accgaaatga accccaacag tgggtgctcc
6301 caagttactc aggcagaaat actcctaatg tgcatctggc ccccgctgta gcccccactt
6361 ttccgggtga gcaacttctc ttcttcagat ccaccatgcc cggatgcagc gggtacccca
6421 acatggattt ggactgtctg ctcccccagg aatgggtgca gtacttctac caagaggcag
6481 ccccagcaca atctgatgtg gctctgctaa gatttgtaaa tccagacaca ggcagggttt
6541 tgtttgagtg taagcttcat aaatcaggct acgttacagt ggctcacact ggccaacatg
6601 atttggttat cccccccaat ggttatttta ggtttgattc ctgggtcaac cagttttaca
6661 cgcttgcccc catgggaaat ggaacggggc gtagacgtgc actataatgg ctggagcttt
6721 ctttgctggg ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg
6781 ggctggggcc atcaaccaaa aagttgagtt tgaaaataac agaaaattgc aacaagcatc
6841 cttccaattt agcagcaatc tacaacaggc ttcctttcaa catgacaaag agatgctcca
6901 agcacaaatt gaggccacca aaaggctaca acaggaaatg atgaaagtta agcaggcaat
6961 gctcctagag ggtgggttct ctgagacaga tgcagcccgc ggggcaatta acgcccccat
7021 gacaaaagtt ttggactgga acgggacaag gcactgggct cccgatgcta ggactacaac
7081 atacaatgcg ggccgctttt ccacccctca accatcgggg gcactgccag gaagagctaa
7141 ttttagggat gctgtccctg ctcggggttc ccccagtaag tcttctaatt cttctactgc
7201 tactcctgtg tactcaaatc aaactgtttc aacgagactt ggttctacag ctggctctgg
7261 taccagtgtc tcgagcttcc cgccaactgc aaggactagg agctgggttg aggatcaaag
7321 taggaatttg tcacctttca tgaggggggc ccacaacata tcgtttgtca ccccaccatc
7381 tagcagatcc tctagccaag gcacagtctc aaccgtgcct aaagagattt tggactcctg
7441 gactggcgct tttaacacgc gcaggcagcc actcttcgct cacattcgta agcgagggga
7501 gtcacgggcg taatgtgaaa agacaaaatt gattatcttt cttttcttta gtgtctttt
//