Typing tool
Complete norovirus genomes
| Accession | Genotype (capsid) | P-type (polymerase) |
|---|---|---|
| PP549884 | GII.4 Sydney | GII.P16 |

| ORF | Location (nt) |
|---|---|
| ORF1 | 5..5104 |
| ORF2 | 5085..6707 |
| ORF3 | 6707..7513 |
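
The ORF coordinates listed here can be sanity-checked with a little arithmetic. The following is a minimal sketch, not part of the typing tool: the names `ORFS`, `orf_summary`, and `overlap` are hypothetical, and it assumes each range is 1-based, inclusive, and ends with a stop codon (as is usual for a complete CDS).

```python
# ORF coordinates copied from the summary (1-based, inclusive).
ORFS = {"ORF1": (5, 5104), "ORF2": (5085, 6707), "ORF3": (6707, 7513)}


def orf_summary(start, end):
    """Return nucleotide length, protein length, and 0-based reading frame.

    Assumption: the range includes a terminal stop codon, so the protein
    length is one codon shorter than the number of codons in the range.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return {"nt": length, "aa": length // 3 - 1, "frame": (start - 1) % 3}


def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


summaries = {name: orf_summary(*coords) for name, coords in ORFS.items()}

for name, info in summaries.items():
    print(name, info)

print("ORF1/ORF2 overlap (nt):", overlap(ORFS["ORF1"], ORFS["ORF2"]))
print("ORF2/ORF3 overlap (nt):", overlap(ORFS["ORF2"], ORFS["ORF3"]))
```

Under these assumptions the sketch reports a 20-nt overlap between ORF1 and ORF2 and a 1-nt overlap between ORF2 and ORF3, and the computed protein lengths can be compared against the `/translation` qualifiers in the record.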
LOCUS PP549884 7569 bp RNA linear VRL 02-APR-2024
DEFINITION Norovirus GII isolate CU-PBH24004, complete genome.
ACCESSION PP549884
VERSION PP549884.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7569)
AUTHORS Doungngern,P., Pittayawong-anont,C., Kraipatanapong,S.,
Waikhruea,K., Menkoon,K., Rattanatumhi,K., Supataragul,A.,
Thippamom,N., Khunnawutmanotham,W., Hirunpatrawong,P.,
Wacharapluesadee,S., Sereewit,J., Sobolik,E.B., Roychoudhury,P.,
Greninger,A.L. and Putcharoen,O.
TITLE Direct Submission
JOURNAL Submitted (28-MAR-2024) Thai Red Cross Emerging Infectious Diseases
Clinical Center, King Chulalongkorn Memorial Hospital, Rama 4 Road,
Bangkok 10330, Thailand
COMMENT ##Assembly-Data-START##
Assembly Method :: Revica v. 04af024
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7569
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="CU-PBH24004"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Thailand"
/collection_date="2023-12-29"
/note="genotype: GII.4 Sydney[P16]"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="WYA90714.1"
/translation="MKMASDDATVAVACNNNNDKEKSSGEGLFTNMSSTLKKALGARP
KQPAPRDKPQKPPRPPTPELVKRIPPPPPNGEEEEEPVIRYEVKSGISGLPELTTVPQ
PDVANTAFSVPPLSLRENREAKEPLTGAILEMWDGEIYHYGLYVEKGLVLGVHKPPAA
ISMARVELAPLSLYWRVVYTPQYLISPETLRRLNGEAFPYTAFDNNCYAFCCWVLDLN
DSWLSRRMVQRTTGFFRPYQEWNRKPLPTMDDSKIKKVANIFLCSLSTLFTRPIKDLI
GKIKPLNILNILATCDWTFAGIVESLILFAELFGVFWTPPDVSAMIAPLLGDYELQGP
EDLAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWF
FPKKEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDMEEEKARRL
STKSASPDIVGTINALLARIAAARSLVHRAKEELSSRIRPVVVMISGKPGIGKTHLAR
EVARKVASTLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELAD
TCPLTLNCDRIENKGKVFDSEVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEV
EKAKRDFPGQPDMWKDAFKPDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTGSLIA
RASGLLHERMDEFELQGPTITTFNFDRNRIAAFRQLAAENKYGLVDTMKVGNQLKGVK
TMEELKQAIRNVTIKRCRIIYGGSTYDLESDGKGKVLVEKVKNTSVQTNNELAGALHH
LKHARIRYYVKCVQEAVYSIIQIAGAAFVTTRIARRMNIQDLWSKPQLDQSEPETKEE
ALKSEDDEFVISSKDIKEEGKKGKNKTGRGRKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLV
TGSEIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFEAPPSIWSRIVSFGSGWGFWVSP
SLFITSTHVIPAGITEAFGVSIKQIQIHKSGEFCRFRFPKPIRPDVTGMILEEGAPEG
TVATVLIKRPTGELMPLAARMGTHATMKIQGRMVGGQMGMLLTGSNAKGMDLGTTPGD
CGCPYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGDDKGTYCGAPILG
PGGAPKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPSVLEAAKQTIINVLEQTLDPPQKWTYAQACASLDKTTSSGNPHHVRKNEFW
NGETFTGKLADQASKANLMFEEGKHMTPVYTAALKDELVKTEKIYGKIKKRLLWGSDL
STMVRCARSFGGLMDEMKAHCISLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD
STQQRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPSVVDVGDFKITINEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTKLSPDIIQANSMFSFYGDDEIVSTDIKLDPEQLTAKLK
EYGLKPTRPDKTEGPLIISEDLDGLTFLRRTVSRDPAGWFGKLDQSSILRQMYWTRGP
NHEDPSETMIPHSQRPIQLMALLGEASLHGPSFYSKISKLVITELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPNFVNEDGVE"
mat_peptide 5..1000
/gene="ORF1"
/product="p48"
mat_peptide 1001..2098
/gene="ORF1"
/product="NTPase"
mat_peptide 2099..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6707
/gene="ORF2"
CDS 5085..6707
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="WYA90715.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVEVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHTTGSHNFTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFGTDTDHDLEANQNTKFTPVGVIQDG
GTTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
gene 6707..7513
/gene="ORF3"
CDS 6707..7513
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="WYA90716.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVDFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARFTTYNAGHFSTPQPSGALPGRANFGDAVPARGPSTRPS
SSSTATSVYPNQTISTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 gtgaatgaag atggcgtctg acgacgctac cgttgccgtt gcttgcaaca acaacaacga
61 caaggaaaaa tcttcaggtg aaggcttatt cacaaatatg tcttccacct taaagaaagc
121 cctcggggct aggcccaaac agcctgcccc gagagacaaa ccacaaaagc ccccaagacc
181 accgactcct gagttggtca agaggatacc ccctcctcca cctaatggcg aagaagaaga
241 agaaccagtc attaggtatg aggttaagag cgggatctct ggcctgcccg agctcacaac
301 agtcccccaa ccggacgtgg ccaacacagc attcagtgtt ccaccactga gcttgaggga
361 aaacagggag gccaaggaac cgctaacagg ggcaatatta gagatgtggg atggagagat
421 atatcactat ggcctatacg tggagaaagg tctagtgttg ggtgtgcaca aaccacctgc
481 agccataagc atggcaagag tggaactggc gccgctgtca ttatactggc gtgtagtgta
541 cactccccaa tacctcatct cccctgaaac tcttaggagg ctcaacggag aggctttccc
601 ttacaccgcc tttgacaaca actgctatgc cttttgttgc tgggtgctag acctcaacga
661 ctcatggctt agcaggagaa tggtgcagag aacaacgggc ttcttcagac cttaccaaga
721 gtggaacaga aaacccctgc ctaccatgga tgactccaaa attaagaagg tggcaaatat
781 attcctatgt tcattgtcca cattattcac cagacccata aaagatctca taggaaaaat
841 taaaccacta aacatattga acattctggc aacgtgtgac tggacgtttg ccgggatagt
901 ggagtctcta atattatttg ctgaactctt tggagttttc tggacacccc cagatgtgtc
961 tgctatgatc gctcccttgc tcggggacta cgagttgcag gggccagagg acctcgccgt
1021 tgaactcgtg cctgtggtaa tgggagggat tggtttggtg ttgggattca ccaaagagaa
1081 aattggcaaa atgttgtctt cagcagcgtc aacactcagg gcttgcaaag atcttggtgc
1141 ctatggctta gagatactaa aactggtcat gaagtggttc ttcccaaaga aagaggaggc
1201 caatgagcta gccatggtga gggccataga ggatgctgta ctagatcttg aggcaataga
1261 aaataaccac atgacaaccc tgttgaaaga caaagacagc ttagcaacat acatgaagac
1321 actagacatg gaggaggaga aagccagaag gttgtccaca aaatctgcat cccctgacat
1381 agttgggaca atcaacgccc tactagctcg aatagcagcg gccaggtcat tagtccacag
1441 ggccaaggaa gagctatcta gcaggataag accagtagtt gttatgatat ctggcaaacc
1501 aggaataggc aaaactcatc tggccaggga ggtggcaaga aaggtggctt ccactctcac
1561 aggggaccag agagtcggac tcataccaag aaacggtgtg gaccattggg atgcatacaa
1621 aggcgagaga gtcgtgctgt gggacgacta tggcatgagt aaccccatcc atgatgctct
1681 tcgcatacaa gaattggctg acacgtgtcc cctcacctta aattgtgaca gaattgaaaa
1741 caaaggaaaa gtttttgaca gtgaagtcat aataattaca acaaaccttg ccaacccagc
1801 cccacttgat tatgtcaact ttgaggcttg ttccaggaga attgacttcc tggtgtacgc
1861 tgaggcacca gaagtggaga aggcaaagcg ggacttccct ggtcagccag atatgtggaa
1921 ggacgccttc aagccggact tttcacacat caagctgcaa cttgcacctc agggcggctt
1981 tgacaagaat ggcaacaccc cacatgggaa aggagtgatg aagaccctca ctaccggctc
2041 tctgatcgct cgtgcatcag gcctactgca tgagaggatg gatgaatttg aactccaagg
2101 ccccacaatc accaccttca atttcgaccg gaacagaatc gcagcattca gacaactggc
2161 tgcagaaaac aagtatggat tagtggatac catgaaagtt ggcaatcaat taaaaggagt
2221 gaaaaccatg gaagaactta aacaagcaat taggaatgtg accatcaaga ggtgccggat
2281 catttacggt ggctctacat atgaccttga atctgacggc aagggcaaag ttttggtgga
2341 aaaggtcaag aacacctctg tacagaccaa caacgagttg gccggggcct tacaccatct
2401 caaacacgcc cgaatcagat actatgtcaa atgcgtgcaa gaagcagtct attccatcat
2461 acagattgcc ggcgctgcgt ttgtcaccac gcgcattgca cgccgcatga acatacaaga
2521 tctctggtcg aagccacaat tagatcaaag tgaaccagag actaaggaag aggccctcaa
2581 gtcagaagac gacgagttcg tcatatcttc taaggacatc aaggaggaag gaaagaaggg
2641 caaaaataaa actggccgtg gcaggaaaca cactgcattc tccagcaagg gcttgagtga
2701 tgaggagtat gacgaatata agaggataag agaagagaga aatgggaagt actctataga
2761 ggagtatctt caagacagag acaggtacta tgaggagctc gccattgcca aggccacgga
2821 agaagacttc tgtgaagagg aggagatcaa aatccgtcag agaattttcc gtcccaccag
2881 gaaacaaaga aaggaagaga gggccacatt agggctagta acgggttcag aaatcagaaa
2941 gagaaaccct gatgacttca aacccaaagg gaagctgtgg gccgatgaca acagaagtgt
3001 tgactacaat gaaaaactgg actttgaggc ccccccaagc atatggtcta ggatcgtgag
3061 ctttggttct ggctggggct tctgggtatc accaagcctg tttataacat caactcatgt
3121 gatccccgca ggcataacag aagcatttgg agtctccatt aaacaaattc agatccacaa
3181 atcaggtgaa ttttgccggt tcagattccc aaaaccaatt agaccagatg tcacaggaat
3241 gatcttggaa gaaggtgcgc ctgagggtac cgtggcaact gtgctcatca aacgccccac
3301 cggagagctc atgcctcttg cagccagaat gggaacacac gcaaccatga aaattcaagg
3361 ccgcatggtt ggcgggcaga tgggtatgtt gctcactgga tcaaatgcta agggaatgga
3421 tttggggaca acccctggtg attgtggctg cccctacatc tataaaagag gcaacgacta
3481 tatagtcatt ggggtgcaca ctgcagcagc ccgtggtgga aacaccgtca tctgtgccac
3541 acagggaagt gagggtgagg caactcttga gggtggagat gacaaaggaa catactgtgg
3601 ggcacctatt ctgggccctg ggggcgcacc aaaattgagc accaaaacca aattttggag
3661 atcatcgaac acgccccttc caccagggac gtacgagcct gcctacctcg gcggccgtga
3721 tccacgtgtc aagggcgggc cctccttgca gcaggtaatg agagatcagt tgaagccatt
3781 cactgaaccc aggggcaaac ctccaagacc aagtgtatta gaagcagcca aacaaaccat
3841 catcaatgta ctcgaacaaa ccctggatcc tccacaaaaa tggacatacg cacaggcgtg
3901 tgcctcactt gacaaaacca cctccagcgg gaacccccat cacgtccgaa agaatgaatt
3961 ctggaatggt gagaccttta ctggaaaatt ggcagaccaa gcatcaaaag caaatctaat
4021 gtttgaggaa gggaaacaca tgacaccagt gtatacagca gcactcaagg acgagctagt
4081 taagaccgag aaaatctatg gaaagatcaa gaagaggctg ctctggggct ctgacttgtc
4141 caccatggtc cggtgtgcta ggtcatttgg tgggctcatg gacgagatga aggcacactg
4201 catatcactc ccagtccgag ttggcatgaa tatgaatgaa gatggcccaa taatatttga
4261 gaaacattcc agatataagt atcactatga tgcagactac tctcgttggg attcaacaca
4321 acagagggca gtactggcag cagccttgga aatcatggtc aggttctctg cagaaccaca
4381 actggcacaa atagtcgctg aggatctgct ggcccctagt gtagtagatg taggagactt
4441 caaaattaca ataaatgaag ggctcccctc tggtgttcca tgcacttctc aatggaactc
4501 cattgcacac tggctactaa ctctctgtgc cttgtctgaa gtcaccaaat tgtcccctga
4561 cattatacag gcaaattcca tgttctcatt ttacggtgat gacgagattg tcagcaccga
4621 cataaaatta gaccctgaac agttaaccgc caaattgaag gagtacggcc tgaaaccaac
4681 ccgcccagac aaaactgagg gacccctgat catcagtgaa gatttggacg gactcacttt
4741 cctccgaagg acggtgtctc gtgacccagc tggctggttt ggtaaactgg accagagctc
4801 cattttgagg cagatgtact ggactagagg gccaaatcat gaagacccca gtgagacaat
4861 gataccccat tctcagagac ccatacagct tatggcactg cttggtgaag cctctcttca
4921 cggaccctct ttctatagta aaatcagcaa attggtcata actgaactca aggaaggtgg
4981 gatggacttt tacgtgccaa gacaggaacc catgttcagg tggatgaggt tttctgactt
5041 gagcacgtgg gagggcgatc gcaatctggc tcccaatttt gtgaatgaag atggcgtcga
5101 gtgacgccaa cccatctgat gggtccgcag ccaacctcgt accagaggtc aacaatgagg
5161 ttatggcttt ggagcccgtt gtcggtgccg ctattgcggc acctgtagcg ggccaacaaa
5221 atgtaattga cccctggatt agaaataatt ttgtacaagc ccctggtggg gagtttacag
5281 tatcccctag aaatgctcca ggtgaaatac tatggagcgc gcccctaggc cctgatctaa
5341 atccctacct atcccatttg gccaggatgt ataatggtta tgcaggtggc tttgaagtgg
5401 aggtaattct cgcggggaac gcgttcaccg ctgggaagat tatatttgca gcagtcccac
5461 caaattttcc aactgaaggc ttaagcccta gtcaggtcac tatgttcccc catataatag
5521 tagatgttag acaattggaa cctgttctga tccccttacc cgacgttagg aataatttct
5581 atcattataa tcagtcaaat gactccacta ttaagttgat agcaatgttg tacacaccac
5641 ttagggctaa taatgctgga gatgatgttt tcacagtctc gtgccgagtt ctcacgagac
5701 catcccccga ttttgatttc atattcttag taccacccac agttgagtca aggactaaac
5761 cattctccgt tccagtttta actgttgagg agatgaccaa ttcaagattc cccatccctt
5821 tggaaaagtt gttcacaggc cccagcagtg catttgttgt tcaaccacaa aacggcagat
5881 gcacaactga tggcgtgctc ctaggcacca cccaactttc tcctgtcaac atctgcacct
5941 tcagagggga tgtcacccac accacaggca gtcacaactt cactatgaat ttggcttctc
6001 aaaattggaa caactacgac ccaacagaag aaatcccagc tcccctagga actccagatt
6061 ttgtggggaa gattcaaggc atgctcaccc aaaccacaag gacagatggc tcaacacgcg
6121 gccacaaagc tacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc
6181 aatttggaac tgacacagac catgatcttg aagctaatca aaacacaaag ttcactccag
6241 tcggtgttat ccaagatggt ggcaccaccc accgaaacga accccaacag tgggtgcttc
6301 caagttactc aggcagaaat actcataatg tacatttggc ccccgctgta gcccccacct
6361 ttccgggtga gcaacttctc ttctttagat ctaccatgcc cggatgtagc gggtacccca
6421 acatggattt ggactgtcta ctcccccagg aatgggtgca gtatttctac caagaggcag
6481 ccccagcaca atctgatgtg gctctgctaa gatttgtgaa cccagacaca ggtagggttt
6541 tatttgagtg caagcttcac aaatcaggct atgttacagt ggctcacact ggccaacatg
6601 atttggttat cccccccaat ggctacttta ggtttgattc ctgggtcaac cagttctaca
6661 cgcttgcccc catgggaaat ggaacggggc gtagacgtgc agtataatgg ctggagcttt
6721 ctttgctgga ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg
6781 ggctggggcc atcaaccaaa aagttgattt tgaaaataac agaaaattgc aacaagcatc
6841 cttccaattt agcagcaatt tgcaacaggc ttcctttcaa catgataaag agatgctcca
6901 agcacaaatt gaggccacca aaaagctaca acaggaaatg atgagagtta agcaggcaat
6961 gctcctagag ggtgggttct ctgagacaga tgcggcccgc ggggcaatta acgcccccat
7021 gacaaaggct ttggactgga gcgggacaag gtactgggct ccagatgcta gatttacaac
7081 atataatgca ggccactttt ccacccctca accatcgggg gcactgccag gaagagctaa
7141 ttttggggat gctgtccctg ctcggggacc ctccaccaga ccttctagtt cttctactgc
7201 cacctctgtg tatccaaatc aaactatttc aacgagactt ggttctacag ctggttctgg
7261 aaccagtgtc tcgagcctcc cgtcaactgc aaggactagg agctgggttg aggatcaaaa
7321 taggaatttg tctcctttca tgaggggggc ccacaacata tcatttgtca ccccaccatc
7381 tagcagatcc tctagccaag gcacagtctc aaccgtgcct aaagaaattt tggactcctg
7441 gactggcgct ttcaacacgc gcaggcagcc tctcttcgct cacattcgta agcgagggga
7501 gtcacgggtg taatgtgaaa agacaaaact gatcattttc tttttctttt ctttagtgtc
7561 ttttaaaaa
//
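
The `mat_peptide` features in the record above should partition the ORF1 polyprotein into its cleavage products. The sketch below checks that they do; it is illustrative only, the helper name `check_tiling` is hypothetical, and it assumes the ORF1 range ends with a 3-nt stop codon that no mature peptide covers.

```python
# Coordinates copied from the FEATURES table (1-based, inclusive).
ORF1 = (5, 5104)
MAT_PEPTIDES = [
    ("p48", 5, 1000),
    ("NTPase", 1001, 2098),
    ("p22", 2099, 2629),
    ("VPg", 2630, 3028),
    ("Pro", 3029, 3571),
    ("RdRp", 3572, 5101),
]


def check_tiling(cds, peptides):
    """Return each peptide's length in amino acids.

    Raises ValueError if the peptides leave a gap, overlap, are not a
    whole number of codons, or do not end exactly one codon (the assumed
    stop codon) before the end of the CDS.
    """
    cds_start, cds_end = cds
    expected_start = cds_start
    lengths = {}
    for name, start, end in peptides:
        if start != expected_start:
            raise ValueError(f"{name} starts at {start}, expected {expected_start}")
        nt = end - start + 1
        if nt % 3 != 0:
            raise ValueError(f"{name} is not a whole number of codons")
        lengths[name] = nt // 3
        expected_start = end + 1
    if expected_start != cds_end - 2:
        raise ValueError("peptides do not end right before the stop codon")
    return lengths


lengths = check_tiling(ORF1, MAT_PEPTIDES)
for name, aa in lengths.items():
    print(f"{name}: {aa} aa")
print("total:", sum(lengths.values()), "aa")
```

The six products are contiguous and in frame, and their combined length equals the polyprotein length implied by the ORF1 coordinates.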