Complete norovirus genomes
| Accession | Genotype | Polymerase type (P-type) | ORF coordinates |
|---|---|---|---|
| PP549878 | GII.4 Sydney | GII.P16 | ORF1: 5..5104; ORF2: 5085..6707; ORF3: 6707..7510 |
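
The ORF coordinates above are 1-based and inclusive, so lengths and overlaps follow from simple arithmetic. The sketch below is only an illustration of that arithmetic and is not part of the typing tool. The names `ORFS`, `orf_length`, `overlap`, and `summarize` are hypothetical, and the coordinates are copied by hand from this entry.

```python
# ORF coordinates (1-based, inclusive), copied from the PP549878 entry.
# All names in this sketch are illustrative, not part of any tool or library.
ORFS = {"ORF1": (5, 5104), "ORF2": (5085, 6707), "ORF3": (6707, 7510)}


def orf_length(start, end):
    """Length in nucleotides of an inclusive 1-based interval."""
    return end - start + 1


def overlap(a, b):
    """Number of nucleotides shared by two inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def summarize(orfs):
    """Per-ORF length, codon count (stop codon included), and frame check."""
    rows = {}
    for name, (start, end) in orfs.items():
        nt = orf_length(start, end)
        rows[name] = {
            "nt": nt,
            "codons": nt // 3,
            "multiple_of_3": nt % 3 == 0,
        }
    return rows


summary = summarize(ORFS)
for name, row in summary.items():
    print(name, row)

# ORF1 and ORF2 share 20 nt; ORF2 and ORF3 share a single nucleotide.
print(overlap(ORFS["ORF1"], ORFS["ORF2"]))  # 20
print(overlap(ORFS["ORF2"], ORFS["ORF3"]))  # 1
```

With these coordinates, ORF1 spans 5100 nt, ORF2 1623 nt, and ORF3 804 nt, each a multiple of three.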
LOCUS PP549878 7562 bp RNA linear VRL 02-APR-2024
DEFINITION Norovirus GII isolate CU-PBH23566, complete genome.
ACCESSION PP549878
VERSION PP549878.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7562)
AUTHORS Doungngern,P., Pittayawong-anont,C., Kraipatanapong,S.,
Waikhruea,K., Menkoon,K., Rattanatumhi,K., Supataragul,A.,
Thippamom,N., Khunnawutmanotham,W., Hirunpatrawong,P.,
Wacharapluesadee,S., Sereewit,J., Sobolik,E.B., Roychoudhury,P.,
Greninger,A.L. and Putcharoen,O.
TITLE Direct Submission
JOURNAL Submitted (28-MAR-2024) Thai Red Cross Emerging Infectious Diseases
Clinical Center, King Chulalongkorn Memorial Hospital, Rama 4 Road,
Bangkok 10330, Thailand
COMMENT ##Assembly-Data-START##
Assembly Method :: Revica v. 04af024
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7562
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="CU-PBH23566"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Thailand"
/collection_date="2023-10-12"
/note="genotype: GII.4 Sydney[P16]"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="WYA90696.1"
/translation="MKMASNDATVAVACNNNNDKEKSSGEGLFTNMSSTLKKALGARP
KQPAPRDKPQKPPRPPTPELVKRIPPPPPNGEEEEEPVIRYEVKSGISGLPELTTVPQ
PDVANTAFSVPPLSLRENREAKEPLTGAILEMWDGEIYHYGLYVEKGLVLGVHKPPAA
ISMARVELTPLSLYWRVVYTPQYLISPETLRRLNGEAFPYTAFDNNCYAFCCWVLDLN
DSWLSRRMVQRTTGFFRPYQEWNRKPLPTMDDSKIKKVANIFLCSLSTLFTRPIKDLI
GKIKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGP
EDLAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWF
FPKKEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDMEEEKARRL
STKSASPDIVGTINALLARIAAARSLVHRAKEELSSRVRPVVVMISGKPGIGKTHLAR
EVARKVASTLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELAD
TCPLTLNCDRIENKGKVFDSEVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEV
EKAKRDFPGQPDMWKDAFKPDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTGSLIA
RASGLLHERMDEFELQGPTITTFNFDRNRIAAFRQLAAENKYGLVDTMKVGNQLKGVK
TMEELKQAIRNVTIKRCRIIYGGSTYDLESDGGGKVLVEKVKNTSVQTNNELAGALHH
LKHARIRYYVKCVQEAVYSIIQIAGAAFVTTRIARRMNIQELWSKPQLDQSEPETREE
ALKSEDDEFVISSKDIKEEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLV
TGSEIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFEAPPSIWSRIVSFGSGWGFWVSP
SLFITSTHVIPAGITEAFGVSIKQIQIHKSGEFCRFRFPKPIRPDVTGMILEEGAPEG
TVATVLIKRPTGELMPLAARMGTHATMKIQGRMVGGQMGMLLTGSNAKGMDLGTTPGD
CGCPYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGDDKGTYCGAPILG
PGGAPKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPSVLEAAKQTIINVLEQTLDPPQKWTYAQACASLDKTTSSGNPHHVRKNEFW
NGETFTGKLADQASKANLMFEEGKHMTPVYTAALKDELVKTEKIYGKIKKRLLWGSDL
STMVRCARSFGGLMDEMKAHCISLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD
STQQRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPSVVDVGDFKITINEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTKLSPDIIQANSMFSFYGDDEIVSTDIKLDPEQLTAKLK
EYGLKPTRPDKTEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGP
NHEDPNETMIPHSQRPIQLMALLGEASLHGPSFYSKISKLVITELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPNFVNEDGVE"
mat_peptide 5..1000
/gene="ORF1"
/product="p48"
mat_peptide 1001..2098
/gene="ORF1"
/product="NTPase"
mat_peptide 2099..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6707
/gene="ORF2"
CDS 5085..6707
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="WYA90697.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSHNFTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
gene 6707..7510
/gene="ORF3"
CDS 6707..7510
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="WYA90698.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMRVKQAVLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARITTYNAGHFPTPQPSGALPGRAIRDAVPARGPSTRLSN
SSTATSVHSNQTISTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNIS
FVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 gtgaatgaag atggcgtcta acgacgctac cgttgccgtt gcttgcaaca acaacaacga
61 caaggaaaaa tcttcaggtg aaggcttatt cacaaatatg tcttctacct taaagaaagc
121 cctcggggct aggcccaaac agcctgcccc gagagacaaa ccacaaaagc ccccaagacc
181 accaactcct gagttggtca agaggatacc ccctcctcca cctaatggcg aagaagaaga
241 agaaccagtc attaggtatg aggttaagag cggaatctct ggcctgcccg agctcacaac
301 agtcccccaa ccggacgtgg ccaacacagc attcagtgtt ccaccactga gcttgagaga
361 aaacagggag gccaaggaac cgctaacagg ggcaatatta gagatgtggg acggagagat
421 atatcactat ggcctatacg tggagaaagg cctagtgttg ggtgtgcaca aaccacctgc
481 agccataagc atggcaagag tggaactgac gccgctgtca ttatactggc gtgtagtgta
541 cactccccaa tacctcatct cccctgaaac tcttaggagg ctcaatgggg aggctttccc
601 ttacaccgcc tttgacaaca actgctatgc cttttgttgc tgggtgctag acctcaacga
661 ctcatggctt agcaggagaa tggtgcagag aacaacgggc ttcttcagac cttaccaaga
721 gtggaacaga aaacccctgc ctaccatgga tgactccaaa attaagaagg tggcaaatat
781 attcctatgt tcattgtcca cattattcac cagacccata aaagatctca taggaaaaat
841 taaaccatta aacatattga acattctggc aacgtgtgac tggacgtttg ccgggatagt
901 ggagtctcta atattacttg ctgaactctt cggagttttc tggacacccc cagatgtgtc
961 tgctatgatt gctcccttgc tcggggacta cgagttgcaa gggccagagg acctcgccgt
1021 tgaactcgtg cctgtggtaa tgggagggat tggtttggtg ttgggattca ccaaagagaa
1081 aattggcaaa atgttgtctt cagcagcatc aacactcagg gcttgcaaag atcttggtgc
1141 ctatggctta gagatactaa aattggtcat gaagtggttc ttcccaaaga aagaggaggc
1201 caatgagcta gccatggtga gggccataga ggatgctgta ctagatcttg aggcaataga
1261 aaataaccac atgacaaccc tgttgaaaga caaagacagc ttagcaacat acatgaagac
1321 actagacatg gaggaggaaa aagccagaag gttgtccaca aaatctgcat cccctgacat
1381 agttgggaca atcaacgccc tactggctcg aatagcagcg gccaggtcat tagtccacag
1441 ggccaaggaa gagctatcta gcagggtaag accagtagtt gttatgatat ctggcaaacc
1501 aggaataggc aaaactcatc tggccaggga ggtggcaaga aaggtggctt ccactctcac
1561 aggggaccag agagttggac tcataccaag aaacggtgtg gaccattggg atgcatacaa
1621 aggcgagaga gtcgtgctgt gggacgacta tggtatgagt aaccccatcc atgatgctct
1681 tcgcatacaa gaattggctg atacgtgtcc cctcacttta aattgtgaca gaattgaaaa
1741 caaaggaaaa gtttttgaca gtgaagtcat aataattaca acaaaccttg ccaacccagc
1801 cccacttgat tatgtcaact ttgaggcttg ttccaggaga attgacttcc tggtgtacgc
1861 tgaggcacca gaagtagaga aggcaaagcg agacttccct ggtcagccag atatgtggaa
1921 ggacgccttc aagccggact tttcacacat caagctgcaa cttgcacctc agggcggctt
1981 tgacaagaat ggcaacaccc cacatgggaa aggagtgatg aagaccctca ctaccggctc
2041 tctgatcgct cgtgcatcag gcctactgca tgagaggatg gatgaatttg aactccaagg
2101 ccccacaatc accaccttca atttcgaccg gaacagaatc gcagcattta ggcaactggc
2161 tgcagaaaac aagtatggat tagtggacac catgaaagtt ggcaatcaat taaaaggagt
2221 gaaaaccatg gaagaactca aacaagcaat taggaatgtg accatcaaga ggtgccggat
2281 catttacggt ggctctacat atgaccttga atctgacggc gggggcaaag ttttggtgga
2341 aaaggtcaag aacacctctg tacagaccaa caacgagttg gccggggcct tgcaccatct
2401 caaacacgcc cgaatcagat actatgtcaa atgcgtgcaa gaagcggtct attccatcat
2461 acaaattgcc ggcgctgcgt ttgtcaccac gcgtattgca cgccgcatga acatacaaga
2521 actctggtcg aagccacaat tagatcaaag tgaaccagag actagggaag aggcccttaa
2581 gtcagaagac gacgagttcg tcatatcttc taaggacatc aaggaggaag gaaagaaggg
2641 caagaataaa actggccgtg gcaagaaaca cactgcattc tccagcaagg gcttgagtga
2701 tgaggagtat gacgagtata agaggataag agaagagaga aatgggaagt actctataga
2761 ggagtatctt caagacagag acaggtacta tgaggagctc gccattgcca aggccacgga
2821 agaagacttc tgtgaagagg aggagattaa aatccgtcag agaattttcc gtcccaccag
2881 gaaacaaaga aaggaagaaa gggccacatt agggctagta acgggttcag aaatcagaaa
2941 gagaaaccct gatgacttca aacccaaagg gaagctgtgg gccgatgaca acagaagtgt
3001 tgactacaat gaaaaactgg actttgaggc ccccccaagc atatggtcta ggatcgtgag
3061 ctttggttct ggctggggct tctgggtatc accaagcctg tttataacat caactcatgt
3121 gatccccgca ggcataacag aagcatttgg agtctccatc aaacaaattc agatccacaa
3181 atcaggtgaa ttttgccggt tcagattccc aaaaccaatt agaccagatg ttacaggaat
3241 gatcttggaa gaaggtgcgc ctgagggtac cgtggcaact gtgctcatca aacgccccac
3301 cggagagctt atgcctcttg cagccagaat gggaacacac gcaaccatga aaattcaagg
3361 ccgcatggtt ggcgggcaga tgggtatgtt gctcactgga tcaaatgcta aaggaatgga
3421 tttggggaca acccctggtg actgtggctg cccctacatc tataaaagag gcaacgacta
3481 tatagtcatt ggggtgcata ctgcagcagc ccgtggtggg aatactgtca tctgtgccac
3541 acagggaagt gagggtgagg caactcttga gggtggagat gacaaaggaa catactgtgg
3601 ggcacccatt ctgggccctg ggggcgcacc aaaattgagc accaaaacca aattttggag
3661 gtcatcgaac acgccccttc caccagggac gtatgagcct gcctacctcg gcggccgtga
3721 tccacgtgtc aagggcgggc cctccttgca gcaggtaatg agagatcagt tgaagccatt
3781 cactgaaccc aggggcaaac ctccaagacc aagtgtatta gaagcagcca aacaaaccat
3841 catcaatgtc ctcgaacaaa ccctggatcc tccacaaaaa tggacatacg cacaggcgtg
3901 tgcctcactt gacaaaacca cctccagcgg gaacccccat cacgtccgaa agaatgaatt
3961 ctggaatggt gagaccttta ctggaaaatt ggcagaccaa gcatcaaaag caaatctaat
4021 gtttgaggaa gggaaacata tgacaccagt gtatacagca gcactcaagg acgagctcgt
4081 taagactgag aaaatctatg gaaagattaa gaagaggctg ctctggggct ctgacttgtc
4141 caccatggtc cggtgtgcta ggtcatttgg tgggcttatg gacgagatga aggcacactg
4201 catatcactc ccagtccgag ttggcatgaa tatgaatgaa gatggcccaa taatatttga
4261 gaaacattcc agatataagt accactatga tgcagactac tctcgttggg attcaacaca
4321 acagagggca gtactggcag cagccttgga aatcatggtc agattctctg cagaaccaca
4381 actggcacaa atagtcgctg aggatctgct ggcccctagt gtagtagatg taggagactt
4441 caaaatcaca ataaatgaag ggctcccctc tggtgttcca tgcacttctc aatggaattc
4501 cattgcacac tggctactaa ctctctgtgc cttgtctgaa gtcaccaaac tgtcccctga
4561 cattatacag gcaaattcca tgttctcatt ttacggtgat gacgagattg tcagcaccga
4621 cataaaatta gatcctgaac agttaaccgc caaattgaag gagtacggtc tgaaaccaac
4681 ccgcccagac aaaaccgagg gacccctgat catcagtgaa gatttggacg gactcacttt
4741 cctccgaagg acggtgactc gcgacccagc tggttggttt ggtaaactgg accagagctc
4801 cattttgagg cagatgtact ggactagagg gccaaatcat gaagacccca atgagacaat
4861 gataccccat tctcagagac ccatacaact catggcactg cttggtgaag cttcccttca
4921 cgggccctct ttctatagta aaatcagcaa attggtcata actgaactca aggaaggtgg
4981 aatggacttt tacgtgccaa gacaggaacc catgttcagg tggatgaggt tttctgactt
5041 gagcacgtgg gagggcgatc gcaatctggc tcccaatttt gtgaatgaag atggcgtcga
5101 gtgacgccaa cccatctgat gggtccgcag ccaacctcgt accagaggtc aacaatgagg
5161 ttatggcttt ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa
5221 atgtaattga cccctggatt agaaataatt ttgtacaagc ccctggtggg gagttcacag
5281 tatcccctag aaatgctcca ggtgaaatac tatggagcgc gcccctaggc cctgacctaa
5341 atccctacct atcccatttg gccagaatgt acaatggtta tgcaggtggc tttgaagtgc
5401 aggtaattct cgcggggaac gcgttcaccg ctgggaagat tatatttgca gcagtcccac
5461 caaattttcc aactgaaggc ttaagcccta gccaggtcac tatgttccct catataatag
5521 tagatgttag acaattggaa cctgtgctga tccccttacc cgatgttagg aataatttct
5581 atcattacaa tcagtcaaat gactccacta ttaagttgat agcaatgttg tacacaccac
5641 ttagggctaa taatgctgga gatgatgttt tcacagtctc gtgccgagtt ctcacgagac
5701 catcccccga ttttgatttc atattcttag taccacccac agttgagtca agaactaaac
5761 cattctccgt cccagtttta actgttgagg agatgaccaa ttcaagattc cccatccctt
5821 tggaaaagtt gttcacaggc cccagcagtg catttgttgt tcaaccacaa aacggcagat
5881 gcacaactga tggcgtgctc ctaggcacca cccaactttc tcctgtcaac atctgcacct
5941 tcagagggga tgtcacccac atcacaggta gccacaactt cactatgaat ttggcttctc
6001 aaaattggaa caactacgac ccaacagaag aaatcccagc tcccctagga actccagatt
6061 ttgtggggaa gattcaaggc atgctcaccc aaaccacaag gacagatggt tcaacacgcg
6121 gccacaaagc tacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc
6181 aatttgaaac tgacacaaac catgattttg aagctaatca aaacacaaag ttcaccccag
6241 tcggtgttat ccaagatggt agcaccaccc accgaaacga accccaacag tgggtgcttc
6301 caagttactc aggcagaaac actcacaatg tacatttggc ccccgctgta gcccccacct
6361 ttccgggtga gcaacttctc ttctttagat ctaccatgcc cggatgtagc gggtacccca
6421 acatggattt ggactgtctg ctcccccagg aatgggtgca gtatttctac caagaggcag
6481 ccccagcaca atctgatgtg gccctgctaa gatttgtgaa cccagacaca ggtagggttt
6541 tgtttgagtg caagcttcac aaatcaggct atgttacagt ggctcacact ggccaacatg
6601 atttggttat cccccccaat ggttacttta ggtttgattc ctgggtcaac cagttctaca
6661 cgcttgcccc catgggaaat ggaacggggc gtagacgtgc agtataatgg ctggagcttt
6721 ctttgctgga ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg
6781 ggctggggcc atcaaccaaa aagttgagtt tgaaaataat agaaaattgc aacaagcatc
6841 cttccaattt agcagtaatt tgcaacaggc ttcctttcaa catgataaag agatgctcca
6901 agcacaaatt gaggccacca aaaagctaca acaggaaatg atgagagtta agcaggcagt
6961 gctcctagag ggtgggttct ctgagacaga tgcggcccgc ggggcaatta acgcccccat
7021 gacaaaggct ttggactgga gcgggacaag gtactgggct ccagatgcta gaattacaac
7081 atacaatgca ggccactttc ccacccctca accatcgggg gcactaccag gaagagctat
7141 tagggatgct gtccctgctc ggggaccctc caccagactt tctaactcct ctactgccac
7201 ctctgtgcat tcaaatcaaa ctatttcaac gagacttggt tctacagctg gttctggaac
7261 cagtgtctcg agcctcccgt caactgcaag gactaggagc tgggttgagg atcaaaatag
7321 gaatttatct cctttcatga ggggggccca caacatatca tttgtcaccc caccatctag
7381 cagatcctct agccaaggca cagtctcaac cgtgcctaaa gaaattttgg actcctggac
7441 tggcgctttt aacacgcgca ggcagcctct cttcgcccac attcgtaagc gaggggagtc
7501 acgggtgtaa tgtgaaaaga caaaactgat tatttttctt tttctttagt gtcttttaaa
7561 aa
//
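
As a rough consistency check between the ORIGIN block and the annotated CDS, the sketch below joins the first two ORIGIN lines of this record, translates from position 5 (the annotated start of ORF1) with the standard genetic code, and compares the result with the start of the ORF1 `/translation` qualifier. It is a minimal sketch under stated assumptions, not a full GenBank parser. The helper names `parse_origin` and `translate` are hypothetical, and only a two-line excerpt of the sequence is used.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_origin(lines):
    """Join ORIGIN lines into one uppercase sequence.

    The first token on each line is the running position and is dropped.
    """
    chunks = []
    for line in lines:
        chunks.extend(line.split()[1:])
    return "".join(chunks).upper()


def translate(seq, start):
    """Translate from a 1-based start until a stop codon or the last full codon."""
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two ORIGIN lines of PP549878, copied from the record above.
ORIGIN_EXCERPT = [
    "1 gtgaatgaag atggcgtcta acgacgctac cgttgccgtt gcttgcaaca acaacaacga",
    "61 caaggaaaaa tcttcaggtg aaggcttatt cacaaatatg tcttctacct taaagaaagc",
]
# Start of the ORF1 /translation qualifier, copied from the record above.
ORF1_TRANSLATION_PREFIX = "MKMASNDATVAVACNNNNDKEKSSGEGLFTNMSSTLKKALGARP"

seq = parse_origin(ORIGIN_EXCERPT)
peptide = translate(seq, start=5)
print(peptide)  # MKMASNDATVAVACNNNNDKEKSSGEGLFTNMSSTLKK
print(ORF1_TRANSLATION_PREFIX.startswith(peptide))  # True
```

The same approach extends to the full sequence and to ORF2 and ORF3 by passing their start positions (5085 and 6707).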