Typing tool

Complete norovirus genomes

| Accession | Genotype (variant) | P-type |
|---|---|---|
| PP658551 | GII.4 San Francisco | GII.P31 |

ORF1: 5..5104
ORF2: 5085..6710
ORF3: 6710..7516
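
The ORF coordinates above are 1-based and inclusive, as in GenBank feature locations. As a minimal sketch (the helper names are illustrative and not part of the typing tool), the following Python derives each ORF's nucleotide length, the expected protein length, the reading frame, and the overlap between neighbouring ORFs. It assumes each ORF ends with a single stop codon that is included in the listed range.

```python
# ORF coordinates as listed above (1-based, inclusive, GenBank style).
ORFS = {
    "ORF1": (5, 5104),
    "ORF2": (5085, 6710),
    "ORF3": (6710, 7516),
}


def orf_length_nt(start, end):
    """Number of nucleotides covered by an inclusive 1-based range."""
    return end - start + 1


def protein_length_aa(start, end):
    """Amino acids encoded, assuming the range ends with one stop codon."""
    nt = orf_length_nt(start, end)
    if nt % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a multiple of 3")
    return nt // 3 - 1


def reading_frame(start):
    """Frame 0, 1 or 2 relative to the first base of the genome."""
    return (start - 1) % 3


def overlap_nt(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        print(name, orf_length_nt(start, end), "nt,",
              protein_length_aa(start, end), "aa, frame", reading_frame(start))
    print("ORF1/ORF2 overlap:", overlap_nt(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap_nt(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

Under these assumptions the sketch gives 1699, 541 and 268 amino acids for ORF1, ORF2 and ORF3, which matches the lengths of the `/translation` strings in the record, and it reports a 20 nt overlap between ORF1 and ORF2 and a 1 nt overlap between ORF2 and ORF3.
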
LOCUS PP658551 7539 bp RNA linear VRL 01-JUN-2024
DEFINITION Norovirus GII isolate PBH23139-STN nonstructural polyprotein
(ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds.
ACCESSION PP658551
VERSION PP658551.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7539)
AUTHORS Doungngern,P., Pittayawong-anont,C., Kraipatanapong,S.,
Waikhruea,K., Menkoon,K., Rattanatumhi,K., Supataragul,A.,
Thippamom,N., Khunnawutmanotham,W., Hirunpatrawong,P.,
Wacharapluesadee,S., Sereewit,J., Sobolik,E.B., Roychoudhury,P.,
Greninger,A.L. and Putcharoen,O.
TITLE Direct Submission
JOURNAL Submitted (13-APR-2024) Virology, University of Washington, 850
Republican St, Seattle, WA 98109, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Geneious v. 2023; metaspades v. 2024
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7539
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="PBH23139-STN"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Thailand"
/collection_date="2023-03-01"
/note="genotype: GII"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="WZF77154.1"
/translation="MKMASNDASAAAAANSNNDIEKSSSDGVFSNMAVTFKRALGARP
KQPPPREKPPRPPRPPTPELVKRIPPPPPNGEDELVVSYSAKDGVSGLPELTTVSQPE
ENNTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
LAKVELAPLSLFWRPVYTPQYLISPDTLKRLHGESFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQEWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
LKPLNILNILATCDWTFPGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
LAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEETNELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMKTLDLEEEKARKLST
KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLAREL
AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPGVEK
AKHDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGPALTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGKQLKDVKTM
PDLKQALKNVAIKKCQIVYNGGTYTLEADGKGGVRVDKVQSATVQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETTSKDGC
PKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
TVATLLIKRPTGELMPLAARMGTHATMRIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILG
PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD
STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 5..994
/gene="ORF1"
/product="p48"
mat_peptide 995..2092
/gene="ORF1"
/product="NTPase"
mat_peptide 2093..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6710
/gene="ORF2"
CDS 5085..6710
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="WZF77155.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRSNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
VRNNFYHYNQSSDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPILTVEEMTNSRFPIPLEKLFTGPSSTFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGSVTQTAAGTHNYTMNLASQNWNSYDPTEEIPAPLGTPDFVGKIRGM
LTQTTRGNGSTRGHIATVYTGSNDFAPKLGRVQFETDTTNDFETNQNTKFTPVGVIQD
GDTAHRSEPLQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTLPGCSGYPNMDL
DCLLPQEWVQYFYQEAAPAQSEVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDL
VIPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6710..7516
/gene="ORF3"
CDS 6710..7516
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="WZF77156.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKIEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAVLLEGGFSETDAARGAIN
APMTKTLDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRNAVPARGSSSTPS
NSSIAISVHSNQTASTRLGSTAGSGTSVSSFPSTARTRSWVEDQNRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTHRQPLFAHIRRRGESRV"
ORIGIN
1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gctgctaaca gcaacaacga
61 catcgaaaaa tcttcaagtg acggtgtgtt ttctaacatg gctgtcactt ttaagcgggc
121 cctcggggcg cggcctaaac agccgccccc gagggaaaaa ccacccagac ccccacgacc
181 acccacacca gaattagtca aaaggattcc acctccccca cccaacgggg aggatgaact
241 agtggtttct tacagcgcca aagacggcgt ttccggattg cctgagctca ccaccgtcag
301 ccaaccggaa gaaaacaaca cggcgttcag tgttcccccg ctcaatcaaa gggagaacag
361 ggacgctaag gaaccactaa ctggaacaat cattgagatg tgggatgggg aaatctatca
421 ctacggcctg tacgtagaac gaggtcttat acttggtgtg cataagccac cggcggccat
481 cagtcttgcc aaagttgagc tagcaccact ctctttgttc tggagacctg tgtatacccc
541 ccagtacctc atctctccag acactcttaa gagactacat ggagagtcat tcccctacac
601 cgcatttgac aacaattgct acgccttctg ctgttgggtg ctagacctaa acgactcatg
661 gctgagtagg agaatgattc agaggacaac aggtttcttc agaccatacc aagaatggaa
721 caggaaaccc ctccccacta tggatgactc caaattgaag aaggtagcca acatattctt
781 gtgcaccttg tcctcactat tcaccagacc cattaaggac ataataggga aattgaaacc
841 tcttaacatc ctcaatattc tggccacatg tgattggacc ttcccaggca tagtggaatc
901 cctaatactc ttggcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat
961 gatcgccccc ttactaggtg attatgaact gcaaggacct gaggaccttg cagtagaact
1021 ggtcccagtg gtgatggggg ggataggttt ggtgctagga ttcaccaaag agaaaattgg
1081 aaagatgctg tcgtccgctg catccaccct gagagcttgc aaagaccttg gtgcatacgg
1141 actggaaatt ttgaagctag tcatgaagtg gttcttccca aagaaagagg aaacaaatga
1201 actggctatg gtgagatcca tcgaggacgc agtactagac ctcgaggcaa ttgaaaacaa
1261 ccacatgacc gccctgctca aagacaaaga cagtttggca acctatatga aaacccttga
1321 tcttgaggag gagaaagcca gaaaactctc gactaaatcc gcttcacctg atattgtggg
1381 cacaatcaac gctcttctgg cacgaatcgc cgctgcacgc tccctggtgc atcgggcgaa
1441 agaagagctc tccagcaggc cgagacctgt tgtcgtgatg atatcgggaa aaccagggat
1501 agggaaaact caccttgcca gggagttggc caaaaagatc gcagcctccc tcacagggga
1561 ccagcgcgtg ggtttgatcc cacgcaatgg cgtcgaccac tgggatgcgt acaagggtga
1621 aagagttgtc ctatgggacg actatgggat gagcaacccc atacacgatg ctctcaggtt
1681 gcaggaactt gctgacactt gccccctcac actaaattgt gataggattg agaataaagg
1741 aaaagtcttt gatagtgatg ccataattat taccaccaat ctggccaacc cagcaccact
1801 ggactatgtt aactttgaag cgtgttcgag gcgcattgac ttcctcgtgt acgcggaagc
1861 ccctggggtg gagaaggcaa aacacgactt cccaggccaa cctgacatgt ggaagaacgc
1921 tttcagccct gacttctcac acataaaact ggcattggct ccacagggag gttttgacaa
1981 gaacggcaac accccgcatg gaaaaggtgt catgaagacc ctcaccactg gctccctcat
2041 cgcccgagca tcagggttac tccatgagag gctagatgaa tatgaattgc aaggcccagc
2101 cctcaccact ttcaacttcg accgcaacaa ggtacttgct tttagacagc ttgctgctga
2161 aaacaagtat ggcctaatgg acacaatgag agttggaaaa cagctcaagg atgttaagac
2221 tatgccagac cttaaacaag cactcaagaa tgttgcgatc aagaagtgcc agatagtgta
2281 caatggtggc acctacacac ttgaggctga cggcaagggt ggtgtgagag ttgacaaagt
2341 gcaaagtgcc accgtgcaaa ccaacaatga gctagccggc gccctgcacc acctaaggtg
2401 cgccagaatc aggtattatg ttaagtgtgt ccaggaggca ctgtactcca tcatccaaat
2461 cgctggggct gcgtttgtca ccacgcgcat cgccaagcgc atgaatatac aaaatctctg
2521 gtctaagcca caggtggaag acacagaaga gacaaccagc aaagatggtt gcccaaaacc
2581 caaagatgat gaagagttcg tcgtttcatc cgacgacatc aaaactgagg gcaagaaagg
2641 gaagaacaag tccggccgtg gcaagaagca cacagccttc tcaagtaaag ggctcagtga
2701 tgaagagtat gatgagtaca agaggattag ggaagagagg aatggtaagt actccataga
2761 ggagtacctc caggacagag acaagtacta tgaggaagtg gccattgcca gggcaactga
2821 agaggacttc tgtgaagaag aagaggccaa aatccggcag agaattttca gaccaacaag
2881 gaaacaacgt aaagaagaga gggcctctct aggcttggtc acaggctcag aaatcaggaa
2941 gagaaaccca gaagacttca aacccaaggg aaagctgtgg gctgatgatg acagaagtgt
3001 tgactacaat gagaagctca actttgaggc cccaccaagc atctggtcgc ggatagtcaa
3061 ctttggttca ggttggggtt tctgggtctc ccccagtctg tttataacat caacccatgt
3121 cataccccaa ggtgcaaaag agttcttcgg agtccccatc aaacaaatcc agatacacaa
3181 atcaggtgaa ttctgccgac tgagattccc aaaaccaatc agaactgatg tgacgggcat
3241 gattctggaa gaaggtgcgc cagagggaac cgtggccaca ctgctcatca agagaccaac
3301 tggagagctc atgcctttgg cagccagaat gggaacccat gcaaccatga ggattcaggg
3361 gcgcacggtt ggaggacaaa tgggcatgct cttgacagga tccaacgcca agagtatgga
3421 cctgggcaca acaccaggcg attgtggctg tccctacatc tacaaaagag ggaatgacta
3481 cgtagttata ggagtccata cagccgctgc ccgtggagga aacaccgtca tctgcgccac
3541 ccagggtagt gaaggagaag ccacacttga aggaggtgac aacaaaggaa cgtactgtgg
3601 tgcaccaatt ttgggcccag ggagtgctcc gaaactcagc actaaaacta agttttggag
3661 atcatccaca acgccactcc cgccaggcac ctacgaacca gcttacctcg gtggaaagga
3721 ccctagagtc aaaggtggcc cttcattgca acaagttatg agggaccaac taaaaccatt
3781 cacagaaccc agaggcaaac cgccaagacc aaatgtgttg gaagctgcca agaaaaccat
3841 cattaatgtt cttgagcaaa caattgaccc accccaaaaa tggtcattcg cgcaagcttg
3901 cgcatccctc gacaaaacca cctccagcgg ccatccgcac cacatgcgga aaaacgactg
3961 ctggaatggg gaatccttta ctggaaaatt ggcagaccag gcttctaagg ccaacctaat
4021 gtttgaagag ggaaagaaca tgactccagt ttacacaggt gcacttaaag atgagttagt
4081 gaagactgac aaaatttatg gtaagatcaa gaagaggctc ctgtggggct cggacctggc
4141 gaccatgata cggtgcgccc gggcttttgg gggccttatg gatgaactca aggcgcattg
4201 tgtcaccctt cctgttagag ttggtatgaa tatgaatgaa gatggcccta taatctttga
4261 gaagcactcc agatataaat atcattatga tgctgattac tccaggtggg actcgacaca
4321 acaaagggat gtgttagcag cagcactaga aatcatggtt aaattctctc cagaaccaca
4381 cttggcccag atagttgcag aagacctcct ttcccctagc gtaatggatg tgggtgactt
4441 tcaaatatca ataagcgaag gactcccctc cggggtacct tgcacctccc agtggaattc
4501 catcgcccac tggctcctca ccctttgtgc actctctgag gtcacggacc tgtcccctga
4561 cattattcag gccaactccc ttttctcctt ctatggtgat gacgagattg tgagtacaga
4621 cataaagttg gacccagaga agctgacggc aaaactcaag gagtacggac tgaagccaac
4681 ccgccccgac aagactgaag gaccccttgt tatctctgaa gatctggatg gcctgacatt
4741 cctccggagg actgtgaccc gtgacccagc tggctggttt ggtaaattgg aacaaagttc
4801 aatcctcagg caaatgtact ggaccagggg ccccaatcat gaagacccat ctgaaacaat
4861 gataccacac tcccaaagac ccatacaatt gatgtccctg ctaggcgagg ctgcactcca
4921 cggcccagca ttttacagca aaattagcaa attggtcatt gcagaattga aggaaggtgg
4981 tatggacttt tacgtgccca gacaagaacc aatgttcaga tggatgagat tctcagatct
5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga
5101 gtgacgccaa cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg
5161 ttatggctct ggagcccgtt gttggtgccg ctattgcggc acctgtagcg ggccaacaaa
5221 atgtaattga cccctggatt agaagtaatt ttgtgcaagc ccctggtgga gagtttacag
5281 tatcccctag aaacgctcca ggtgaaatat tatggagcgc tcccctaggc cctgatctaa
5341 acccctacct atcccatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc
5401 aggtaatcct tgcggggaac gcgttcaccg ccggaaagat catatttgca gcagtcccac
5461 ctaattttcc aactgaaggt ttgagcccca gccaggtcac tatgttcccc catataatag
5521 tagatgttag acaactggaa cctgtgttga ttcccctacc cgatgttagg aataatttct
5581 accactataa ccaatcaagt gaccccacta ttaagttgat agcaatgttg tatacaccac
5641 ttagggctaa taatgctggg gatgatgtct tcacggtttc ttgccgagtt ctcacgagac
5701 catcccctga ttttgatttc atattcctgg tgccacccac agttgagtca agaactaaac
5761 cattctctgt cccaatttta actgttgagg agatgactaa ctcaagattc cccattcctt
5821 tggaaaagtt gttcacaggt cccagcagca cctttgttgt tcagccacaa aatggcaggt
5881 gcacgactga tggcgtgctc ctaggcacca cccaattgtc ccctgtcaac atctgtacct
5941 tcagaggtag tgtcacccaa acagcggcag gtactcataa ctacacaatg aatttggctt
6001 cccaaaattg gaacagttat gatccaacag aagagatccc agccccttta ggaaccccag
6061 atttcgtagg gaagattcga ggtatgctca cccaaaccac aaggggaaat ggctcaacac
6121 gcggccacat agccacagtg tacactggga gcaacgactt tgctccaaaa ctgggtaggg
6181 tccaatttga aactgacaca accaacgatt ttgaaactaa ccaaaacaca aagttcaccc
6241 cggttggcgt catccaggat ggtgacactg cccaccgaag tgaaccccta caatgggtgc
6301 tcccaagtta ttcaggtaga aatactcata atgtgcatct ggcccccgct gtagctccca
6361 cttttccggg cgagcagctc ctcttcttta gatctacctt gcccggatgc agcgggtacc
6421 ccaacatgga tttggattgt ctgcttcccc aggaatgggt gcagtacttc tatcaagagg
6481 cagccccagc acaatctgaa gtggctctgt taagatttgt gaatccagat acaggtaggg
6541 ttttgtttga gtgtaagctc cacaaatcgg gctatgtcac agtggctcat actggccaac
6601 atgatttggt tatccccccc aatggttatt tcagatttga ctcctgggtc aatcaattct
6661 acacgcttgc ccccatggga aatggaacgg ggcgtagacg tgcattataa tggccggagc
6721 tttctttgct ggattggcat ctgatgtcct tggctctgga cttggttccc tgatcaacgc
6781 tggggctggg gccatcaacc aaaagattga atttgaaaat aacagaaaat tgcaacaggc
6841 atccttccaa tttagtagca acctacaaca ggcttccttt caacatgaca aagagatgct
6901 ccaagcacaa attgaggcca ccaaaaagct gcaacaggaa atgatgaaag ttaagcaggc
6961 agtgctctta gagggcgggt tttctgagac agatgcagcc cgcggggcaa ttaacgcccc
7021 tatgacaaaa actttggatt ggagcggtac aaggtattgg gcccctgatg ctagaactac
7081 aacatacaat gcaggccgct tctccacacc ccaaccatcg ggggcactgc caggaagagc
7141 taatctcagg aatgctgtcc ccgctcgggg ttcctccagt acaccttcta attcctctat
7201 tgctatttct gtgcactcaa atcaaactgc ttcgacgaga cttggttcta cagctggttc
7261 tgggaccagt gtctcaagct tcccgtcaac tgcaaggact aggagctggg ttgaggatca
7321 aaataggaat ctgtcacctt tcatgagggg ggcccacaac atatcgtttg tcaccccacc
7381 atctagcaga tcctctagcc aaggcacagt ctcaaccgtg cctaaagaag ttttggactc
7441 ctggactggt gcttttaaca cgcacaggca gcctctcttc gctcacattc gtaggcgagg
7501 ggagtcacgg gtgtaatgtg aaaagataaa attgattat
//
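
The `mat_peptide` features in the record above split the ORF1 polyprotein into six products. A short sketch, assuming the coordinates are copied exactly from the record and that the final three nucleotides of the CDS are the stop codon, can check that the peptides tile the coding region without gaps or overlaps. The function names are hypothetical and serve only as an illustration.

```python
# mat_peptide features of ORF1, copied from the record above (1-based, inclusive).
MAT_PEPTIDES = [
    ("p48", 5, 994),
    ("NTPase", 995, 2092),
    ("p22", 2093, 2629),
    ("VPg", 2630, 3028),
    ("Pro", 3029, 3571),
    ("RdRp", 3572, 5101),
]
ORF1_CDS = (5, 5104)


def check_tiling(peptides, cds, stop_codon_nt=3):
    """Return a list of problems; an empty list means the peptides tile the CDS."""
    problems = []
    expected_start = cds[0]
    for name, start, end in peptides:
        if start != expected_start:
            problems.append(f"{name}: starts at {start}, expected {expected_start}")
        if (end - start + 1) % 3 != 0:
            problems.append(f"{name}: length is not a multiple of 3")
        expected_start = end + 1
    # The last peptide should stop just before the assumed stop codon.
    if expected_start - 1 != cds[1] - stop_codon_nt:
        problems.append("last peptide does not end immediately before the stop codon")
    return problems


def peptide_lengths_aa(peptides):
    """Length of each mature peptide in amino acids."""
    return {name: (end - start + 1) // 3 for name, start, end in peptides}


if __name__ == "__main__":
    print(check_tiling(MAT_PEPTIDES, ORF1_CDS))
    print(peptide_lengths_aa(MAT_PEPTIDES))
```

For this record the check returns no problems, and the six peptide lengths (330, 366, 179, 133, 181 and 510 amino acids) sum to 1699, the length of the ORF1 translation.
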