![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| OR844382 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P31 |
ORF1: 5..5104
ORF2: 5085..6707
ORF3: 6707..7513
LOCUS OR844382 7559 bp RNA linear VRL 29-NOV-2023
DEFINITION Norovirus GII isolate NGII.4/18958/Shizuoka/Jan/2021-2022, complete
genome.
ACCESSION OR844382
VERSION OR844382.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7559)
AUTHORS Hoque,S.A., Pham,N., Okitsu,S., Shimizu,Y.O. and Ushijima,H.
TITLE Direct Submission
JOURNAL Submitted (24-NOV-2023) Department of Pathology and Microbiology,
Nihon University Graduate School of Medicine, Itabashi-ku,
Itabashi-Ku, Itabashi 173-8610, Japan
COMMENT ##Assembly-Data-START##
Assembly Method :: SOAPdenovo v. May-2022
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7559
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="NGII.4/18958/Shizuoka/Jan/2021-2022"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Japan"
/collection_date="27-Jan-2022"
/note="genotype: GII.4"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="WPK51441.1"
/translation="MKMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARP
KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYNAKDGVSGLPELTTVRQPE
ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
LAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
LKPLNILNVLATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDRDSLTTYMRTLDLEEEKARKLST
KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL
AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM
SDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVENTEEMANKDGC
LKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPERLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 5..994
/gene="ORF1"
/product="p48"
mat_peptide 995..2092
/gene="ORF1"
/product="NTPase"
mat_peptide 2093..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6707
/gene="ORF2"
CDS 5085..6707
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="WPK51442.1"
/translation="MKMASSDVNPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGML
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNNDFEANQNTKFTPVGVIQDG
GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRAL"
gene 6707..7513
/gene="ORF3"
CDS 6707..7513
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="WPK51443.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANFRDAVPARGSSSKSS
NSSIATSVYSNQTASTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgccaaca gcaacaacga
61 catcgcaaaa tcttcaagtg acggtgtgtt ttctaacatg gctgtcactt ttaagcgggc
121 cctcggggcg cggcctaaac agccgccccc gaaggaaata ccacccagac ccccgcgacc
181 acccacacca gaattggtca aaaagatccc tcctccccca cccaacgggg aggatgaact
241 agtggtctct tacaacgcca aagatggcgt ttccggactg cctgagctca ccactgtcag
301 acaaccggaa gaaaccaaca cggcgttcag tgtcccccca ctcaaccaaa gggagagcag
361 ggacgccaag gagccactaa ctggtacaat cattgaaatg tgggatggag aaatctacca
421 ttacggcctg tacgtggaac gaggtcttat acttggtgtg cacaagccac cagcagccat
481 cagcctcgct aaggtcgagc tggcaccact ctctttgttt tggagacctg tatacacccc
541 ccagtacctt atctctccag atactcttag gagattacat ggagagtcat tcccctacac
601 tgcatttgac aacaattgct acgccttttg ttgttgggta ttagacctaa acgactcatg
661 gctaagcagg agaatgattc agagaacaac aggcttcttt aggccgtacc aggattggaa
721 caggaaaccc ctccccacta tggatgattc caaattaaag aaggtagcca acatattttt
781 gtgcactttg tcttcactat tcaccaggcc catcaaggac ataataggga agttgaaacc
841 tctcaacatc cttaatgttc tggctacatg tgattggact ttcgcaggca tagtggaatc
901 cttaatactc ttggcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat
961 gatcgccccc ttgctaggtg attacgaact gcaaggaccc gaggaccttg cagtggaact
1021 ggtcccaata gtgatggggg ggataggttt ggttctagga tttaccaaag agaagatcgg
1081 aaagatgctg tcatcagctg catccacttt aagagcttgt aaagaccttg gtgcatacgg
1141 actggaaatc ttaaaattgg tcatgaagtg gttcttccca aagaaagagg aagcaaatga
1201 actggctatg gtgagatcca tcgaggatgc agtactagac ctcgaggcaa ttgaaaacaa
1261 ccacatgacc accctgctca aagacagaga cagcttgaca acctacatga gaacccttga
1321 ccttgaggag gagaaagcca gaaaactctc aaccaaatct gcttcacccg atattgtggg
1381 cacaatcaac tctcttctgg caagaatcgc tgctgcacgc tccttagtgc atcgggcgaa
1441 agaagagctc tccagcaggc cgagacctgt cgttgtaatg atatcgggaa gaccagggat
1501 agggaaaact caccttgcca gggagctggc caagaagatc gcggcctccc tcacagggga
1561 ccagcgtgtg ggccttatcc cacgcaatgg tgtcgaccac tgggacgcat acaagggcga
1621 aagggtcgtc ctatgggacg actatggaat gagcaacccc atccatgatg cccttaggtt
1681 gcaggagctt gctgacactt gccccctcac actaaattgt gacagaattg agaacaaagg
1741 aaaagtcttt gacagtgacg ccataattat caccaccaac ctggccaacc cagcaccact
1801 ggattatgtc aactttgaag cgtgctcgag acgcattgat ttcctcgtgt acgcagaagc
1861 ccctgaggtg gagaaggcga agcgcgactt cccaggtcaa cctgacatgt ggaagaacgc
1921 tttcagtcct gacttctcac acataaaact gtcattggct ccacagggtg gttttgacaa
1981 gaacggcaac accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat
2041 cgcccgagca tcagggttac tccatgagag gctagatgaa tatgaattgc aaggcccagc
2101 cctcaccact ttcaactttg accgcaacaa gatacttgct tttagacagc tcgctgctga
2161 aaacaagtat gggctgatgg acacaatgag agttggaaaa cagctcaagg atgtcaagac
2221 tatgtcagac ctcaaacagg cactcaagaa catcgcgatc aagaagtgcc agatagtgta
2281 caatggtggc acctacacac ttgaggctga tggcaagggc agtgtgaaag ttgacaaagt
2341 gcaaagtgcc actgtgcaga ccaacaatga actagccggt gccctacacc acctaagatg
2401 cgctagaatc agatactatg ttaagtgcgt ccaggaggca ctgtattcca tcatccaaat
2461 cgctggggct gcattcgtca ccacgcgcat cgctaagcgc atgaatatac agaacctctg
2521 gtccaagcca caggtggaaa acacagaaga gatggccaac aaagatggtt gcctaaaacc
2581 taaagatgat gaagagtttg tcgtttcatc cgacgacatc aaaactgagg gcaagaaagg
2641 gaagaacaag tccggccgtg gcaagaagca cacagccttt tcaagtaaag ggctcagtga
2701 cgaggagtat gatgaataca agagaatcag agaagaaagg aatggtaagt actccataga
2761 agagtacctt caggacagag acaggtacta cgaggaggtg gccattgcta gggcaaccga
2821 agaggatttc tgtgaggagg aggaggccaa aatccggcag aggattttca gaccaacaag
2881 gaaacaacgc aaagaagaga gggcttctct cggcttggtc acaggctctg aaatcaggaa
2941 gagaaaccca gaagacttca aacccaaggg aaagctgtgg gctgatgacg acagaagtgt
3001 tgactacaat gagaaactca actttgaggc cccaccaagc atctggtcgc ggatagtcaa
3061 ctttggttca ggctggggct tctgggtctc ccccagtctg tttataacat caacccatgt
3121 cataccccaa ggtgcaaagg agttcttcgg agtccctatc aagcaaatcc agatacacaa
3181 gtcaggtgaa ttctgccgat tgagattccc aaagccaatc agaaccgatg tgacgggcat
3241 gattctagaa gaaggtgcgc ccgaggggac cgtggccaca ctgctcatca agagaccaac
3301 tggagagctc atgcctctgg cagccagaat ggggacccat gcaaccatga aaattcaggg
3361 gcgcacagtt ggagggcaaa tgggtatgct cctgacagga tccaacgcca agagtatgga
3421 cctaggcaca acaccaggcg actgcggctg cccctacatc tacaagaggg ggaatgacta
3481 cgtggtcata ggggtccata cggctgctgc ccgtggagga aacactgtca tatgtgccac
3541 ccagggaagt gagggagagg ccacacttga aggaggtgac agtaaaggga catactgtgg
3601 cgcaccaatc ttgggcccag ggagcgctcc gaagctcagc accaagacca agttttggag
3661 atcatccaca acaccactcc cacctggcac ctacgaacca gcctacctcg gtggcaaaga
3721 ccctagagtc aaaggtggcc cttcattgca acaagttatg agggaccagc tgaagccatt
3781 cacagaaccc agaggcaaac caccaagacc aaatgtgttg gaagctgcca agaaaaccat
3841 catcaatgtc cttgagcaaa caattgatcc accccaaaaa tggtcatttg cgcaagcttg
3901 cgcatccctt gacaaaacca cctccagcgg ccacccgcac cacatgcgga aaaatgactg
3961 ttggaatggg gagtccttca caggaaagtt ggctgaccaa gcctccaagg ccaacctaat
4021 gtttgaagag ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttggt
4081 aaagaccgat aaagtttatg gtaaggtcaa gaagaggctt ctatggggtt cagatctggc
4141 gaccatgata cggtgcgccc gagcttttgg aggcctcatg gatgaactca aggcacactg
4201 tgttacactc cctgtcagag ttggtatgaa catgaatgag gatggcccca tcatcttcga
4261 gaagcactcc agatatagat atcattatga tgctgactat tcccggtggg actcaacaca
4321 acaaagggat gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca
4381 cctggcccag atagttgcag aagacctcct ttcccctagc gtgatggatg tgggtgactt
4441 tcaaatatca ataagtgagg gtctcccctc tggggtgcct tgcacctccc agtggaattc
4501 catcgcccac tggctcctca ctctgtgtgc actctctgaa gtcacggacc tgtcccctga
4561 tatcattcag gccaactccc ttttctcctt ctatggtgat gatgagattg taagcacaga
4621 cataaagttg gacccagaga ggctaacagc aaaactcaag gagtacgggc tgaaaccaac
4681 ccgccccgac aaaactgaag gacccctcgt catctctgaa gacctggatg gtctgacatt
4741 cctccggaga actgtgaccc gtgatccagc tggctggttt ggaaaattgg aacaaagttc
4801 aattcttagg caaatgtact ggaccagggg tcccaaccat gaagacccat ttgaaacaat
4861 gataccacac tcccaaagac ccatacaatt gatgtccttg ttgggcgagg ctgcactcca
4921 cggcccggca ttctacagca aaatcagcaa attagtcatt gcagagttga aggaaggtgg
4981 catggatttt tacgtaccca gacaagagcc aatgttcagg tggatgagat tctcagatct
5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga
5101 gtgacgtcaa cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg
5161 ttatggcgct ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa
5221 atgtaattga cccctggatt agaaacaatt ttgtacaagc ccctggtgga gagtttacag
5281 tatcccctag aaacgctcca ggtgaaatac tatggagcgc gcccttgggc cctgatctaa
5341 atccctactt gtctcatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc
5401 aggtaattct cgcggggaac gcgttcaccg ccgggaaggt catatttgca gcagtcccac
5461 caaatttccc aactgaaggc ttgagcccca gccaagtcac tatgttcccc catatagtag
5521 tagatgttag gcaactagaa cctgtgttga ttcccttacc cgatgttagg aataatttct
5581 atcattataa tcaatcaaat gaccccacca ttaagttgat agcaatgttg tacacaccac
5641 ttagggctaa taatgctggg gatgatgtct tcacagtctc ttgccgagtt ctcacgagac
5701 catcccccga ctttgacttc atatttctag tgccacccac agttgagtca agaactaaac
5761 cattctctgt cccagtctta actgttgagg agatgaccaa ttcaagattc cccattcctt
5821 tggaaaagtt gttcacgggt cccagcagtg cctttgttgt ccaaccacaa aacggtaggt
5881 gcacgactga tggcgtgctc ctaggcacca cccaactgtc tcctgtcaac atctgcacct
5941 ttagaggaga tgtcacccat atcacaggta gtcataacta tacaatgaat ttggcttctc
6001 aaaattggag taactacgac ccaacagaag aaatcccagc ccctctaggg actccagact
6061 ttgtggggaa gattcaaggc atgcttaccc aaaccacaag gacagatggc tcaacacgcg
6121 gccacaaagc cacagtgtat actgggagtg ccgactttgc tccaaaactg ggtagagttc
6181 aatttgaaac tgacacaaac aatgattttg aagctaacca aaacacaaag ttcaccccag
6241 ttggtgtcat ccaagatggt ggcaccaccc atcgaaatga accccaacag tgggtactcc
6301 caagttactc aggcaggaac actcctaatg tgcatctggc ccccgctgtg gcccccactt
6361 ttccgggtga gcaactcctc ttctttagat ccaccatgcc cggatgcagc gggtacccca
6421 atatggattt ggattgtctg ctcccccagg aatgggtgca gtacttctac caggaggcag
6481 ccccagcaca atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggttt
6541 tgtttgaatg taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg
6601 atttggttat cccccccaat ggttatttta ggtttgattc ctgggtcaac cagttttaca
6661 cgcttgcccc catgggaaat ggagcggggc gtaggcgtgc actataatgg ctggggcttt
6721 ctttgctgga ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg
6781 ggctggggcc atcaaccaaa aagttgagtt tgaaaataac agaaaattgc aacaagcatc
6841 cttccaattt agcagcaatc tacaacaggc ttcctttcaa catgacaaag agatgctcca
6901 agcacaaatt gaggctacca aaaggttgca acaggaaatg atgaaagtta agcaggcaat
6961 gctcctagag ggtgggttct ctgagacaga tgcagcccgc ggggcaatca acgcccctat
7021 gacaaaagcc ttggactgga gcgggacaag gtactgggct cctgatgcta ggactacaac
7081 atacaatgca ggccgctttt ccacccccca accatcaggg gcgctgccag gaagagctaa
7141 ttttagggat gctgtccctg ctcggggttc ctctagtaag tcttctaatt cttctattgc
7201 tacttctgtg tattcaaatc aaactgcttc aacgagactt ggttctacag ctggttctgg
7261 caccagtgtc tcgagcttcc cgtcaactgc aaggactagg agctgggttg aggaccaaag
7321 taggaatttg tcacctttca tgaggggggc ccataacata tcgtttgtca ccccaccatc
7381 tagcaggtcc tctagccaag gcacagtctc aaccgtgcct aaagagattt tggactcctg
7441 gactggcgct ttcaacacgc gcaggcagcc actcttcgct cacattcgta agcgagggga
7501 gtcacgggtg taatgtgaaa agacaaaatt gattatcttt cttttcttta gtgtctttt
//