![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| OR844383 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P31 |
ORF1: 5..5104
ORF2: 5085..6707
ORF3: 6707..7513
LOCUS OR844383 7559 bp RNA linear VRL 29-NOV-2023
DEFINITION Norovirus GII isolate NGII.4/18968/Shizuoka/Feb/2021-2022, complete
genome.
ACCESSION OR844383
VERSION OR844383.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7559)
AUTHORS Hoque,S.A., Pham,N., Okitsu,S., Shimizu,Y.O. and Ushijima,H.
TITLE Direct Submission
JOURNAL Submitted (24-NOV-2023) Department of Pathology and Microbiology,
Nihon University Graduate School of Medicine, Itabashi-ku,
Itabashi-Ku, Itabashi 173-8610, Japan
COMMENT ##Assembly-Data-START##
Assembly Method :: SOAPdenovo v. May-2022
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7559
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="NGII.4/18968/Shizuoka/Feb/2021-2022"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Japan"
/collection_date="10-Feb-2022"
/note="genotype: GII.4"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="WPK51444.1"
/translation="MKMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARP
KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE
ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
LAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLTTYMRTLDLEEEKARKLST
KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL
AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGSALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM
SDLKQALKNIAIKKCQIVYNGGTYTLEADGRGSVKVDKVQSATVQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGC
LKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEELAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPVKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
RGRPPRPNVLEAAKKTIINVLEQTIDPPQKWSFSQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTDLSPDIIQANSFFSFYGDDEIVSTDIKLDPERLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 5..994
/gene="ORF1"
/product="p48"
mat_peptide 995..2092
/gene="ORF1"
/product="NTPase"
mat_peptide 2093..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6707
/gene="ORF2"
CDS 5085..6707
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="WPK51445.1"
/translation="MKMASSDANPSDGSAASLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNSYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPILTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGML
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTSNDFETNQNTKFTPVGVIQDG
GTTHRNEPQQWVLPGYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRAL"
gene 6707..7513
/gene="ORF3"
CDS 6707..7513
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="WPK51446.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDTAPARGSSSKSS
NSSTATSVYSNQTISTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgccaaca gcaacaacga
61 catcgcaaaa tcttcaagtg acggtgtgtt ttctaacatg gctgtcactt ttaagcgggc
121 cctcggggcg cggcctaaac agccgccccc gaaggaaata ccacccagac ccccgcgacc
181 acccacacca gaattggtca aaaagatccc tcctccccca cccaacgggg aggatgaact
241 agtggtctct tacagcgcca aagatggcgt ttccggattg cctgagctca ccactgtcag
301 acaaccggaa gaaaccaaca cggcgttcag tgtcccccca ctcaaccaaa gggagagcag
361 ggacgccaag gagccactaa ctggaacaat cattgaaatg tgggatggag aaatctacca
421 ttacggcctg tacgtggaac gaggtcttat acttggtgtg cacaagccac cagcagcaat
481 cagcctcgcc aaggtcgagt tggcaccgct ctctttgttc tggagacctg tatacacccc
541 ccagtacctt atctctccag acactcttag gagattacat ggagagtcat tcccctacac
601 tgcatttgac aacaattgct acgccttttg ttgttgggta ttagacctaa acgactcatg
661 gctaagcagg agaatgattc agagaacaac aggcttcttc aggccgtacc aggattggaa
721 caggaaacct ctccccacta tggatgattc caaattaaag aaggtagcca acatattctt
781 gtgcactttg tcttcactat tcaccagacc cattaaggac ataataggga agttgaaacc
841 tctcaacatc cttaacattc tggccacatg tgattggacc ttcgcaggca tagtggaatc
901 cttaatactc ttggcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat
961 gatcgccccc ttgctaggtg attatgaact gcaaggaccc gaggaccttg cagtggaact
1021 ggtcccaata gtgatggggg ggataggttt ggtgctagga tttaccaaag agaaaatcgg
1081 aaagatgctg tcatccgctg cgtccacttt aagagcttgt aaagaccttg gtgcatacgg
1141 actggaaatc ttaaaattgg tcatgaagtg gttcttccca aagaaagagg aagcaaatga
1201 actggctatg gtgagatcca tcgaggatgc agtactagac ctcgaggcaa ttgaaaacaa
1261 ccatatgacc actctgctca aagacaaaga cagcttgaca acctacatga gaacccttga
1321 ccttgaggag gagaaagcca gaaaactctc gaccaaatct gcttcacccg atattgtggg
1381 cacaattaac tctcttctgg caagaatcgc tgctgcacgc tccctagtgc atcgggcgaa
1441 agaagagctc tccagcaggc cgagacctgt cgttgtaatg atatcgggaa gaccagggat
1501 agggaaaact caccttgcca gggagctggc caagaagatc gcggcctccc tcacagggga
1561 ccagcgtgtg ggccttatcc cacgcaatgg tgtcgaccac tgggacgcat acaagggcga
1621 aagagttgtc ctatgggacg actatggaat gagcaacccc atccatgatg ccctcagatt
1681 gcaggagctt gctgacactt gccccctcac gctaaattgt gacagaattg agaacaaggg
1741 aaaagtcttt gacagcgatg ccataattat caccaccaac ctggccaacc cagcaccact
1801 ggattatgtc aactttgaag cgtgctcgag acgcattgat ttcctcgtgt acgcagaagc
1861 ccctgaggtg gagaaggcga agcgcgactt cccaggtcaa cctgacatgt ggaagaacgc
1921 tttcagtcct gacttctcac acataaaact gtcattggct ccacagggcg gttttgacaa
1981 gaacggcaac accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat
2041 cgcccgagca tcagggttac tccatgagag gctagatgaa tatgaattgc aaggctcagc
2101 cctcaccact ttcaactttg accgcaacaa gatacttgcc tttagacagc ttgctgctga
2161 aaacaagtat gggctgatgg acacaatgag agttgggaaa cagcttaagg atgtcaagac
2221 tatgtcagac ctcaaacaag cactcaagaa catcgcgatc aagaagtgcc agatagtgta
2281 caatggtggc acctacacac ttgaggctga tggcaggggc agtgtgaaag ttgacaaagt
2341 gcaaagtgcc actgtgcaga ccaacaatga actagccggt gccctacacc acctaagatg
2401 cgctagaatc aggtactatg ttaagtgcgt ccaggaggca ctgtattcca tcatccaaat
2461 cgctggggct gcgttcgtca ccacgcgcat cgctaagcgc atgaatatac agaatctctg
2521 gtccaagcca caggtggaag acacagaaga gatggccaac aaagatggtt gcctaaaacc
2581 caaagatgat gaagagtttg tcgtttcatc cgacgacatc aaaactgagg gcaagaaagg
2641 gaagaacaag tccggccgtg gcaagaaaca tacagccttt tcaagtaaag ggctcagtga
2701 tgaggagtac gatgagtaca aaagaatcag agaagaaagg aatggtaagt actccataga
2761 agagtacctt caggacagag acaggtacta cgaggagttg gccattgcca gggcaaccga
2821 agaggacttc tgtgaagaag aagaggccaa aatccggcag aggattttca gaccaacaag
2881 gaaacaacgc aaagaagaga gggcctctct cggcttggtc acaggctctg aaatcaggaa
2941 gagaaaccca gaagacttca aacccaaggg aaagctgtgg gctgatgatg acagaagtgt
3001 tgactacaat gagaaactca actttgaggc cccaccaagc atctggtcgc ggatagtcaa
3061 ctttggttca ggctggggct tctgggtctc ccccagtctg tttataacat caacccatgt
3121 tataccccaa ggtgcaaaag agtttttcgg agtccctgtc aagcaaatcc agatacacaa
3181 gtcaggtgaa ttctgccgat tgagattccc aaagccaatc agaaccgatg tgacgggcat
3241 gattctagaa gaaggtgcgc ccgaggggac cgtggccaca ctgctcatca agagaccaac
3301 tggagagctc atgcccctgg cagccagaat ggggacccat gcaaccatga aaattcaggg
3361 gcgcacagtt ggagggcaaa tgggtatgct cctgacagga tccaacgcca agagtatgga
3421 cctaggcaca acaccaggcg actgcggctg cccctacatc tacaagaggg ggaatgacta
3481 cgtggtcata ggggtccata cggccgctgc ccgcggagga aacactgtca tatgtgccac
3541 ccagggaagt gagggagagg ccacactcga aggaggtgat agtaaaggga catactgtgg
3601 cgcgccaatc ttgggcccag ggagcgctcc gaagctcagc accaagacta agttttggag
3661 atcatccacg acaccactcc cacctggcac ctacgaacca gcctacctcg gtggcaaaga
3721 ccctagagtc aaaggtggcc cttcattgca acaagttatg agggaccagc tgaagccatt
3781 cacagaaccc agaggcagac caccaagacc aaatgtgttg gaagctgcca agaaaaccat
3841 catcaatgtc cttgagcaaa caattgatcc accccaaaaa tggtcatttt cgcaagcttg
3901 cgcatccctt gacaaaacca cctccagcgg ccacccgcac cacatgcgga aaaacgactg
3961 ttggaatggg gagtccttca caggaaaatt ggctgaccaa gcctccaagg ccaacctaat
4021 gtttgaagag ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttggt
4081 aaagaccgat aaagtttatg gtaaggtcaa gaagaggctt ctgtggggtt cagatctggc
4141 gaccatgata cggtgcgccc gagcttttgg aggcctcatg gatgaactca aggcacactg
4201 tgtcacgctc cctgtcagag ttggtatgaa catgaatgag gatggcccca tcatcttcga
4261 gaagcactcc agatatagat atcactatga tgctgactat tcccgatggg actcaacaca
4321 acaaagggat gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca
4381 cctggcccag atagttgcag aagacctcct ctcccctagc gtgatggatg tgggtgactt
4441 tcaaatatca ataagtgagg gtctcccctc tggggtgcct tgcacttccc agtggaattc
4501 catcgcccac tggctcctca ctctgtgtgc actctctgaa gtcacggacc tgtcccctga
4561 tatcattcag gccaactcct ttttctcctt ttatggtgat gatgagattg taagcacaga
4621 cataaagttg gacccagaga ggctgacagc aaaactcaag gagtacgggc tgaaaccaac
4681 ccgccccgac aaaactgaag gaccccttgt tatctctgaa gacctggatg gtctgacatt
4741 cctccggaga actgtgaccc gtgacccagc tggttggttt ggaaaattgg aacaaagttc
4801 aattcttagg caaatgtatt ggaccagggg tcccaaccat gaagatccat ttgaaacaat
4861 gataccacac tcccaaagac ccatacagtt gatgtccctg ctgggcgagg ctgcactcca
4921 cggcccggca ttctacagca aaattagcaa attagtcatt gcagagttga aggaaggtgg
4981 catggatttt tacgtaccca gacaagagcc aatgttcaga tggatgagat tctcagatct
5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga
5101 gtgacgccaa cccatctgat gggtccgcag ccagcctcgt cccagaggtc aacaatgagg
5161 ttatggctct ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccagcaaa
5221 atgtaattga cccctggatt agaaacaatt ttgtacaagc ccctggtgga gagttcacag
5281 tgtcccctag aaacgctcca ggtgaaatac tatggagcgc gcccttgggc cctgatctaa
5341 atccctactt atctcatttg gccagaatgt acaatagtta tgcaggtggt tttgaagtgc
5401 aggtaattct cgcggggaac gcgttcaccg ccgggaaggt catatttgca gcagtcccac
5461 caaattttcc aactgaaggc ttgagcccca gccaggtcac tatgttcccc catatagtag
5521 tagatgttag gcaattagaa cctgtgttga ttcccttacc cgatgttagg aataatttct
5581 atcattacaa tcaatcaaat gaccccacta ttaagctgat agcaatgttg tatacaccac
5641 ttagggctaa taatgctggg gatgatgtct tcacagtttc ttgccgagtt ctcacgaggc
5701 catcccccga ttttgacttc atatttctag tgccacccac agttgagtca agaactaagc
5761 cattctctgt cccaatttta actgttgagg agatgaccaa ttcaagattc cccattcctc
5821 tggaaaagtt gttcacgggt cccagcagtg cctttgttgt ccaaccacaa aacggtaggt
5881 gcacgactga tggcgtgctc ctaggcacca cccaactgtc tcctgtcaac atctgcacct
5941 tcagaggaga tgtcacccat atcacaggta gtcacaacta cacaatgaat ttggcttctc
6001 aaaattggag caattacgac ccaacagaag aaattccagc ccctctaggg actccagact
6061 ttgtggggaa gattcaaggc atgcttaccc aaaccacaag gacagatggc tcaacacgcg
6121 gccacaaagc cacagtgtac actgggagcg ccgactttgc cccaaaactg ggtagagttc
6181 aatttgaaac tgacacaagc aatgattttg aaactaacca aaacacaaag ttcaccccag
6241 ttggtgtcat ccaagatggt ggcaccaccc accgaaatga accccaacag tgggtgctcc
6301 caggttactc aggcaggaac actcctaatg tgcatctggc ccccgctgta gcccccactt
6361 ttccgggtga gcaactcctc ttcttcaggt ccaccatgcc cggatgcagc gggtatccca
6421 acatggattt ggattgtctg ctcccccagg aatgggtgca gcacttctac caagaggcag
6481 ccccagcaca atctgatgtg gctctgctaa ggtttgtgaa tccagacaca ggtagggttt
6541 tgtttgaatg taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg
6601 atttggttat cccccccaat ggttatttta ggtttgactc ctgggtcaac cagttttaca
6661 cgcttgcccc catgggaaat ggagcggggc gtagacgtgc actataatgg ctggagcttt
6721 ctttgctgga ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg
6781 ggctggggcc atcaaccaaa aagttgagtt tgaaaataac agaaaactgc aacaagcatc
6841 cttccaattt agcagcaatc tacaacaggc ttcctttcaa catgacaaag agatgctcca
6901 agcacaaatt gaggctacca aaaggttaca acaggaaatg atgaaagtta agcaggcaat
6961 gctcttagag ggtgggttct ctgagacaga tgcagcccgc ggggcaatca acgcccctat
7021 gacaaaagct ttggactgga gcgggacaag gtactgggct cctgatgcta ggactacaac
7081 atacaatgca ggccgctttt ccacccccca accatcgggg gcgctgccag ggagagctaa
7141 ccttagggat actgcccccg ctcggggttc ctctagtaag tcttctaatt cttctactgc
7201 tacttctgtg tactcaaatc aaactatttc aacgagactt ggttctacag ctggttctgg
7261 taccagtgtc tcgagcttcc cgtcaactgc aaggactagg agctgggttg aggatcaaag
7321 taggaatttg tcacctttca tgaggggggc ccacaacata tcgtttgtca ccccaccatc
7381 tagcagatcc tctagccaag gcacagtctc aaccgtgcct aaagagattt tggactcctg
7441 gactggcgct ttcaacacgc gcaggcagcc actcttcgct cacattcgta agcgagggga
7501 gtcacgggtg taatgtgaaa agacaaaatt gattatcttt cttttcttta gtgtctttt
//