![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| OR844370 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P31 |
ORF1: 5..5104
ORF2: 5085..6707
ORF3: 6707..7513
LOCUS OR844370 7559 bp RNA linear VRL 29-NOV-2023
DEFINITION Norovirus GII isolate NGII.4/12241/Kyoto/Oct/2013-2014, complete
genome.
ACCESSION OR844370
VERSION OR844370.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7559)
AUTHORS Hoque,S.A., Pham,N., Okitsu,S., Shimizu,Y.O. and Ushijima,H.
TITLE Direct Submission
JOURNAL Submitted (24-NOV-2023) Department of Pathology and Microbiology,
Nihon University Graduate School of Medicine, Itabashi-ku,
Itabashi-Ku, Itabashi 173-8610, Japan
COMMENT ##Assembly-Data-START##
Assembly Method :: SOAPdenovo v. May-2022
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7559
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="NGII.4/12241/Kyoto/Oct/2013-2014"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Japan"
/collection_date="23-Oct-2013"
/note="genotype: GII.4"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="WPK51405.1"
/translation="MKMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARP
KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE
ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
LAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL
AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM
SDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTSNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGC
LKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRRQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSRISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 5..994
/gene="ORF1"
/product="p48"
mat_peptide 995..2092
/gene="ORF1"
/product="NTPase"
mat_peptide 2093..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6707
/gene="ORF2"
CDS 5085..6707
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="WPK51406.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSRNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDRDFEANQNTKFTPVGVIQDG
STAHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6707..7513
/gene="ORF3"
CDS 6707..7513
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="WPK51407.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVRQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS
NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN
1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgccaata gcaacaacga
61 catcgcaaaa tcttcaagtg acggtgtgtt ttctaacatg gctgtcactt ttaagcgggc
121 cctcggggcg cggcctaaac agccgccccc gaaggaaata ccacccagac ccccgcgacc
181 acccacacca gaattggtca aaaagatccc tcctccccca cccaacgggg aggatgaact
241 agtggtctct tacagcgcca aagatggcgt ttccggactg cctgagctca ccactgtcag
301 acaaccggaa gaaaccaaca cggcgttcag tgtcccccca ctcaaccaaa gggagagcag
361 ggacgccaag gagccactaa ctggaacaat cattgaaatg tgggatggag aaatctacca
421 ttacggcctg tacgtggaac gaggtcttat acttggtgtg cacaagccac cggcagccat
481 tagccttgcc aaggtcgagc tagcaccgct ctccttgttc tggagacctg tatacacccc
541 ccagtatctc atctctccag acactcttag gagattacat ggagagtcat tcccctacac
601 tgcatttgac aacaattgct acgccttttg ttgttgggta ttagacctaa acgactcatg
661 gctaagcagg agaatgattc agagaacaac aggcttcttc aggccgtacc aggattggaa
721 caggaaaccc ctccccacta tggatgattc caaattaaag aaggtagcca acatattctt
781 gtgcactttg tcctcactat tcaccagacc cattaaggac ataataggga agttgaaacc
841 tcttaacatc cttaacattc tggctacatg tgattggacc ttcgcaggca tagtggaatc
901 cttaatactc ttggcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat
961 gatcgccccc ttgctaggtg attatgaact gcaaggacct gaggaccttg cagtggaact
1021 ggtcccaata gtgatggggg ggataggttt ggtgctagga tttaccaaag agaaaatcgg
1081 aaagatgcta tcatccgctg catccacttt aagagcttgt aaagaccttg gtgcatacgg
1141 actggaaatc ttaaaattgg tcatgaagtg gttcttccca aagaaagagg aagcaaatga
1201 actggctatg gtgagatcca tcgaggatgc agtactagac ctcgaggcaa ttgaaaacaa
1261 ccacatgacc accctactca aagacaaaga cagcttggca acctacatga gaacccttga
1321 ccttgaggag gagaaagcca gaaaactctc aaccaaatct gcttcacccg atattgtggg
1381 cacaatcaac tctcttctgg caagaatcgc tgctgcacgc tccctagtgc atcgggcgaa
1441 agaagagctt tccagcaggc cgagacctgt cgttgtgatg atatcgggaa gaccagggat
1501 agggaaaact caccttgcca gggagctggc caagaagatc gcggcctccc tcacagggga
1561 ccagcgtgtg ggtcttatcc cacgcaatgg tgtcgaccac tgggacgcat acaagggcga
1621 aagagttgtc ctatgggacg actatggaat gagcaacccc atccatgatg ccctcaggtt
1681 gcaggagctt gctgacactt gccccctcac gctaaattgt gacagaattg agaacaaagg
1741 gaaagtcttt gacagtgatg ccataattat caccaccaat ctggccaacc cagcaccact
1801 ggattatgtc aattttgaag cgtgctcgag acgcattgat ttcctcgtgt acgcagaagc
1861 ccctgaggtg gagaaggcga agcgcgactt cccaggtcaa cctgacatgt ggaagaacgc
1921 tttcagtcct gacttctcac acataaaact gtcattggct ccacagggtg gttttgacaa
1981 gaacggcaac accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat
2041 cgcccgagca tcagggttac tccatgagag gctagatgaa tatgaactgc aaggcccagc
2101 cctcaccacc ttcaactttg accgcaacaa gatacttgct tttagacagc ttgctgctga
2161 aaacaagtat gggctgatgg acacaatgag agttggaaaa cagctcaagg atgtcaagac
2221 catgtcagac ctcaaacaag cactcaagaa catcgcgatc aagaagtgcc agatagtgta
2281 caatggtggc acctacacac ttgaggctga tggcaagggt agtgtgaaag ttgacaaagt
2341 gcaaagtgcc actgtgcaga ccagcaatga actagccggt gccctacacc acctaaggtg
2401 cgctagaatc agatactatg ttaagtgcgt ccaggaggca ctgtattcca tcatccaaat
2461 cgctggggct gcattcgtca ccacgcgcat cgctaagcgc atgaatatac agaatctctg
2521 gtccaagcca caggtggaag acacagaaga gatggccaac aaagatggtt gcctaaaacc
2581 caaagatgat gaagagtttg tcgtctcatc cgacgacatc aaaactgagg gcaagaaagg
2641 gaagaacaag tccggccgtg gcaagaagca cacagccttt tcaagtaaag ggctcagtga
2701 tgaggagtac gatgagtaca agagaatcag agaagaaagg aatggtaagt actccataga
2761 agagtacctt caggacagag acaggtacta cgaggaggtg gccattgcca gggcaaccga
2821 agaggacttc tgtgaagaag aagaggccaa aatccggcag agaattttca gaccaacaag
2881 gagacaacgc aaagaagaga gggcttctct cggcttggtc acaggctctg aaatcaggaa
2941 gagaaaccca gaagacttca aacccaaggg aaagctgtgg gctgatgatg acagaagtgt
3001 tgactacaat gagaaactca actttgaggc cccaccaagc atctggtcgc ggatagtcaa
3061 ctttggttca ggctggggct tctgggtctc ccccagtctg tttataacat caacccatgt
3121 cataccccaa ggtgcaaaag agttcttcgg agtccctatc aagcaaatcc agatacacaa
3181 gtcaggtgaa ttctgccggt tgagattccc aaagccaatc agaactgatg tgacgggcat
3241 gattctagaa gaaggtgcgc ccgaggggac cgtggccaca ctgctcatca agagaccaac
3301 tggagagctc atgcctctgg cagccagaat ggggacccat gcaaccatga aaattcaggg
3361 gcgcacagtt ggagggcaaa tgggtatgct cctgacagga tccaacgcca agagtatgga
3421 cctaggcaca acaccaggcg attgcggctg cccctacatc tacaagaggg ggaatgacta
3481 cgtggtcata ggagtccata cggccgctgc ccgtggagga aacactgtca tatgtgccac
3541 ccaggggagt gagggagaag ccacacttga aggaggtgac agtaaaggga catactgtgg
3601 cgcaccaatc ttgggcccag ggagcgctcc gaagctcagt accaagacta agttttggag
3661 atcatccaca acaccactcc cacctggcac ctacgaacca gcctacctcg gtggcaaaga
3721 ccctagagtc aaaggtggcc cttcattgca acaagttatg agggaccagc tgaagccatt
3781 cacagaaccc agaggcaaac caccaagacc aaatgtgttg gaagctgcca agaaaaccat
3841 catcaatgtc cttgagcaaa caattgatcc accccaaaaa tggtcatttg cgcaagcttg
3901 cgcatccctt gacaaaacca cctccagcgg ccacccgcac cacatgcgga aaaacgactg
3961 ttggaatggg gagtccttca caggaaaatt ggctgatcaa gcctccaagg ccaacctaat
4021 gtttgaagag ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttggt
4081 aaagaccgat aaagtttatg gtaaggtcaa gaagaggctt ctgtggggtt cagatctggc
4141 gaccatgata cggtgcgccc gagcttttgg aggccttatg gatgaactca aggcacactg
4201 tgtcacactt cctgtcagag ttggcatgaa catgaatgag gatggcccca tcatctttga
4261 gaagcactcc agatatagat atcactatga tgctgattat tcccggtggg actcaacaca
4321 acaaagggat gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca
4381 cctggcccag atagttgcag aagacctcct ttcccctagc gtgatggatg taggtgactt
4441 tcaaatatca ataagtgagg gtctcccctc tggggtacct tgtacctccc agtggaactc
4501 catcgcccac tggctcctca ctctgtgtgc actctctgaa gtcacggacc tgtcccctga
4561 tatcattcag gccaactccc ttttctcctt ctacggtgat gatgagattg taagcacaga
4621 cataaagttg gacccagaga agctgacagc aaagctcaag gagtacgggc tgaaaccaac
4681 ccgccccgac aaaactgaag gaccccttgt tatctctgaa gacctggatg gcctgacatt
4741 cctccggaga actgtgaccc gtgatccagc tggctggttt ggaaaattgg aacaaagttc
4801 aattctcagg caaatgtact ggaccagggg tcccaaccat gaagacccat ttgaaacaat
4861 gataccacac tcccaaagac ccatacaatt gatgtccttg ctgggcgagg ctgcactcca
4921 cggcccggca ttctatagca gaattagcaa attagtcatt gcagagttga aggaaggtgg
4981 catggatttt tacgtaccca gacaagagcc aatgttcaga tggatgagat tctcagatct
5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga
5101 gtgacgccaa cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg
5161 ttatggctct ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa
5221 atgtaattga cccctggatt agaaataatt ttgtacaagc ccctggtgga gagtttacag
5281 tatcccctag aaacgctcca ggtgaaatac tatggagcgc gcccttgggc cctgatctaa
5341 atccctacct atcccatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc
5401 aggtaattct cgcggggaac gcgttcaccg ccgggaaggt catatttgca gcagtcccac
5461 caaattttcc aactgaaggc ttgagcccca gccaggtcac tatgttcccc catatagtag
5521 tagatgttag gcaactagaa cctgtgttga ttcccttacc cgatgttagg aataatttct
5581 atcattacaa tcaatcaaat gaccccacca ttaagttgat agcaatgttg tatacaccac
5641 ttagggctaa taatgctggg gatgatgtct tcacagtttc ttgccgagtt ctcacgagac
5701 catcccccga ttttgatttc atatttctag tgccacccac agttgagtca agaaccaaac
5761 cattctctgt cccagtttta actgttgagg agatgaccaa ttcaagattc cccattcctt
5821 tggaaaagtt gttcacgggt cccagcagtg cctttgttgt ccaaccacaa aacggtaggt
5881 gcacgactga tggcgtgctc ctaggcacca cccaactgtc tcctgtcaac atctgcacct
5941 tcagaggaga tgtcacccat atcacaggta gtcgtaacta cacaatgaat ttggcttctc
6001 aaaattggag caattatgac ccaacagaag aaatcccagc ccctctagga actccagatt
6061 ttgtggggaa gattcaaggc gtgctcaccc aaaccacaag gacagatggc tcaacacgcg
6121 gccacaaagc cacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc
6181 aatttgaaac tgacacagac cgtgattttg aagctaacca aaacacaaag ttcaccccag
6241 ttggtgtcat ccaagatggt agcaccgccc accgaaatga accccaacag tgggtgctcc
6301 caagttactc aggcagaaat actcctaatg tgcatctggc ccccgctgta gcccccactt
6361 ttccgggtga gcaacttctc ttcttcagat ccaccatgcc cggatgcagc gggtacccca
6421 acatggattt ggactgtctg ctcccccagg aatgggtgca gtacttctac caagaggcag
6481 ccccagcaca atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggttt
6541 tgtttgagtg taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg
6601 atttggttat cccccccaat ggttatttta ggtttgattc ctgggtcaac cagttttaca
6661 cgcttgcccc catgggaaat ggaacggggc gtagacgtgc actataatgg ctggagcttt
6721 ctttgctgga ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg
6781 ggctggggcc atcaaccaaa aagttgagtt tgaaaataac agaaaactgc aacaagcatc
6841 cttccaattt agcagcaatc tacaacaggc ttcctttcaa catgacaaag agatgctcca
6901 agcacaaatt gaggccacca aaaggctaca acaggaaatg atgaaagtta ggcaggcaat
6961 gctcctagag ggtgggttct ctgagacaga tgcagcccgc ggggcaatca acgcccccat
7021 gacaaaagct ttggactgga gcgggacaag gtactgggct cccgatgcta ggactacaac
7081 atacaatgca ggccgctttt ccacccctca accatcgggg gcactgccag gaagagctaa
7141 tcttagggat gctgtccctg ctcggggttc ctccagtaag tcttctaatt cttctactgc
7201 tacttctgtg tactcaaatc aaactacttc aacgagactt ggttctacag ctggttctgg
7261 taccagtgtc tcgagcttcc cgtcaactgc aaggactagg agctgggttg aggatcaaag
7321 taggaatttg tcacctttca tgaggggggc ccacaacata tcgtttgtca ccccaccatc
7381 tagcagatcc tctagccaag gcacagtctc aaccgtgcct aaagagattt tggactcctg
7441 gactggcgct ttcaacacgc gcaggcagcc actcttcgct cacattcgta agcgagggga
7501 gtcacgggcg taatgtgaaa agacaaaatc gattatcttt cttttcttta gtgtctttt
//