![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| MW661251 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P31 |
ORF1: 4..5103
ORF2: 5084..6706
ORF3: 6706..7512
LOCUS MW661251 7608 bp RNA linear VRL 28-SEP-2021
DEFINITION Norovirus GII isolate BMH15-059 nonstructural polyprotein (ORF1),
VP1 (ORF2), and VP2 (ORF3) genes, complete cds.
ACCESSION MW661251
VERSION MW661251.1
DBLINK BioProject: PRJNA396739
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7608)
AUTHORS Nasheri,N., Flint,A., Reaume,S., Harlow,J., Hoover,E. and
Weedmark,K.
TITLE Genomic Analysis of Human Noroviruses Using Hybrid
Illumina-Nanopore Data
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7608)
AUTHORS Nasheri,N., Flint,A., Reaume,S., Harlow,J., Hoover,E. and
Weedmark,K.
TITLE Direct Submission
JOURNAL Submitted (23-FEB-2021) Bureau of Microbial Hazards, Health Canada,
251 SIR FREDERICK BANTING, Ottawa ON, Canada
COMMENT ##Assembly-Data-START##
Assembly Method :: Medaka v. v1.1.3; Pilon v. v1.23
Sequencing Technology :: Illumina; Nanopore
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7608
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="BMH15-059"
/isolation_source="feces"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Canada: Ottawa"
/collection_date="2015-03-11"
/note="Viral RNA extracted from fecal samples;
genotype: GII.4_Sydney_2012_GII.P31(GII.Pe)"
gene 4..5103
/gene="ORF1"
CDS 4..5103
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="QSD58372.1"
/translation="MKMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARP
KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE
ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
LAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGETFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEEANELAIVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL
AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM
SDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGC
LKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRNDVAGMILEEGAPEG
TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRFHYDADYSRWD
STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 4..993
/gene="ORF1"
/product="p48"
mat_peptide 994..2091
/gene="ORF1"
/product="NTPase"
mat_peptide 2092..2628
/gene="ORF1"
/product="p22"
mat_peptide 2629..3027
/gene="ORF1"
/product="VPg"
mat_peptide 3028..3570
/gene="ORF1"
/product="Pro"
mat_peptide 3571..5100
/gene="ORF1"
/product="RdRp"
gene 5084..6706
/gene="ORF2"
CDS 5084..6706
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QSD58373.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
STTHQNEPQQWMLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
gene 6706..7512
/gene="ORF3"
CDS 6706..7512
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QSD58374.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQVEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRTNLRDAVPARGSSSKSP
NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN
1 tgaatgaaga tggcgtctaa cgacgcttcc gctgccgctg ttgccaacag caacaacgac
61 atcgcaaaat cttcaagtga cggtgtgttt tctaacatgg ctgtcacttt taagcgggcc
121 ctcggggcgc ggcctaaaca gccgcccccg aaggagatac cacccagacc cccgcgacca
181 cccacaccag aattggtcaa aaagatccct cctcccccac ccaacgggga ggatgaacta
241 gtggtctctt acagcgccaa agatggtgtt tccggactgc ctgagctcac cactgtcaga
301 caaccggaag aaaccaacac ggcgttcagt gtccccccac tcaaccaaag ggagagcagg
361 gacgccaagg agccactaac tggaacaatt attgaaatgt gggatggaga aatctaccac
421 tacggcctgt acgtggaacg aggtcttata cttggtgtgc acaaaccacc ggcagccatc
481 agccttgcca aggtcgagct ggcaccgctc tctttgtttt ggagacctgt gtacaccccc
541 cagtatctca tctctccaga cactcttagg agattacatg gagagacatt cccctacact
601 gcatttgaca acaattgcta cgccttttgt tgttgggtat tagacctgaa cgactcatgg
661 ctaagcagga gaatgattca gagaacaaca ggtttcttca ggccgtacca ggattggaac
721 aggaaacccc tccccactat ggatgattcc aaattaaaga aggtagccaa catattcttg
781 tgcactttgt cttcactatt caccagaccc attaaggaca taatagggaa gttgaaacct
841 cttaacatcc ttaacattct ggctacatgt gattggacct tcgcaggcat agtggaatcc
901 ttaatactct tggcagaact ctttggagtt ttctggacac ccccagatgt gtctgcgatg
961 atcgccccct tgctaggtga ttatgaactg caaggacctg aggaccttgc agtggaactg
1021 gtcccaatag tgatgggggg gataggtttg gtgctaggat ttaccaaaga gaaaatcgga
1081 aagatgctat catccgctgc atccacttta agagcttgta aagaccttgg tgcatacgga
1141 ttggaaatct taaaattggt catgaagtgg tttttcccaa agaaagagga agcaaatgag
1201 ttggctatag tgagatccat cgaggatgca gtactagacc tcgaggcaat tgaaaacaac
1261 cacatgacca ccctactcaa ggacaaagac agcttggcaa cctacatgag aacccttgac
1321 cttgaggagg agaaagccag aaaactctca accaaatctg cttcacccga tattgtgggc
1381 acaatcaact ctctcctggc aagaatcgct gctgcacgct ccctagtgca tcgggcgaaa
1441 gaagagctct ccagcaggcc tagacctgtc gttgtgatga tatcgggaag accagggata
1501 gggaaaactc accttgccag ggagctggcc aagaagatcg cggcctccct cacaggggac
1561 cagcgtgtgg gccttatccc acgcaatggt gtcgaccact gggacgcata caagggcgaa
1621 agagttgtcc tatgggacga ctatggaatg agcaacccca tccacgatgc tctcaggttg
1681 caggagcttg ctgacacttg ccccctcacg ctaaattgtg acagaattga gaacaaaggg
1741 aaagtctttg acagtgatgc cataattatc accaccaacc tggccaaccc agcaccacta
1801 gattatgtca actttgaagc gtgctcgaga cgcattgact tcctcgtgta cgcagaagcc
1861 cctgaggtag agaaggcaaa gcgcgacttc ccaggccaac ctgacatgtg gaagaacgct
1921 ttcagtcctg acttctcaca cataaaactt tcattggctc cacagggtgg ttttgacaag
1981 aacggcaaca ccccgcatgg aaaaggagtc atgaagaccc tcaccactgg ctccctcatc
2041 gcccgagcat cagggttact ccatgagagg ctagatgaat atgaactgca aggcccagcc
2101 ctcaccactt tcaactttga ccgcaacaag atacttgctt ttagacaact tgctgctgaa
2161 aacaagtatg ggttgatgga cacaatgaga gttggaaaac agcttaagga tgtcaagacc
2221 atgtcagacc tcaaacaagc actcaagaac atcgcgatca agaagtgcca gatagtgtac
2281 aatggtggca cctacacact tgaggccgat ggcaagggta gtgtaaaagt tgacaaagtg
2341 caaagtgcca ctgtgcagac caacaatgaa ctagccggtg ccctacacca cctaaggtgc
2401 gctagaatca gatattatgt taagtgcgtc caggaggcac tgtattccat catccaaatc
2461 gctggggctg cattcgtcac cacgcgcatc gccaagcgca tgaatataca gaatctctgg
2521 tccaagccac aggtggaaga cacagaagag atggccaaca aagatggttg cctaaaaccc
2581 aaagatgatg aagagtttgt cgtctcatcc gacgacatca aaactgaggg caagaaaggg
2641 aagaacaagt ccggccgtgg caagaagcac acagcctttt caagcaaagg gctcagtgat
2701 gaggagtacg atgagtacaa gagaatcaga gaagaaagga atggcaagta ctccatagaa
2761 gagtaccttc aggacagaga caggtactac gaggaggtgg ccattgccag ggcaaccgaa
2821 gaggacttct gtgaagaaga agaggccaaa atccggcaga gaattttcag accaacaagg
2881 aaacaacgca aagaagagag ggcctctctc ggcttggtca caggctctga aatcaggaag
2941 agaaacccag aagacttcaa acccaaggga aagctgtggg ctgacgatga cagaagtgtt
3001 gactacaatg agaaactcaa ctttgaggca ccaccaagca tctggtcacg gatagtcaac
3061 tttggttcag gctggggctt ctgggtctcc cccagtctgt ttataacatc aacccatgtc
3121 ataccccaag gtgcaaaaga gttcttcgga gtccctatca agcaaatcca gatacacaaa
3181 tcaggtgaat tctgccggtt gagattccca aagccaatca gaaatgatgt ggcgggtatg
3241 attctagaag aaggtgcgcc cgaggggacc gtggccacat tactcatcaa gagaccaact
3301 ggagagctca tgcctctggc agccagaatg gggacccatg cgaccatgaa aattcagggg
3361 cgcacagttg gagggcaaat gggtatgctc ctgacaggat ccaacgccaa gagtatggac
3421 ctaggcacaa caccaggcga ctgcggctgc ccctacatct ataagagggg gaatgactac
3481 gtggtcatag gggtccatac ggccgctgcc cgtgggggaa acactgtcat atgtgccacc
3541 caggggagtg agggagaagc cacacttgaa ggaggtgaca gtaaagggac atactgtggc
3601 gcaccaatct tgggaccagg gagtgctccg aagctcagta ccaagactaa gttttggaga
3661 tcatccacaa caccactccc acctggcacc tacgaaccag cctacctcgg tggcaaagac
3721 cctagagtca aaggtggccc ttcattgcaa caagttatga gggaccagct gaagccattc
3781 acagaaccca gaggcaaacc accaagacca aatgtgttgg aagctgccaa gaaaaccatc
3841 atcaatgttc ttgagcaaac aattgatcca ccccaaaaat ggtcatttgc gcaagcttgc
3901 gcatcccttg acaaaaccac ctccagtggc cacccgcacc acatgcggaa aaacgactgt
3961 tggaatgggg agtccttcac aggaaaattg gctgatcaag cctccaaagc caacttaatg
4021 tttgaagagg gaaagaacat gactccagtc tacacaggtg cacttaaaga tgagttggta
4081 aagaccgata aagtttatgg taaggtcaag aagagacttc tgtggggttc agatctggcg
4141 accatgatac ggtgcgcccg agcttttgga ggccttatgg atgaactcaa ggcgcactgt
4201 gtcacacttc ctgtcagagt tggtatgaac atgaatgagg atggccccat catctttgag
4261 aagcactcca gatatagatt tcactatgat gctgattatt cccggtggga ctcaacacaa
4321 caaagggatg tgctagcagc agcactagaa atcatggtta agttttctcc agaaccacac
4381 ctggcccaga tagttgcaga agacctcctt tcccctagcg taatggatgt aggtgacttt
4441 caaatatcaa taagtgaggg tctcccctct ggggtacctt gtacctccca gtggaattcc
4501 atcgcccact ggctcctcac tctgtgtgca ctctctgaag tcacggacct gtcccctgac
4561 atcattcagg ccaactccct tttctccttc tatggtgatg atgagattgt aagcacagac
4621 ataaagttgg acccagagaa gctgacagcg aaactcaagg agtacgggct aaaaccaacc
4681 cgccccgaca aaactgaagg accgcttgtt atctctgaag acctggatgg cctgacattc
4741 ctccggagaa ctgtgacccg tgatccagct ggctggtttg gaaaactgga acaaagttca
4801 attctcaggc aaatgtactg gactaggggc cccaaccatg aagatccatt tgaaacaatg
4861 ataccacact cccaaagacc catacaattg atgtccttgc tgggcgaggc tgcactccac
4921 ggcccggcat tctatagcaa aattagcaaa ttagtcattg cagagttgaa ggaaggtggc
4981 atggattttt acgtgcccag acaagagcca atgttcagat ggatgagatt ctcagatctg
5041 agcacgtggg agggcgatcg caatctggct cccagttttg tgaatgaaga tggcgtcgag
5101 tgacgccaac ccatctgatg ggtccgcagc caacctcgtc ccagaggtca acaatgaggt
5161 tatggctctg gagcccgttg ttggtgccgc cattgcggca cccgtagcgg gccaacaaaa
5221 tgtaattgac ccctggatta gaaataattt tgtgcaagcc cctggtggag agtttacagt
5281 gtcccctaga aacgctccag gtgaaatact atggagcgcg cccttgggtc ctgatctaaa
5341 tccctaccta tcccatttgg ccagaatgta caatggttat gcaggtggtt ttgaagtgca
5401 ggtaattctc gcggggaacg cgttcaccgc cgggaaggtc atatttgcag cagtcccacc
5461 aaattttcca actgaaggct tgagccccag ccaggtcact atgttccccc atatagtagt
5521 agatgttagg caactagaac ctgtgttgat tcccttaccc gatgtcagga ataattttta
5581 tcattacaat caatcaaatg accccaccat taagttgata gcaatgttgt atacaccact
5641 tagggctaat aatgctgggg acgatgtttt cacagtttct tgccgagttc tcacgagacc
5701 atcccctgat tttgatttca tatttctagt gccacccaca gttgagtcaa gaactaaacc
5761 tttctctgtc ccagttttaa ctgttgagga gatgaccaat tcaaggttcc ccattccttt
5821 ggaaaagttg ttcacgggtc ccagcagtgc ctttgttgtc caaccacaaa acggcaggtg
5881 cacgactgat ggcgtgctcc taggcactac ccaactgtct cctgtcaaca tctgcacctt
5941 cagaggagat gtcacccata tcacaggtag tcgtaactac acaatgaatt tggcttctca
6001 aaactggaac aattatgacc caacagaaga aatcccagcc cccctaggaa ctccagattt
6061 tgtggggaag attcaaggcg tgctcaccca aaccacaagg acagatggct caacacgcgg
6121 ccacaaagct acagtgtaca ctgggagcgc cgactttgcc ccaaaactgg gtagagttca
6181 atttgaaact gacacagacc atgactttga agctaaccaa aacacaaagt tcaccccagt
6241 tggtgttatc caagatggta gcaccaccca ccaaaatgaa ccccaacagt ggatgctccc
6301 aagttactca ggtagaaata ctcataatgt gcatctggcc cccgctgtag cccccacttt
6361 tccgggtgag caacttctct tcttcagatc caccatgccc ggatgcagcg ggtaccccaa
6421 catggattta gactgtctgc ttccccagga atgggtgcag tacttctacc aagaggcagc
6481 cccagcacaa tctgatgtgg ctctgctaag atttgtgaat ccagacacag gtagggttct
6541 gtttgagtgt aagcttcata aatcaggcta tgttacagtg gctcacactg gccaacatga
6601 tttggttatc ccccccaatg gttattttag gtttgattcc tgggtcaacc agttttacac
6661 gcttgccccc atgggaaatg gaacggggcg cagacgtgca gtataatggc tggagctttc
6721 tttgctggat tggcatctga tgtccttggc tctggactcg gttctcttat caatgctggg
6781 gctggggcca tcaaccaaaa agttgagttt gaaaataaca gaaaattgca acaagcatcc
6841 ttccaattta gcagcaatct acaacaggct tccttccaac atgacaaaga gatgctccaa
6901 gcacaagttg aggccaccaa aaagttacaa caggaaatga tgaaagttaa gcaggcaatg
6961 ctcctagagg gtgggttctc tgagacagat gcagcccgcg gggcaatcaa cgcccccatg
7021 acaaaagctt tggactggag cgggacaagg tactgggctc ccgatgctag gactacaaca
7081 tacaatgcag gccgcttttc cacccctcaa ccatcggggg cactgccagg aagaactaat
7141 cttagggatg ctgtccctgc tcggggttcc tccagcaaat ctcctaattc ttccactgct
7201 acttctgtgt actcaaatca aactacttca acgagacttg gttctacagc tggttctggt
7261 accagtgtct cgagcttccc gtcgactgcg aggactagga gttgggttga ggatcaaagc
7321 aggaatttgt cacctttcat gaggggggcc cacaacatat cgtttgtcac cccaccatct
7381 agcagatcct ctagccaagg cacagtctca accgtgccta aagaggtttt ggactcctgg
7441 actggcgctt tcaacacgcg caggcagcca ctcttcgctc acattcgtaa gcgaggggag
7501 tcacgggcgt aatgtgaaaa gacaaaattg actatctctt tctttttctt tagtgtcttt
7561 ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//