Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| MW661247 | GII.4 Den Haag | GII.P4 Den Haag |

ORF1: 4..5103
ORF2: 5084..6706
ORF3: 6706..7512
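
The ORF coordinates above are 1-based and inclusive, as in the GenBank record that follows. As a minimal sketch of the arithmetic they imply, the Python snippet below computes each ORF's length, its reading frame relative to position 1, and the overlap between neighbouring ORFs. The names `ORFS`, `orf_length`, `overlap`, and `frame` are hypothetical helpers introduced for illustration; they are not part of the typing tool or of any library. The amino-acid count assumes each CDS interval includes its stop codon.

```python
# ORF coordinates copied from the summary above (1-based, inclusive).
ORFS = {"ORF1": (4, 5103), "ORF2": (5084, 6706), "ORF3": (6706, 7512)}


def orf_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based inclusive interval."""
    return end - start + 1


def overlap(a: tuple, b: tuple) -> int:
    """Number of nucleotides shared by two 1-based inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def frame(start: int) -> int:
    """Reading frame (0, 1, or 2) relative to genome position 1."""
    return (start - 1) % 3


for name, (start, end) in ORFS.items():
    nt = orf_length(start, end)
    # Assumption: the interval ends with a stop codon, so subtract one codon.
    aa = nt // 3 - 1
    print(f"{name}: {nt} nt, {aa} aa, frame {frame(start)}")

print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

With these coordinates, ORF1 and ORF2 share 20 nt and ORF2 and ORF3 share a single nucleotide; ORF2 sits in a different frame from ORF1 and ORF3.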
LOCUS MW661247 7580 bp RNA linear VRL 28-SEP-2021
DEFINITION Norovirus GII isolate BMH12-030 nonstructural polyprotein (ORF1),
VP1 (ORF2), and VP2 (ORF3) genes, complete cds.
ACCESSION MW661247
VERSION MW661247.1
DBLINK BioProject: PRJNA396739
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7580)
AUTHORS Nasheri,N., Flint,A., Reaume,S., Harlow,J., Hoover,E. and
Weedmark,K.
TITLE Genomic Analysis of Human Noroviruses Using Hybrid
Illumina-Nanopore Data
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7580)
AUTHORS Nasheri,N., Flint,A., Reaume,S., Harlow,J., Hoover,E. and
Weedmark,K.
TITLE Direct Submission
JOURNAL Submitted (23-FEB-2021) Bureau of Microbial Hazards, Health Canada,
251 SIR FREDERICK BANTING, Ottawa ON, Canada
COMMENT ##Assembly-Data-START##
Assembly Method :: Medaka v. v1.1.3; Pilon v. v1.23
Sequencing Technology :: Illumina; Nanopore
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7580
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="BMH12-030"
/isolation_source="feces"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Canada: Ottawa"
/collection_date="2012-02-08"
/note="Viral RNA extracted from fecal samples;
genotype: GII.4_Den_Haag_2006b_GII.P4"
gene 4..5103
/gene="ORF1"
CDS 4..5103
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="QSD58360.1"
/translation="MKMASNDASAAAVANSNNDTAKSSNDKMFSNMAVTLKRALGARP
KQPPPREIPQRPPRPPTPELIKKIPPPPPNGEDEVVVSYSAKDGVSGLPDLSTVRQPE
ETNTAFSVPPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAIS
LARVELTPLSLFWRPVYTPQYLISPDTLKKLHGETFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCALSSLFTRPIKDIIGR
LRPLNIINILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDFELQGPED
LVVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREV
AKRIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEFELQGPALTTFNFDRNKVLAFRQLAAENKYGLMDTMKIGRQLKDVKTM
PELKQALNNISIKKCQIVYSGCTYTLESDGKGNVKVDRVQSTSIQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIIKRVNIQDLWSKPQVENTEEATSKDGC
PKPKDDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
TVVTLLIKRSTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGGEGEATLEGGDSKGTYCGAPILG
PGSAPKLSTKTKFWRSSTAPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPKPSVLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
STQQRAVLAAALEIMVKFSSEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTNLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPSESMIPHSQRPIQLMSLLGEAALHGPTFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 4..993
/gene="ORF1"
/product="p48"
mat_peptide 994..2091
/gene="ORF1"
/product="NTPase"
mat_peptide 2092..2628
/gene="ORF1"
/product="p22"
mat_peptide 2629..3027
/gene="ORF1"
/product="VPg"
mat_peptide 3028..3570
/gene="ORF1"
/product="Pro"
mat_peptide 3571..5100
/gene="ORF1"
/product="RdRp"
gene 5084..6706
/gene="ORF2"
CDS 5084..6706
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QSD58361.1"
/translation="MKMASNDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRSNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFTVPILTVEEMTNSRFPIPLEKLFTGPSSAFIVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHIAGSHNYTMNLASLNWNNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTKGDGSTRGHKATVYTGSADFTPKLGSVRFSTDTDNDFETHQNTRFTPVGVIQDG
GTTHRNEPQQWVLPSYSGRNVHNVHLAPAVAPNFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6706..7512
/gene="ORF3"
CDS 6706..7512
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QSD58362.1"
/translation="MAGTFFAGLASDVLSSGLGSLINAGAGAINQRIDFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQELMKVKQAILLEGGFSETDAARGAIN
APMTKTLDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRINPRTPIPARGSPSMSS
NVSTATSIHSNQTASTRLGSTAGSGTNVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKDVLDSWTGAFNTRRQPLFAHIRRRGESRV"
ORIGIN
1 tgaatgaaga tggcgtctaa cgacgcttcc gctgccgctg ttgctaacag caacaacgac
61 accgcaaaat cttcaaatga caaaatgttt tctaacatgg ctgtcactct taaacgagcc
121 ctcggggcgc ggcctaaaca gccccccccg agggaaatac cacaaagacc cccacgacca
181 cctactccag aactgatcaa aaagatccct cctcccccgc ccaacggaga ggatgaagtg
241 gtggtttcct atagtgccaa agatggcgtt tccggtttgc ctgatctttc caccgtcagg
301 caaccggaag aaaccaatac ggccttcagt gtccctccac taaatcagag ggagaatagg
361 gatgctaagg aaccactgac tggaacaatt ctggaaatgt gggatggaga aatctaccat
421 tatggcctgt acgttgagcg aggtcttgtg ctgggtgtac acaaaccacc agctgccatt
481 agcctcgcta gggtcgaatt aacaccactc tccttgttct ggagacctgt gtacactcct
541 cagtacctca tctctccaga cactctcaag aagttacacg gagaaacatt tccctacaca
601 gcctttgaca acaactgcta tgccttttgt tgttgggtcc tggatctaaa cgactcgtgg
661 ctgagcagga gaatgatcca gagaacaact ggcttcttca gaccctacca agattggaat
721 aggaaacccc tccccactat ggatgattcc aaattaaaga aggtagctaa catattcctg
781 tgtgccctgt cttcgctatt caccaggccc ataaaagaca taataggaag gttaaggcct
841 cttaacatca tcaacatcct ggcttcatgt gattggactt tcgcaggcat agtggagtcc
901 ttgatactct tggcagagct ctttggagtc ttctggacac ccccagatgt gtctgcgatg
961 attgccccct tactcggtga tttcgagtta caaggacctg aagaccttgt agtggagctc
1021 gtccctgtag taatgggggg gattggtctg gtgctaggat tcaccaaaga gaagattgga
1081 aaaatgttgt catctgctgc atccaccttg agggcttgta aagaccttgg tgcatatggg
1141 ctagaaatcc taaagttagt catgaagtgg ttcttcccga agaaagagga agctaatgaa
1201 ctggctatgg tgagatccat cgaggatgcg gtactggacc ttgaggcaat tgaaaacaat
1261 catatgacca ctttgctcaa agacaaagat agcctggcaa cctacatgag aacccttgac
1321 ctcgaggaag aaaaagccag aaaactctca accaagtctg cttcacctga catcgtgggc
1381 acaatcaacg cgcttctggc gagaatcgcc gctgcacgct ccctggtgca ccgagcgaag
1441 gaggagcttt ccagcagacc aagacctgta gtcttgatga tatcaggcag gccaggaata
1501 gggaagaccc accttgctag ggaagtggct aagagaatcg cagcctccct cacaggagac
1561 cagcgtgtag gcctcatccc acgcaatggc gtcgaccact gggatgcgta caagggggag
1621 agggtcgtcc tatgggacga ttatggaatg agcaatccca tccacgacgc cctcaggctg
1681 caagaactcg ctgacacttg ccccctcacc ctaaattgtg acaggattga gaataaagga
1741 aaggtctttg acagcgatgt catcattatc actactaatc tggccaaccc agctccactg
1801 gactatgtca actttgaagc atgctcgagg cgcatcgatt tcctcgtgta cgcagaagcc
1861 cccgaggtcg agaaggcgaa gcgcgacttc ccgggccaac ctgacatgtg gaaaaacgct
1921 tttagttctg atttctcaca cataaaactg gcactggctc cacaaggtgg ctttgataag
1981 aacgggaaca ccccacacgg gaagggcgtc atgaagactc tcaccactgg ctccctcatt
2041 gcccgggcat cagggctgct ccacgagaga ttggatgagt ttgagctgca gggcccagct
2101 ctcaccacct tcaactttga ccgcaacaaa gtgcttgcct tcaggcagct tgctgctgaa
2161 aacaaatatg ggttgatgga cacaatgaaa attgggaggc agctcaagga tgtcaaaacc
2221 atgccagaac ttaaacaagc actcaataat atctcaatca agaagtgcca gattgtgtac
2281 agtggttgca cctacacact tgagtctgat ggcaaaggca atgtgaaagt tgacagagtc
2341 cagagtacct ccattcagac taacaatgag ttggctggcg ccctgcacca tctaaggtgc
2401 gccagaatca ggtattatgt taagtgtgtt caggaggccc tgtactctat catccagatt
2461 gctggggctg catttgtcac cacgcgcatc atcaagcgtg tgaacattca agacttatgg
2521 tccaagccac aagtggaaaa cacagaggag gctaccagca aagacgggtg cccaaaaccc
2581 aaagatgatg aggagttcgt catttcatct gacgacatta aaactgaggg taagaaaggg
2641 aagaacaaaa ctggccgtgg taaaaagcat acggccttct caagtaaagg tcttagtgat
2701 gaagagtatg atgagtacaa gagaattaga gaggaaagaa atggcaagta ctccatagaa
2761 gagtaccttc aggacaggga caaatactat gaggaggtgg ccattgccag ggcgaccgaa
2821 gaagacttct gtgaagagga ggaggccaag atccggcaaa ggatcttcag accaacaagg
2881 aaacaacgca aggaagaaag ggcttctctc ggtttagtca caggttctga aattaggaaa
2941 agaaatccag aagacttcaa gcccaagggg aaactatggg ctgacgatga cagaagtgtg
3001 gactacaatg aaaaactcag ttttgaggcc ccaccaagca tctggtcaag gatagtcaac
3061 tttggttcag gttggggctt ctgggtctcc cccagcctgt tcataacatc aacccacgtc
3121 ataccccagg gcgcaaagga gttctttgga gtccccatca aacaaattca ggtgcacaag
3181 tcaggcgaat tctgtcgctt gaggttccca aaaccaatca ggactgatgt gactggcatg
3241 atcttggaag aaggtgcgcc cgaaggcacc gtggtcacac tacttatcaa aaggtctact
3301 ggagaactca tgcccttagc agctagaatg ggaacccacg caaccatgaa aatccaaggg
3361 cgcactgttg gaggtcagat gggcatgctt ctaacagggt ccaacgccaa aagcatggat
3421 ctaggtacca caccaggtga ttgtggctgt ccctacatct acaagagagg aaacgactat
3481 gtggtcattg gagtccacac ggctgccgct cgtgggggaa acactgtcat atgtgccacc
3541 caggggggtg agggggaagc tacacttgaa ggtggtgaca gtaagggaac atactgtggt
3601 gcaccaatcc taggcccagg gagtgcccca aaactcagca ccaaaaccaa attctggagg
3661 tcgtccacag caccacttcc acctggcacc tatgagccag cctaccttgg tggtagagac
3721 cccagagtca agggtggccc ctcgttgcag caagtcatga gagaccagct aaaaccattt
3781 acagagccta ggggtaagcc accaaagcca agtgtgttag aagctgccaa gaaaaccatc
3841 atcaatgtcc ttgaacagac aattgaccca cctgagaagt ggtcgttcgc acaagcttgc
3901 gcgtcccttg ataagaccac ttctagcggc catccgcacc acatgcggaa aaacgactgc
3961 tggaacgggg agtccttcac aggcaagctg gcagaccagg cttccaaggc caacctgatg
4021 tttgaagaag ggaagaacat gaccccagtc tacacaggtg cacttaaaga tgaattagtt
4081 aaaactgaca aagtttatgg caagatcaag aagaggcttc tctggggctc ggatttggca
4141 accatgatcc ggtgtgctcg agcattcgga ggtcttatgg atgaactcaa agcacactgt
4201 gtcacacttc ctgtcagagt tggtatgaat atgaatgagg atggccccat catcttcgag
4261 aagcattcca gatacagata ccactatgat gctgattact ctcgctggga ttcaacacaa
4321 cagagagccg tgctggcagc tgctctagaa atcatggtta aattctcctc agaaccacat
4381 ttggctcagg tagtagcaga agaccttctt tctcctagcg tagtggatgt gggtgacttc
4441 acaatatcaa tcaacgaggg tcttccctct ggggtgccct gcacctccca atggaactcc
4501 atcgcccact ggcttctcac tctctgcgca ctctccgaag tcacaaattt gtctccagac
4561 atcatacagg ctaattctct cttctccttc tatggtgatg atgaaattgt tagtacagac
4621 ataaaattag acccagagaa gttgacagca aagcttaagg aatatgggtt gaaaccaacc
4681 cgccctgaca aaactgaagg gcctcttgtt atttctgaag acttagacgg tttgactttc
4741 ttgcggagaa ctgtgacccg cgaccctgct ggttggtttg gaaaactgga gcagagctca
4801 atacttaggc aaatgtactg gactaggggc cccaaccatg aagacccatc cgaatcaatg
4861 atcccacact ctcaaagacc catacaattg atgtccttac tgggagaggc cgcactccac
4921 ggcccaacat tctacagtaa aattagcaaa ttagtcattg cagagctaaa agaaggtggc
4981 atggattttt acgtgcccag acaggagcca atgttcagat ggatgagatt ctcggatctg
5041 agcacgtggg agggcgatcg caatctggct cccagttttg tgaatgaaga tggcgtcgaa
5101 tgacgccaac ccatctgatg ggtccgcagc caacctcgtc ccagaggtca acaatgaggt
5161 tatggctttg gagcccgttg tcggtgccgc tattgcggcg cctgtagcgg gccaacaaaa
5221 tgtaattgac ccctggatta gaagtaactt tgtacaagcc cctggtggag agttcacagt
5281 atcccctaga aacgctccag gtgaaatact atggagcgcg cccttaggcc ctgacctgaa
5341 tccctaccta tctcatttgg ccagaatgta taatggttat gcaggtggtt ttgaagtgca
5401 ggtgatcctc gcggggaacg cgttcaccgc gggaaaaatt atatttgcag cagtcccacc
5461 aaattttcca actgaaggct tgagtcccag ccaggttact atgttccccc acataatagt
5521 agatgttagg caattggaac ctgtgttgat ccccttacct gatgttagga ataacttcta
5581 tcactataac cagtcaaatg attctaccat taaattgata gcaatgctgt acacaccact
5641 cagggccaat aatgccgggg atgatgtctt cacagtctct tgtcgagtcc tcacgaggcc
5701 atcccctgat tttgacttca tatttctggt accacctaca gttgagtcaa gaactaagcc
5761 attcactgtc ccaatcttga ctgttgaaga aatgaccaat tcaagattcc ccattccttt
5821 ggagaaattg ttcacgggtc ccagcagtgc ctttattgtt caaccacaaa atggcagatg
5881 cacaactgat ggcgtgctct taggcaccac ccaactgtct cctgtcaaca tctgcacctt
5941 cagaggggat gtcacccaca ttgcgggttc ccataattac acaatgaatt tggcctctct
6001 aaattggaac aattatgacc caacagaaga gattccagcc cctctgggaa ctccagattt
6061 cgtgggaaag atccaaggtg tgctcactca aaccacaaag ggagatggtt cgacccgggg
6121 ccataaagct acagtttaca ctgggagtgc cgacttcact ccaaagctgg gcagtgttcg
6181 attttctact gacacagata atgactttga aactcaccaa aacacaagat ttaccccagt
6241 cggtgtcatt caggatggtg gcaccaccca ccgaaatgaa ccccaacaat gggtgctccc
6301 aagttattca ggtagaaatg tccataatgt acacctagcc cctgctgtag cccccaattt
6361 tccaggtgaa caactccttt tcttcaggtc cactatgccc ggatgcagcg ggtatcccaa
6421 catggatttg gattgcctac tcccccagga gtgggtgcaa cacttctacc aagaagcagc
6481 tccagcacaa tctgatgtgg ctctattgag atttgtgaat ccagacacgg gtagggtctt
6541 gtttgagtgc aaactccata aatcaggcta tgtcacagtg gctcataccg gccaacatga
6601 tttggtcatc ccccccaatg gttattttag gtttgattcc tgggttaatc agttctacac
6661 acttgccccc atgggaaatg gaacggggcg tagacgtgct ttgtaatggc tggaactttc
6721 tttgctggat tggcatctga tgtccttagc tctggacttg gttccctaat caatgctggg
6781 gctggggcta tcaaccagag gattgatttt gaaaataaca gaaaattgca gcaagcttcc
6841 tttcagttta gtagtaatct acaacaagct tcctttcaac atgataaaga gatgctccaa
6901 gcacaaattg aggccactaa aaagttgcaa caggaactga tgaaagtcaa acaggcaata
6961 ctcttagaag gtggattttc tgaaacagat gcagcccgtg gggcaatcaa cgcccccatg
7021 acaaagactt tggactggag tggaacaagg tactgggccc ctgacgctag gactacaaca
7081 tacaatgcag gccgcttttc cacccctcaa ccttcggggg cactgccagg aagaatcaac
7141 cccaggaccc ctatccccgc ccggggctcc ccaagcatgt cttccaatgt ttctactgct
7201 acttctatac attcaaatca aactgcttca acgagacttg gttctacagc tggttctggt
7261 accaatgtct cgagtctccc gtcaactgca aggactagga gttgggttga ggatcaaaac
7321 agaaatttgt cacctttcat gaggggggct cacaacatat cgtttgtcac cccaccatct
7381 agcagatcct ccagccaagg cacagtctca accgtgccta aagatgtttt ggactcctgg
7441 actggcgctt tcaacacgcg caggcagcct ctcttcgctc acattcgtag gcgaggggag
7501 tcacgggtgt aatgtgaaaa gacaaaattg attatctttc ccttcctcta gtgtctttta
7561 aaaaaaaaaa aaaaaaaaaa
//