Typing tool

Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| MW661251 | GII.4 Sydney | GII.P31 |
ORF1: 4..5103
ORF2: 5084..6706
ORF3: 6706..7512

LOCUS       MW661251                7608 bp    RNA     linear   VRL 28-SEP-2021
DEFINITION  Norovirus GII isolate BMH15-059 nonstructural polyprotein (ORF1),
            VP1 (ORF2), and VP2 (ORF3) genes, complete cds.
ACCESSION   MW661251
VERSION     MW661251.1
DBLINK      BioProject: PRJNA396739
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7608)
  AUTHORS   Nasheri,N., Flint,A., Reaume,S., Harlow,J., Hoover,E. and
            Weedmark,K.
  TITLE     Genomic Analysis of Human Noroviruses Using Hybrid
            Illumina-Nanopore Data
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7608)
  AUTHORS   Nasheri,N., Flint,A., Reaume,S., Harlow,J., Hoover,E. and
            Weedmark,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-FEB-2021) Bureau of Microbial Hazards, Health
            Canada, 251 SIR FREDERICK BANTING, Ottawa ON, Canada
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: Medaka v. v1.1.3; Pilon v.
v1.23 Sequencing Technology :: Illumina; Nanopore ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7608 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="BMH15-059" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="Canada: Ottawa" /collection_date="2015-03-11" /note="Viral RNA extracted from fecal samples; genotype: GII.4_Sydney_2012_GII.P31(GII.Pe)" gene 4..5103 /gene="ORF1" CDS 4..5103 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QSD58372.1" /translation="MKMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARP KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS LAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGETFPYTAFDNNCYAFCCWVLDLNDS WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KKEEANELAIVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM SDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLR CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGC LKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRNDVAGMILEEGAPEG TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL 
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRFHYDADYSRWD STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide 4..993 /gene="ORF1" /product="p48" mat_peptide 994..2091 /gene="ORF1" /product="NTPase" mat_peptide 2092..2628 /gene="ORF1" /product="p22" mat_peptide 2629..3027 /gene="ORF1" /product="VPg" mat_peptide 3028..3570 /gene="ORF1" /product="Pro" mat_peptide 3571..5100 /gene="ORF1" /product="RdRp" gene 5084..6706 /gene="ORF2" CDS 5084..6706 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QSD58373.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG STTHQNEPQQWMLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV" gene 6706..7512 /gene="ORF3" CDS 6706..7512 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QSD58374.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQVEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRTNLRDAVPARGSSSKSP NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 tgaatgaaga tggcgtctaa cgacgcttcc gctgccgctg ttgccaacag caacaacgac 61 atcgcaaaat cttcaagtga cggtgtgttt tctaacatgg ctgtcacttt taagcgggcc 121 ctcggggcgc ggcctaaaca gccgcccccg aaggagatac cacccagacc 
cccgcgacca 181 cccacaccag aattggtcaa aaagatccct cctcccccac ccaacgggga ggatgaacta 241 gtggtctctt acagcgccaa agatggtgtt tccggactgc ctgagctcac cactgtcaga 301 caaccggaag aaaccaacac ggcgttcagt gtccccccac tcaaccaaag ggagagcagg 361 gacgccaagg agccactaac tggaacaatt attgaaatgt gggatggaga aatctaccac 421 tacggcctgt acgtggaacg aggtcttata cttggtgtgc acaaaccacc ggcagccatc 481 agccttgcca aggtcgagct ggcaccgctc tctttgtttt ggagacctgt gtacaccccc 541 cagtatctca tctctccaga cactcttagg agattacatg gagagacatt cccctacact 601 gcatttgaca acaattgcta cgccttttgt tgttgggtat tagacctgaa cgactcatgg 661 ctaagcagga gaatgattca gagaacaaca ggtttcttca ggccgtacca ggattggaac 721 aggaaacccc tccccactat ggatgattcc aaattaaaga aggtagccaa catattcttg 781 tgcactttgt cttcactatt caccagaccc attaaggaca taatagggaa gttgaaacct 841 cttaacatcc ttaacattct ggctacatgt gattggacct tcgcaggcat agtggaatcc 901 ttaatactct tggcagaact ctttggagtt ttctggacac ccccagatgt gtctgcgatg 961 atcgccccct tgctaggtga ttatgaactg caaggacctg aggaccttgc agtggaactg 1021 gtcccaatag tgatgggggg gataggtttg gtgctaggat ttaccaaaga gaaaatcgga 1081 aagatgctat catccgctgc atccacttta agagcttgta aagaccttgg tgcatacgga 1141 ttggaaatct taaaattggt catgaagtgg tttttcccaa agaaagagga agcaaatgag 1201 ttggctatag tgagatccat cgaggatgca gtactagacc tcgaggcaat tgaaaacaac 1261 cacatgacca ccctactcaa ggacaaagac agcttggcaa cctacatgag aacccttgac 1321 cttgaggagg agaaagccag aaaactctca accaaatctg cttcacccga tattgtgggc 1381 acaatcaact ctctcctggc aagaatcgct gctgcacgct ccctagtgca tcgggcgaaa 1441 gaagagctct ccagcaggcc tagacctgtc gttgtgatga tatcgggaag accagggata 1501 gggaaaactc accttgccag ggagctggcc aagaagatcg cggcctccct cacaggggac 1561 cagcgtgtgg gccttatccc acgcaatggt gtcgaccact gggacgcata caagggcgaa 1621 agagttgtcc tatgggacga ctatggaatg agcaacccca tccacgatgc tctcaggttg 1681 caggagcttg ctgacacttg ccccctcacg ctaaattgtg acagaattga gaacaaaggg 1741 aaagtctttg acagtgatgc cataattatc accaccaacc tggccaaccc agcaccacta 1801 gattatgtca actttgaagc gtgctcgaga cgcattgact tcctcgtgta cgcagaagcc 1861 
cctgaggtag agaaggcaaa gcgcgacttc ccaggccaac ctgacatgtg gaagaacgct 1921 ttcagtcctg acttctcaca cataaaactt tcattggctc cacagggtgg ttttgacaag 1981 aacggcaaca ccccgcatgg aaaaggagtc atgaagaccc tcaccactgg ctccctcatc 2041 gcccgagcat cagggttact ccatgagagg ctagatgaat atgaactgca aggcccagcc 2101 ctcaccactt tcaactttga ccgcaacaag atacttgctt ttagacaact tgctgctgaa 2161 aacaagtatg ggttgatgga cacaatgaga gttggaaaac agcttaagga tgtcaagacc 2221 atgtcagacc tcaaacaagc actcaagaac atcgcgatca agaagtgcca gatagtgtac 2281 aatggtggca cctacacact tgaggccgat ggcaagggta gtgtaaaagt tgacaaagtg 2341 caaagtgcca ctgtgcagac caacaatgaa ctagccggtg ccctacacca cctaaggtgc 2401 gctagaatca gatattatgt taagtgcgtc caggaggcac tgtattccat catccaaatc 2461 gctggggctg cattcgtcac cacgcgcatc gccaagcgca tgaatataca gaatctctgg 2521 tccaagccac aggtggaaga cacagaagag atggccaaca aagatggttg cctaaaaccc 2581 aaagatgatg aagagtttgt cgtctcatcc gacgacatca aaactgaggg caagaaaggg 2641 aagaacaagt ccggccgtgg caagaagcac acagcctttt caagcaaagg gctcagtgat 2701 gaggagtacg atgagtacaa gagaatcaga gaagaaagga atggcaagta ctccatagaa 2761 gagtaccttc aggacagaga caggtactac gaggaggtgg ccattgccag ggcaaccgaa 2821 gaggacttct gtgaagaaga agaggccaaa atccggcaga gaattttcag accaacaagg 2881 aaacaacgca aagaagagag ggcctctctc ggcttggtca caggctctga aatcaggaag 2941 agaaacccag aagacttcaa acccaaggga aagctgtggg ctgacgatga cagaagtgtt 3001 gactacaatg agaaactcaa ctttgaggca ccaccaagca tctggtcacg gatagtcaac 3061 tttggttcag gctggggctt ctgggtctcc cccagtctgt ttataacatc aacccatgtc 3121 ataccccaag gtgcaaaaga gttcttcgga gtccctatca agcaaatcca gatacacaaa 3181 tcaggtgaat tctgccggtt gagattccca aagccaatca gaaatgatgt ggcgggtatg 3241 attctagaag aaggtgcgcc cgaggggacc gtggccacat tactcatcaa gagaccaact 3301 ggagagctca tgcctctggc agccagaatg gggacccatg cgaccatgaa aattcagggg 3361 cgcacagttg gagggcaaat gggtatgctc ctgacaggat ccaacgccaa gagtatggac 3421 ctaggcacaa caccaggcga ctgcggctgc ccctacatct ataagagggg gaatgactac 3481 gtggtcatag gggtccatac ggccgctgcc cgtgggggaa acactgtcat atgtgccacc 3541 caggggagtg 
agggagaagc cacacttgaa ggaggtgaca gtaaagggac atactgtggc 3601 gcaccaatct tgggaccagg gagtgctccg aagctcagta ccaagactaa gttttggaga 3661 tcatccacaa caccactccc acctggcacc tacgaaccag cctacctcgg tggcaaagac 3721 cctagagtca aaggtggccc ttcattgcaa caagttatga gggaccagct gaagccattc 3781 acagaaccca gaggcaaacc accaagacca aatgtgttgg aagctgccaa gaaaaccatc 3841 atcaatgttc ttgagcaaac aattgatcca ccccaaaaat ggtcatttgc gcaagcttgc 3901 gcatcccttg acaaaaccac ctccagtggc cacccgcacc acatgcggaa aaacgactgt 3961 tggaatgggg agtccttcac aggaaaattg gctgatcaag cctccaaagc caacttaatg 4021 tttgaagagg gaaagaacat gactccagtc tacacaggtg cacttaaaga tgagttggta 4081 aagaccgata aagtttatgg taaggtcaag aagagacttc tgtggggttc agatctggcg 4141 accatgatac ggtgcgcccg agcttttgga ggccttatgg atgaactcaa ggcgcactgt 4201 gtcacacttc ctgtcagagt tggtatgaac atgaatgagg atggccccat catctttgag 4261 aagcactcca gatatagatt tcactatgat gctgattatt cccggtggga ctcaacacaa 4321 caaagggatg tgctagcagc agcactagaa atcatggtta agttttctcc agaaccacac 4381 ctggcccaga tagttgcaga agacctcctt tcccctagcg taatggatgt aggtgacttt 4441 caaatatcaa taagtgaggg tctcccctct ggggtacctt gtacctccca gtggaattcc 4501 atcgcccact ggctcctcac tctgtgtgca ctctctgaag tcacggacct gtcccctgac 4561 atcattcagg ccaactccct tttctccttc tatggtgatg atgagattgt aagcacagac 4621 ataaagttgg acccagagaa gctgacagcg aaactcaagg agtacgggct aaaaccaacc 4681 cgccccgaca aaactgaagg accgcttgtt atctctgaag acctggatgg cctgacattc 4741 ctccggagaa ctgtgacccg tgatccagct ggctggtttg gaaaactgga acaaagttca 4801 attctcaggc aaatgtactg gactaggggc cccaaccatg aagatccatt tgaaacaatg 4861 ataccacact cccaaagacc catacaattg atgtccttgc tgggcgaggc tgcactccac 4921 ggcccggcat tctatagcaa aattagcaaa ttagtcattg cagagttgaa ggaaggtggc 4981 atggattttt acgtgcccag acaagagcca atgttcagat ggatgagatt ctcagatctg 5041 agcacgtggg agggcgatcg caatctggct cccagttttg tgaatgaaga tggcgtcgag 5101 tgacgccaac ccatctgatg ggtccgcagc caacctcgtc ccagaggtca acaatgaggt 5161 tatggctctg gagcccgttg ttggtgccgc cattgcggca cccgtagcgg gccaacaaaa 5221 tgtaattgac ccctggatta 
gaaataattt tgtgcaagcc cctggtggag agtttacagt 5281 gtcccctaga aacgctccag gtgaaatact atggagcgcg cccttgggtc ctgatctaaa 5341 tccctaccta tcccatttgg ccagaatgta caatggttat gcaggtggtt ttgaagtgca 5401 ggtaattctc gcggggaacg cgttcaccgc cgggaaggtc atatttgcag cagtcccacc 5461 aaattttcca actgaaggct tgagccccag ccaggtcact atgttccccc atatagtagt 5521 agatgttagg caactagaac ctgtgttgat tcccttaccc gatgtcagga ataattttta 5581 tcattacaat caatcaaatg accccaccat taagttgata gcaatgttgt atacaccact 5641 tagggctaat aatgctgggg acgatgtttt cacagtttct tgccgagttc tcacgagacc 5701 atcccctgat tttgatttca tatttctagt gccacccaca gttgagtcaa gaactaaacc 5761 tttctctgtc ccagttttaa ctgttgagga gatgaccaat tcaaggttcc ccattccttt 5821 ggaaaagttg ttcacgggtc ccagcagtgc ctttgttgtc caaccacaaa acggcaggtg 5881 cacgactgat ggcgtgctcc taggcactac ccaactgtct cctgtcaaca tctgcacctt 5941 cagaggagat gtcacccata tcacaggtag tcgtaactac acaatgaatt tggcttctca 6001 aaactggaac aattatgacc caacagaaga aatcccagcc cccctaggaa ctccagattt 6061 tgtggggaag attcaaggcg tgctcaccca aaccacaagg acagatggct caacacgcgg 6121 ccacaaagct acagtgtaca ctgggagcgc cgactttgcc ccaaaactgg gtagagttca 6181 atttgaaact gacacagacc atgactttga agctaaccaa aacacaaagt tcaccccagt 6241 tggtgttatc caagatggta gcaccaccca ccaaaatgaa ccccaacagt ggatgctccc 6301 aagttactca ggtagaaata ctcataatgt gcatctggcc cccgctgtag cccccacttt 6361 tccgggtgag caacttctct tcttcagatc caccatgccc ggatgcagcg ggtaccccaa 6421 catggattta gactgtctgc ttccccagga atgggtgcag tacttctacc aagaggcagc 6481 cccagcacaa tctgatgtgg ctctgctaag atttgtgaat ccagacacag gtagggttct 6541 gtttgagtgt aagcttcata aatcaggcta tgttacagtg gctcacactg gccaacatga 6601 tttggttatc ccccccaatg gttattttag gtttgattcc tgggtcaacc agttttacac 6661 gcttgccccc atgggaaatg gaacggggcg cagacgtgca gtataatggc tggagctttc 6721 tttgctggat tggcatctga tgtccttggc tctggactcg gttctcttat caatgctggg 6781 gctggggcca tcaaccaaaa agttgagttt gaaaataaca gaaaattgca acaagcatcc 6841 ttccaattta gcagcaatct acaacaggct tccttccaac atgacaaaga gatgctccaa 6901 gcacaagttg aggccaccaa aaagttacaa 
caggaaatga tgaaagttaa gcaggcaatg 6961 ctcctagagg gtgggttctc tgagacagat gcagcccgcg gggcaatcaa cgcccccatg 7021 acaaaagctt tggactggag cgggacaagg tactgggctc ccgatgctag gactacaaca 7081 tacaatgcag gccgcttttc cacccctcaa ccatcggggg cactgccagg aagaactaat 7141 cttagggatg ctgtccctgc tcggggttcc tccagcaaat ctcctaattc ttccactgct 7201 acttctgtgt actcaaatca aactacttca acgagacttg gttctacagc tggttctggt 7261 accagtgtct cgagcttccc gtcgactgcg aggactagga gttgggttga ggatcaaagc 7321 aggaatttgt cacctttcat gaggggggcc cacaacatat cgtttgtcac cccaccatct 7381 agcagatcct ctagccaagg cacagtctca accgtgccta aagaggtttt ggactcctgg 7441 actggcgctt tcaacacgcg caggcagcca ctcttcgctc acattcgtaa gcgaggggag 7501 tcacgggcgt aatgtgaaaa gacaaaattg actatctctt tctttttctt tagtgtcttt 7561 ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa //
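
The ORF and mat_peptide coordinates annotated in this record (ORF1: 4..5103, ORF2: 5084..6706, ORF3: 6706..7512) can be sanity-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the typing tool itself: the coordinates are hard-coded from the record, the helper names (`length`, `overlap`, `frame`, `tiles_cds`) are illustrative, and it assumes each CDS range includes its stop codon, as is usual for GenBank CDS features.

```python
# Coordinates copied from record MW661251 above (1-based, inclusive, GenBank style).
ORFS = {
    "ORF1": (4, 5103),
    "ORF2": (5084, 6706),
    "ORF3": (6706, 7512),
}

# Mature peptides annotated within ORF1, in genome order.
MAT_PEPTIDES = [
    ("p48", 4, 993),
    ("NTPase", 994, 2091),
    ("p22", 2092, 2628),
    ("VPg", 2629, 3027),
    ("Pro", 3028, 3570),
    ("RdRp", 3571, 5100),
]


def length(start, end):
    """Length in nucleotides of an inclusive 1-based range."""
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def frame(start):
    """Reading frame (0, 1 or 2) relative to genome position 1."""
    return (start - 1) % 3


def tiles_cds(peptides, cds):
    """True if the peptides are contiguous, each a whole number of codons,
    begin at the CDS start, and end one codon (the assumed stop) before the CDS end."""
    expected_start = cds[0]
    for _name, start, end in peptides:
        if start != expected_start or length(start, end) % 3 != 0:
            return False
        expected_start = end + 1
    return expected_start - 1 == cds[1] - 3


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        nt = length(start, end)
        # Assumption: the range includes the stop codon, so residues = codons - 1.
        print(f"{name}: {nt} nt, {nt // 3 - 1} aa, frame {frame(start)}")
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
    print("mat_peptides tile ORF1:", tiles_cds(MAT_PEPTIDES, ORFS["ORF1"]))
```

With the coordinates above, this reports ORF1 as 5100 nt, ORF2 as 1623 nt and ORF3 as 807 nt, a 20 nt overlap between ORF1 and ORF2, and a single shared nucleotide between ORF2 and ORF3. ORF2 falls in a different reading frame from ORF1 and ORF3, which is what allows the overlaps.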