Typing tool
|
Complete norovirus genomes
MN416764 | GI.2 | ||
---|---|---|---|
GI.P2 |
ORF1: 1..5062 ORF2: 5046..6686 ORF3: 6686..7321LOCUS MN416764 7362 bp RNA linear VRL 31-DEC-2019 DEFINITION Norovirus GI isolate G19_046 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MN416764 VERSION MN416764.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7362) AUTHORS Strubbia,S., Schaeffer,J. and Le Guyader,F.S. TITLE Metagenomic to detect norovirus and Human enteric viruses in oysters: impact on hexamer selection and targeted capture-based enrichment JOURNAL Unpublished REFERENCE 2 (bases 1 to 7362) AUTHORS Strubbia,S., Schaeffer,J. and Le Guyader,F.S. TITLE Direct Submission JOURNAL Submitted (06-SEP-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. v3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7362 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="G19_046" /isolation_source="digestive tissue" /host="shellfish" /db_xref="taxon:122928" /country="France" /collection_date="11-Nov-2018" /note="genotype: GIP2, GI.2" gene <1..5062 /gene="ORF1" CDS <1..5062 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QEM24798.1" /translation="SWEELDTTVKEEILDNHKEWFDAGGLGPCTMPPTYERVRDDSPP GEQVKWSARDGVNIGVERLTTVSGPEWNLCPLPPIDLRNMEPASEPTIGDMIEFYEGH IYHYSIYIGQGKTVGVHSPQAAFSVARVTIQPIAAWWRVCYIPQPKHRLSYDQLKELE NEPWPYAAITNNCFEFCCQVMNLEDTWLQRRLVTSGRFHHPTQSWSQQTPEFQQDSKL ELVRDAILAAVNGLVSQPFKNFLGKLKPLNVLNILSNCDWTFMGVVEMVILLLELFGV FWNPPDVSNFIASLLPDFHLQGPEDLARDLVPVILGGIGLAIGFTRDKVTKVMKSAVD GLRAATQLGQYGLEIFSLLKKYFFGGDQTERTLKGIEAAVIDMEVLSSTSVTQLVRDK QAAKAYMNILDNEEEKARKLSAKNADPHVISSTNALISRISMARSALAKAQAEMTSRM RPVVIMMCGPPGIGKTKAAEHLAKRLANEIRPGGKVGLVPREAVDHWDGYHGEEVMLW DDYGMTKIQDDCNKLQAIADSAPLTLNCDRIENKGMQFVSDAIVITTNAPGPAPVDFV NLGPVCRRVDFLVYCSAPEVEQIRRVSPGDTSALKDCFKPDFSHLKMELAPQGGFDNQ GNTPFGRGIMKPTTINRLLIQAVALTMERQDEFQLQGKMYDFDDDRVSAFTTMARDNG LGILSMAGLGKKLRGVTTMEGLKNALKGYKISACTIKWQAKVYSLESDGNSVNIKEER NILTQQQQSVCAASVALTRLRAARAVAYASCIQSAITSILQIAGSALVVNRAVKRMFG TRTATLSLEGPPREHKCRVHMAKAAGKGPIGHDDVVEKYGLCETEEDEEVAHTEIPSA TMEGKNKGKNKKGRGRKNNYNAFSRRGLNDEEYEEYKKIREEKGGNYSIQEYLEDRQR YEEELAEVQAGGDGGIGETEMEIRHRVFYKSKSRKHHQEERRQLGLVTGSDIRKRKPI DWTPPKSAWADDEREVDYNEKISFEAPPTLWSRVTKFGSGWGFWVSSTVFITTTHVIP TSAKEFFGEPLTSIAIHRAGEFTLFRFSKKIRPDLTGMILEEGCPEGTVCSVLIKRDS GELLPLAVRMGAIASMRIQGRLVHGQSGMLLTGANAKGMDLGTIPGDCGAPYVYKRAN DWVVCGVHAAATKSGNTVVCAVQASEGETTLEGGDKGHYAGHEIIKHGCGPALSTKTK FWKSSPEPLPPGVYEPAYLGGRDPRVTGGPSLQQVLRDQLKPFAEPRGRMPEPGLLEA AVETVTSSLEQVMDTPVPWSYSDACQSLDKTTSSGFPHHRRKNDDWNGTTFIRELGEQ AAHANNMYEQAKSMKPMYTAALKDELVKPEKVYQKVKKRLLWGADLGTVVRAARAFGP FCDAIKSHTIKLPIKVGMNSIEDGPLIYAEHSKYKYHFDADYTAWDSTQNRQIKTESF SIMCRLTASPELASVVAQDLLAPSEMDVGDYVIRVKEGLPSGFPCTSQVNSINHWLIT LCALSEVTGLSPDVIQSMSYFSFYGDDEIVSTDIEFDPAKLTQVLREYGLKPTRPDKS EGPIIVRKSVDGLVFLRRTISRDAAGFQGRLDRASIERQIYWTRGPNHSDPFETLVPH QQRKVQLISLLGEASLHGEKFYRKISSKVIQEIKTGGLEMYVPGWQAMFRWMRFHDLG LWTGDRNLLPEFVNDDGV" mat_peptide <1..892 /gene="ORF1" /product="p48" mat_peptide 893..1981 /gene="ORF1" /product="NTPase" mat_peptide 1982..2578 /gene="ORF1" /product="p22" mat_peptide 2579..2992 /gene="ORF1" /product="VPg" mat_peptide 2993..3535 /gene="ORF1" /product="Pro" mat_peptide 3536..5059 /gene="ORF1" /product="RdRp" gene 5046..6686 /gene="ORF2" CDS 5046..6686 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QEM24799.1" /translation="MMMASKDAPQSADGASGAGQLVPEVNTADPLPMEPVAGPTTAVA TAGQVNMIDPWIVNNFVQSPQGEFTISPNNTPGDILFDLQLGPHLNPFLSHLSQMYNG WVGNMRVRILLAGNAFSAGKIIVCCVPPGFTSSSLTIAQATLFPHVIADVRTLEPIEM PLEDVRNVLYHTNDNQPTMRLVCMLYTPLRTGGGSGSSDSFVVAGRVLTAPSSDFSFL FLVPPTIEQKTRAFTVPNIPLQTLSNSRFPSLIQGMILSPDASQVVQFQNGRCLIDGQ LLGTTPATSGQLFRVRGKINQGARTLNLTEVDGKPFMAFDSPAPVGFPDFGKCDWHMR ISKTPNNTSSGDPMRSVSVQTNVQGFVPHLGSIQFDEVFNHPTGDYIGTIEWISQPST PPGTDINLWEIPDYGSSLSQAANLAPPVFPPGFGEALVYFVSAFPGPNNRSAPNDVPC LLPQEYITHFVSEQAPTMGDAALLHYVDPDTNRNLGEFKLYPGGYLTCVPNGVGAGPQ QLPLNGVFLFVSWVSRFYQLKPVGTASTARGRLGVRRI" gene 6686..7321 /gene="ORF3" CDS 6686..7321 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QEM24800.1" /translation="MAQAIIGAIAASAAGSALGAGIQAGAEAALQSQRYQQDLALQRN TFEHDKDMLSYQVQASNALLAKNLNTRYSMLVAGGLSSADASRAVAGAPVTQLIDWNG TRVAAPRSSATTLRSGGFMAVPMPVQPKSKALQSSGFSNPAYDTSTVSSRTSSWVQSQ NSLRSVSPFHRQALQTVWVTPPGSTSSSSVSSTPYGVFNTDRMPLFANLRR" ORIGIN 1 atcatgggaa gagcttgaca ccacagttaa ggaagagatc ctagacaacc acaaagaatg 61 gtttgacgct ggtggtttgg gcccttgcac aatgcctcca acatatgaac gggtcaggga 121 cgacagtcca cctggtgaac aggttaaatg gtccgcacgt gatggagtta acattggagt 181 ggaacgcctc acgacagtga gtgggcctga gtggaatctt tgccccttac cccccatcga 241 tttgaggaac atggaaccag ctagtgaacc cactattgga gatatgatag aattctacga 301 aggccacatc tatcattact ccatatacat tgggcaaggc aaaacagtcg gcgtccattc 361 tccacaggcg gcattttcag tggctagagt gaccatccag cccatagccg cttggtggag 421 agtttgttac ataccccaac ccaagcatag actgagttac gaccaactca aggaactaga 481 gaatgagcca tggccatacg cggccataac caataattgt tttgaattct gctgtcaagt 541 catgaacctt gaggacacgt ggttgcaaag gcgactggtc acgtcgggta gattccacca 601 ccccacccag tcgtggtcac agcagacccc tgagttccaa caagatagca agttagagtt 661 ggttagggac gccatattgg ctgcagtgaa tggtcttgtt tcgcagccct ttaagaactt 721 cttgggtaaa ctcaaacccc tcaatgtgct taacatcctg tctaactgtg attggacctt 781 catgggggtg gtggaaatgg tcatactact acttgaactc tttggtgtgt tctggaaccc 841 gcctgatgta tccaatttta tagcgtccct tcttcctgat ttccatcttc agggacctga 901 agacttggca cgagatctag tcccagtgat tcttggtggt attggattag ccattgggtt 961 caccagagac aaagttacaa aggtcatgaa gagtgctgtg gatggtcttc gagctgccac 1021 acaactggga cagtatggat tagaaatatt ctcactgctc aagaagtact tctttggggg 1081 ggaccagact gagcgcaccc tcaaaggcat tgaggcagca gtcatagata tggaggtact 1141 gtcctccact tcagtgacac agctagtgag ggacaaacag gcagcaaagg cctatatgaa 1201 catcttggac aatgaagaag agaaggccag gaagctctct gctaaaaacg ctgacccaca 1261 tgtgatatcc tcaacaaatg ccctaatatc gcgcatatcc atggcacgat ctgcattggc 1321 caaggctcag gctgagatga ccagtcgaat gcgaccagtt gtcattatga tgtgtggccc 1381 acctgggatt gggaagacca aggctgctga gcacctagct aagcgtctag ccaatgagat 1441 cagaccaggt ggtaaggtgg ggttggttcc ccgtgaagct gtcgaccact gggacggtta 1501 tcatggtgag gaagtgatgc tgtgggatga ctatggcatg acaaaaatac aagacgactg 1561 taataaactc caggccattg ctgactcggc ccccctcaca ttaaattgtg ataggattga 1621 aaataaagga atgcagttcg tttcagatgc aatagtcatc accaccaacg ccccaggccc 1681 cgcccctgtg gactttgtca accttggacc agtgtgtaga cgggtcgact ttttggtgta 1741 ttgctctgcc ccagaggtgg agcagatacg gagagtcagc cctggcgaca catcagcact 1801 gaaggactgc ttcaagccag atttctcaca tttaaaaatg gagctggctc cacaaggtgg 1861 gtttgacaat caagggaaca caccgtttgg caggggcatc atgaagccaa caaccattaa 1921 tagactcctc atacaagccg tggcccttac catggaaagg caggatgagt tccagttgca 1981 gggaaaaatg tatgactttg atgatgacag ggtgtcagcg tttaccacca tggcacgtga 2041 caatggcctg ggcatcttga gcatggcggg tctgggtaag aagttacgcg gtgtcacaac 2101 gatggagggc ttgaagaatg ccctgaaggg atacaaaatt agtgcgtgca cgataaaatg 2161 gcaggctaaa gtgtactcac tagagtcaga tggcaacagt gtcaacatta aagaggagag 2221 gaacatctta actcaacaac aacagtcagt gtgtgctgcc tctgtcgcgc tcactcgcct 2281 ccgggctgcg cgtgcggtgg catacgcgtc atgcatccaa tcggctataa cttccatact 2341 acaaattgct ggctcagccc tagtggtcaa cagagcagtg aagagaatgt ttggcacgcg 2401 tactgccacc ctgtcccttg agggcccccc cagagaacac aaatgcaggg tccacatggc 2461 caaggccgca ggaaaggggc ctattggcca tgatgatgtg gtagaaaagt atgggctttg 2521 tgaaactgag gaggacgaag aagtggccca cactgaaatc ccttctgcca ccatggaggg 2581 caagaataaa gggaagaaca agaaaggacg tggtcggaag aacaactaca acgccttctc 2641 ccgcagggga ctcaatgatg aagagtacga agagtacaag aagatacgcg aggagaaagg 2701 tggcaattat agcatacagg agtacctaga ggataggcaa aggtatgaag aagagctagc 2761 agaggttcaa gcaggtggag atggaggaat cggggaaact gaaatggaaa tccgccacag 2821 agtgttctac aaatctaaga gtagaaagca tcaccaggaa gagcgacgcc agctagggct 2881 ggtgacaggt tccgacattc ggaagagaaa accaatcgac tggaccccac ccaagtcagc 2941 atgggcagat gatgagcgtg aggtggatta caatgagaag atcagttttg aggcgccccc 3001 tactttatgg agcagagtga caaagtttgg gtctggatgg ggtttctggg tcagctctac 3061 agtcttcata accacaacgc acgtcatacc aaccagtgcc aaggaattct ttggtgaacc 3121 cctaaccagc atagccatcc acagggctgg tgagttcact ctattcaggt tctcaaagaa 3181 aattaggcct gacctcacag gtatgatcct tgaggagggt tgccccgagg gcacagtgtg 3241 ttcagtgcta ataaaaaggg actctggtga actactgcca ttggctgtaa gaatgggcgc 3301 aatagcatca atgcgtatac agggccgcct tgttcatggg cagtccggca tgctgctcac 3361 cggggcaaat gctaagggca tggatcttgg aaccatccca ggagactgtg gggctcctta 3421 tgtctataag agagccaacg actgggtggt ctgtggtgta cacgctgctg ccaccaaatc 3481 aggcaacacc gttgtgtgcg ccgttcaggc cagtgaagga gaaaccacgc ttgaaggcgg 3541 tgacaaagga cactatgctg gacatgaaat aattaagcat ggttgtggac cagccctgtc 3601 aaccaaaacc aaattctgga aatcatcccc tgaaccacta ccccctgggg tctatgaacc 3661 cgcctacctc gggggccggg accctagggt aactggcggt ccctcactcc aacaggtgtt 3721 gcgggaccaa ttaaagccat ttgctgagcc acgaggacgc atgccagagc caggtctctt 3781 ggaggccgca gttgagactg tgacttcatc attagagcag gttatggaca ctcccgttcc 3841 ttggagttat agtgatgcgt gccagtccct tgataagacc actagttctg gttttcccca 3901 ccacagaagg aagaatgacg actggaatgg caccaccttt atcagggagt taggggagca 3961 ggcagcacac gctaataaca tgtatgaaca ggctaaaagt atgaaaccca tgtacacggc 4021 agcgctcaaa gatgaactag tcaaaccaga gaaggtatac caaaaagtga agaagcgctt 4081 gttatggggg gcagacttgg gcacggtggt tcgggccgcg cgggcttttg gtccattctg 4141 tgatgctata aaatcccaca caatcaaatt gcccattaaa gttggaatga attcaattga 4201 ggatgggcca ctgatctatg cagaacattc aaagtataag taccattttg atgcagatta 4261 cacagcttgg gattcaactc aaaatagaca aatcaagaca gagtcattct caatcatgtg 4321 tcggctaact gcgtcacctg aactagcttc agtggtggct caagatttgc ttgcaccctc 4381 agagatggat gttggcgact atgtcataag agtgaaggaa ggcctcccat ctggttttcc 4441 atgtacatca caggttaata gtataaacca ttggttaata actctgtgtg ccctttctga 4501 agtaactggt ctgtcgccag atgtcatcca gtccatgtca tatttctctt tctatggtga 4561 tgatgaaata gtgtcaactg acatagaatt tgatccagca aaactgacac aagtcctcag 4621 agagtatgga cttaaaccca cccgccccga caaaagcgag ggcccaataa ttgtgaggaa 4681 aagtgtggat ggtttggtct ttttgcgtcg cactatctcc cgcgacgccg caggattcca 4741 ggggcgactg gaccgggcat ccattgaaag acaaatctac tggactagag gacccaacca 4801 ttcagatcct tttgagaccc tggtgccaca ccaacaaagg aaggtccaat taatatcatt 4861 attgggtgag gcctcactgc atggtgaaaa gttttacagg aagatttcaa gtaaagtcat 4921 ccaggaaatt aaaacagggg gccttgaaat gtatgtgcca ggatggcaag ccatgttccg 4981 ttggatgcgg ttccatgacc ttggtttgtg gacaggagat cgcaatctcc tgcccgaatt 5041 tgtaaatgat gatggcgtct aaggacgccc ctcaaagcgc tgatggcgca agcggcgcag 5101 gtcaactggt gccggaggtt aatacagctg accccttacc catggaacct gtggctgggc 5161 caacaacagc cgtagccact gctgggcaag ttaatatgat tgatccctgg attgttaata 5221 attttgtcca gtcaccccaa ggtgagttca caatctctcc taataatacc cccggtgata 5281 ttttgtttga tttacaattg ggtccacacc taaacccttt cttgtcacat ttgtcccaaa 5341 tgtataatgg ctgggttggg aacatgagag tcagaattct ccttgctggg aatgcattct 5401 cagctggaaa gattatagtt tgttgtgtcc cccctggctt cacatcctct tctctcacca 5461 tagctcaggc tacattgttt ccccatgtaa ttgctgatgt gagaaccctt gagccaatag 5521 aaatgcccct cgaggatgta cgtaatgtcc tctatcacac caatgataat caaccaacaa 5581 tgcggctggt gtgtatgcta tacacgccgc tccgcactgg tggggggtct ggtagttctg 5641 attcctttgt agtcgctggc agggttctca cagcccctag cagcgacttc agtttcttgt 5701 tccttgtccc gcctaccata gagcagaaga ctcgggcttt cactgtgcct aatatcccct 5761 tgcaaacctt gtccaattct aggtttcctt ccctcatcca ggggatgatt ctgtcccccg 5821 atgcatctca agtggtccaa ttccaaaatg ggcgttgcct tatagatggt caactcttag 5881 gcactacacc cgctacatca ggacagctgt tcagagtaag aggaaagata aatcagggag 5941 cccgtacact taacctcaca gaggtggatg gtaaaccatt catggcattt gattcccctg 6001 cacctgtggg gttccccgat tttggaaaat gtgattggca catgagaatc agcaaaaccc 6061 caaacaacac aagttcaggt gaccccatgc gcagtgtcag cgtgcaaacc aatgtgcagg 6121 gttttgtgcc acacctggga agtatacaat ttgatgaagt gtttaatcat cccacaggtg 6181 actacattgg caccattgaa tggatttccc agccatctac accccctgga acagatatta 6241 atctgtggga gatccccgat tatgggtcat ccctttccca agcagctaat ctggcccccc 6301 cagtgttccc ccctggattt ggtgaggccc ttgtatactt tgtttctgct ttcccgggcc 6361 ccaataaccg ctcagccccg aatgatgtac cctgtcttct ccctcaagag tacataaccc 6421 actttgtcag tgaacaagcc ccaacgatgg gtgacgcagc tctactgcat tatgtcgacc 6481 ctgataccaa caggaacctt ggggagttca agctataccc tggaggttac ctcacctgtg 6541 taccaaacgg ggtaggtgcc gggccccaac agctccctct caatggtgtt tttctttttg 6601 tttcttgggt gtctcgtttt tatcagctta agcctgtggg aacagccagt acggcaagag 6661 gtaggctcgg agtacgccgt atataatggc ccaagccatc ataggagcaa ttgccgcgtc 6721 agctgcaggc tcagcattgg gtgcgggcat ccaggctggt gccgaggctg cgcttcagag 6781 tcaaagatac caacaagact tagccctgca aaggaatact tttgaacatg ataaggatat 6841 gctttcctac caggtccagg caagtaatgc acttttggca aagaatctca atacccgcta 6901 ttctatgctt gttgcagggg gtctttctag tgctgatgct tctcgggctg ttgctggggc 6961 ccctgtaaca caattgattg attggaacgg cactcgggtt gccgccccca gatcaagtgc 7021 aacaactctg aggtctggtg gtttcatggc agtccccatg cctgttcaac ccaaatctaa 7081 ggccctgcaa tcctctgggt tttctaatcc tgcttatgac acgtccacag tttcttctag 7141 gacttcttct tgggtgcagt cacagaattc cctgcgaagt gtgtcaccct ttcataggca 7201 ggcccttcaa actgtgtggg ttactccacc tgggtctact tcctcttctt ctgtttcctc 7261 aacaccttat ggtgttttta atacggatag gatgccgcta ttcgcaaatt tgcggcgtta 7321 atgttgtaat ataatgcagc agtgggcact atattcaatt tg //