Typing tool
|
Complete norovirus genomes
MK956178 | GI.2 | ||
---|---|---|---|
GI.P2 |
ORF1: 1..5121 ORF2: 5105..6745 ORF3: 6745..7380LOCUS MK956178 7433 bp RNA linear VRL 12-NOV-2019 DEFINITION Norovirus GI isolate G19-006 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK956178 VERSION MK956178.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7433) AUTHORS Strubbia,S., Schaeffer,J., Oude Munnink,B.B., Besnard,A., Phan,M.V.T., Nieuwenhuijse,D.F., de Graaf,M., Schapendonk,C.M.E., Wacrenier,C., Cotten,M., Koopmans,M.P.G. and Le Guyader,F.S. TITLE Metavirome Sequencing to Evaluate Norovirus Diversity in Sewage and Related Bioaccumulated Oysters JOURNAL Front Microbiol 10, 2394 (2019) PUBMED 31681246 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 7433) AUTHORS Le Guyader,S., Schaeffer,J., Strubbia,S., Besnard,A., Phan,M.V., Cotten,M., Oude Munnink,B.B., Nieuwenhuijse,D.F., De Graaf,M. and Koopmans,M. TITLE Direct Submission JOURNAL Submitted (21-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. v3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7433 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="G19-006" /isolation_source="sewage" /db_xref="taxon:122928" /country="France: Nantes" /collection_date="22-Mar-2018" /note="genotype: GI.2-GI.P2" gene <1..5121 /gene="ORF1" CDS <1..5121 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QCT04933.1" /translation="QEEVQYGMGWSNRPIDQNVKSWEELDTTVKEEILDNHKEWFDAG GLGPCTMPPTYERVRDDSPPGEQVKWSARDGVNIGVERLTTVSGPEWNLCPLPPIDLR NMEPASEPTIGDMIEFYEGHIYHYSIYIGQGKTVGVHSPQAAFSVARVTIQPIAAWWR VCYIPQPKHRLSYDQLKELENEPWPYAAITNNCFEFCCQVMNLEDTWLQRRLVTSGRF HHPTQSWSQQTPEFQQDSKLELVRDAILAAVNGLVSQPFKNFLGKLKPLNVLNILSNC DWTFMGVVEMVILLLELFGVFWNPPDVSNFIASLLPDFHLQGPEDLARDLVPVILGGI GLAIGFTRDKVTKVMKSAVDGLRAATQLGQYGLEIFSLLKKYFFGGDQTERTLKGIEA AVIDMEVLSSTSVTQLVRDKQAAKAYMNILDNEEEKARKLSAKNADPHVISSTNALIS RISMARSALAKAQAEMTSRMRPVVIMMCGPPGIGKTKAAEHLAKRLANEIRPGGKVGL VPREAVDHWDGYHGEEVMLWDDYGMTKIQDDCNKLQAIADSAPLTLNCDRIENKGMQF VSDAIVITTNAPGPAPVDFVNLGPVCRRVDFLVYCSAPEVEQIRRVSPGDTSALKDCF KPDFSHLKMELAPQGGFDNQGNTPFGRGIMKPTTINRLLIQAVALTMERQDEFQLQGK MYDFDDDRVSAFTTMARDNGLGILSMAGLGKKLRGVTTMEGLKNALKGYKISACTIKW QAKVYSLESDGNSVNIKEERNILTQQQQSVCAASVALTRLRAARAVAYASCIQSAITS ILQIAGSALVVNRAVKRMFGTRTATLSLEGPPREHKCRVHMAKAAGKGPIGHDDVVEK YGLCETEEDEEVAHTEIPSATMEGKNKGKNKKGRGRKNNYNAFSRRGLNDEEYEEYKK IREEKGGNYSIQEYLEDRQRYEEELAEVQAGGDGGIGETEMEIRHRVFYKSKSRKHHQ EERRQLGLVTGSDIRKRKPIDWTPPKSAWADDEREVDYNEKISFEAPPTLWSRVTKFG SGWGFWVSSTVFITTTHVIPTSAKEFFGEPLTSIAIHRAGEFTLFRFSKKIRPDLTGM ILEEGCPEGTVCSVLIKRDSGELLPLAVRMGAIASMRIQGRLVHGQSGMLLTGANAKG MDLGTIPGDCGAPYVYKRANDWVVCGVHAAATKSGNTVVCAVQASEGETTLEGGDKGH YAGHEIIKHGCGPALSTKTKFWKSSPEPLPPGVYEPAYLGGRDPRVTGGPSLQQVLRD QLKPFAEPRGRMPEPGLLEAAVETVTSSLEQVMDTPVPWSYSDACQSLDKTTSSGFPY HRRKNDDWNGTTFIRELGEQAAHANNMYEQAKSMKPMYTAALKDELVKPEKVYQKVKK RLLWGADLGTVVRAARAFGPFCDAIKSHTIKLPIKVGMNSIEDGPLIYAEHSKYKYHF DADYTAWDSTQNRQIMTESFSIMCRLTASPELASVVAQDLLAPSEMDVGDYVIRVKEG LPSGFPCTSQVNSINHWLITLCALSEVTGLSPDVIQSMSYFSFYGDDEIVSTDIEFDP AKLTQVLREYGLKPTRPDKSEGPIIVRKSVDGLVFLRRTISRDAAGFQGRLDRASIER QIYWTRGPNHSDPFETLVPHQQRKVQLISLLGEASLHGEKFYRKISSKVIQEIKTGGL EMYVPGWQAMFRWMRFHDLGLWTGDRNLLPEFVNDDGV" mat_peptide <1..951 /gene="ORF1" /product="p48" mat_peptide 952..2040 /gene="ORF1" /product="NTPase" mat_peptide 2041..2637 /gene="ORF1" /product="p22" mat_peptide 2638..3051 /gene="ORF1" /product="VPg" mat_peptide 3052..3594 /gene="ORF1" /product="Pro" mat_peptide 3595..5118 /gene="ORF1" /product="RdRp" gene 5105..6745 /gene="ORF2" CDS 5105..6745 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCT04934.1" /translation="MMMASKDAPQSADGASGAGQLVPEVNTADPLPMEPVAGPTTAVA TAGQVNMIDPWIVNNFVQSPQGEFTISPNNTPGDILFDLQLGPHLNPFLSHLSQMYNG WVGNMRVRILLAGNAFSAGKIIVCCVPPGFTSSSLTIAQATLFPHVIADVRTLEPIEM PLEDVRNVLYHTNDNQPTMRLVCMLYTPLRTGGGSGSSDSFVVAGRVLTAPSSDFSFL FLVPPTIEQKTRAFTVPNIPLQTLSNSRFPSLIQGMILSPDASQVVQFQNGRCLIDGQ LLGTTPATSGQLFRVRGKINQGARTLNLTEVDGKPFMAFDSPAPVGFPDFGKCDWHMR ISKTPNNTSSGDPMRSVSVQTNVQGFVPHLGSIQFDEVFNHPTGDYIGTIEWISQPST PPGTDINLWEIPDYGSSLSQAANLAPPVFPPGFGEALVYFVSAFPGPNNRSAPNDVPC LLPQEYITHFVSEQAPTMGDAALLHYVDPDTNRNLGEFKLYPGGYLTCVPNGVGAGPQ QLPLNGVFLFVSWVSRFYQLKPVGTASTARGRLGVRRI" gene 6745..7380 /gene="ORF3" CDS 6745..7380 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCT04935.1" /translation="MAQAIIGAIAASAAGSALGAGIQAGAEAALQSQRYQQDLALQRN TFEHDKDMLSYQVQASNALLAKNLNTRYSMLVAGGLSSADASRAVAGAPVTQLIDWNG TRVAAPRSSATTLRSGGFMAVPMPVQPKSKALQSSGFSNPAYDTSTVSSRTSSWVQSQ NSLRSVSPFHRQALQTVWVTPPGSTSSSSVSSTPYGVFNTDRMPLFANLRR" ORIGIN 1 caggaggagg tccagtatgg tatggggtgg tccaacaggc ccattgatca gaacgtcaaa 61 tcatgggaag agcttgacac cacagttaag gaagagatcc tagacaacca caaagaatgg 121 tttgacgctg gtggtttggg cccttgcaca atgcctccaa catatgaacg ggtcagggac 181 gacagtccac ctggtgaaca ggttaaatgg tccgcacgtg atggagttaa cattggagtg 241 gaacgcctca cgacagtgag tgggcctgag tggaatcttt gccccttacc ccccatcgat 301 ttgaggaaca tggaaccagc tagtgaaccc actattggag atatgataga attctacgaa 361 ggccacatct atcattactc catatacatt gggcaaggca aaacagtcgg cgtccattct 421 ccacaggcgg cattttcagt ggctagagtg accatccagc ccatagccgc ttggtggaga 481 gtttgttaca taccccaacc caagcataga ctgagttacg accaactcaa ggaactagag 541 aatgagccat ggccatacgc ggccataacc aataattgtt ttgaattctg ctgtcaagtc 601 atgaatcttg aggacacgtg gttgcaaagg cgactggtca cgtcgggcag attccaccac 661 cccacccagt cgtggtcaca gcagacccct gagttccaac aagatagcaa gttagagttg 721 gttagggacg ccatattggc tgcagtgaat ggtcttgttt cgcagccctt taagaacttc 781 ttgggtaaac tcaaacccct caatgtgctt aacatcctgt ctaactgtga ttggaccttc 841 atgggggtgg tggaaatggt catactacta cttgaactct ttggtgtgtt ctggaacccg 901 cctgatgtat ccaattttat agcgtccctt cttcctgatt tccatcttca gggacctgaa 961 gacttggcac gagatctagt cccagtgatt cttggtggta taggattagc cattgggttc 1021 accagagaca aagttacaaa ggtcatgaag agtgctgtgg atggtcttcg agctgccaca 1081 caactgggac agtatggatt agaaatattc tcactgctca agaagtactt ctttgggggg 1141 gaccagactg agcgcaccct caaaggcatt gaggcagcag tcatagatat ggaggtactg 1201 tcctccactt cagtgacaca gctagtgagg gacaaacagg cagcaaaggc ctatatgaac 1261 atcttggaca atgaagaaga gaaggccagg aagctctctg ctaaaaacgc tgacccacat 1321 gtgatatcct caacaaatgc cctaatatcg cgcatatcca tggcacgatc tgcattggcc 1381 aaggctcagg ctgagatgac cagtcgaatg cgaccagttg tcattatgat gtgtggccca 1441 cctgggattg ggaagaccaa ggctgctgag cacctagcta agcgtctagc caatgagatc 1501 agaccaggtg gtaaggtggg gttggttccc cgtgaagctg tcgaccactg ggacggttat 1561 catggtgagg aagtgatgct gtgggatgac tatggcatga caaaaataca agacgactgt 1621 aataaactcc aggccattgc tgattcggcc cccctcacat taaattgtga taggattgaa 1681 aataaaggaa tgcagttcgt ttcagatgca atagtcatca ccaccaacgc cccaggcccc 1741 gcccctgtgg actttgtcaa ccttggacca gtgtgtagac gggtcgactt tttggtgtat 1801 tgctctgccc cagaggtgga gcagatacgg agagtcagcc ctggcgacac atcagcactg 1861 aaggactgct tcaagccaga tttctcacat ttaaaaatgg agctggctcc acaaggtggg 1921 tttgacaatc aagggaacac accgtttggc aggggcatca tgaagccaac aaccattaat 1981 agactcctca tacaagccgt ggcccttacc atggaaaggc aggatgagtt ccagttgcag 2041 gggaagatgt atgactttga tgatgacagg gtgtcagcgt ttaccaccat ggcacgtgac 2101 aatggcctgg gcatcttgag catggcgggt ctgggtaaga agttacgcgg tgtcacaacg 2161 atggagggct tgaagaatgc cctgaaggga tacaaaatta gtgcgtgcac gataaaatgg 2221 caggctaaag tgtactcact agagtcagat ggcaacagtg tcaacattaa agaggagagg 2281 aacatcttaa ctcaacaaca acagtcagtg tgtgctgcct ctgtcgcgct cactcgcctc 2341 cgggctgcgc gtgcggtggc atacgcgtca tgcatccaat cggctataac ttctatacta 2401 caaattgctg gctcagccct agtggtcaac agagcagtga agagaatgtt tggcacgcgt 2461 actgccaccc tgtcccttga gggccccccc agagaacaca aatgcagggt ccacatggcc 2521 aaggccgcag gaaaggggcc tattggccat gatgatgtgg tagaaaagta tgggctttgt 2581 gaaactgagg aggacgaaga agtggcccac actgaaatcc cttctgccac tatggagggc 2641 aagaataaag ggaagaacaa gaaaggacgt ggtcggaaga acaactacaa cgccttctcc 2701 cgcaggggac tcaatgatga agagtacgaa gagtacaaga agatacgcga ggagaaaggt 2761 ggcaattata gcatacagga gtacctagag gataggcaaa ggtatgaaga agagctagca 2821 gaggttcaag caggtggaga tggaggaatc ggggaaactg aaatggaaat ccgccacaga 2881 gtgttctaca aatctaagag tagaaagcat caccaggaag agcgacgcca gctagggctg 2941 gtgacaggtt ccgacattcg gaagagaaaa ccaatcgact ggaccccacc caagtcagca 3001 tgggcagatg atgagcgtga ggtggattac aatgagaaga tcagttttga ggcgccccct 3061 actttatgga gcagagtgac aaagtttggg tctggatggg gtttctgggt cagctctaca 3121 gtcttcataa ccacaacgca cgtcatacca accagtgcga aggaattctt tggtgaaccc 3181 ctaaccagca tagccatcca cagggctggt gagttcactc tattcaggtt ctcaaagaaa 3241 attaggcctg acctcacagg tatgatcctt gaggagggtt gccccgaggg cacagtgtgt 3301 tcagtgctaa taaaaaggga ctctggtgaa ctactgccat tggctgtaag aatgggcgca 3361 atagcatcaa tgcgtataca gggccgcctt gttcatgggc agtccggcat gctgctcacc 3421 ggggctaatg ctaagggcat ggaccttgga accatcccag gagactgtgg ggctccttat 3481 gtctataaga gagccaacga ctgggtggtc tgtggtgtac acgctgctgc caccaaatca 3541 ggcaacaccg ttgtgtgcgc cgttcaggcc agtgaaggag aaaccacgct tgaaggcggt 3601 gacaaaggac actatgctgg acatgaaata attaagcatg gttgtggacc agccctgtca 3661 accaaaacca aattctggaa atcatcccct gaaccactac cccctggggt ctatgaaccc 3721 gcctacctcg ggggccggga ccctagggta actggcggtc cctcactcca acaggtgttg 3781 cgggaccaat taaagccatt tgctgagcca cgaggacgca tgccagagcc aggtctcttg 3841 gaggccgcag ttgagactgt gacttcatca ttagagcagg ttatggacac tcccgtccct 3901 tggagttata gtgatgcgtg ccagtccctt gacaagacca ctagttctgg ctttccctac 3961 cacagaagga agaacgacga ctggaatggc accaccttca tcagggagtt aggggagcag 4021 gcagcacatg ctaataacat gtatgaacag gctaaaagta tgaaacccat gtacacggca 4081 gcgctcaaag atgaactagt caaaccagag aaggtatacc aaaaagtgaa gaagcgcttg 4141 ttatgggggg cagacttggg cacggtggtt cgggccgcgc gggcttttgg tccattctgt 4201 gatgctataa aatcccacac aatcaaattg cccattaaag ttggaatgaa ttcaattgag 4261 gatgggccac tgatctatgc agaacattca aagtataagt accattttga tgcagattac 4321 acagcttggg attcaactca aaatagacaa atcatgacag agtcattctc aatcatgtgt 4381 cgactaactg catcacctga actagcttca gtggtagctc aagatttgct tgcaccttca 4441 gagatggatg ttggcgacta tgtcataaga gtgaaggaag gcctcccatc tggttttcca 4501 tgtacatcac aggttaatag tataaaccat tggttaataa ctctgtgtgc cctttctgaa 4561 gtaactggtc tgtcgccaga tgtcatccag tccatgtcat atttctcttt ctatggtgat 4621 gatgaaatag tgtcaactga catagaattt gacccagcaa aactgacaca agtcctcaga 4681 gagtatggac ttaaacccac ccgccccgac aaaagcgagg gcccaataat tgtgaggaag 4741 agtgtggatg gtttggtctt tttgcgtcgc actatctccc gcgacgccgc aggattccag 4801 gggcgactgg accgggcatc cattgaaaga caaatctact ggactagagg acccaaccat 4861 tcagatcctt ttgagaccct ggtgccacac caacaaagga aggtccaatt aatatcatta 4921 ttgggtgagg cctcactgca tggtgaaaag ttttacagga agatttcaag taaagtcatc 4981 caggaaatta aaacaggggg ccttgaaatg tatgtgccag gatggcaagc catgttccgt 5041 tggatgcggt tccatgacct tggtttgtgg acaggagatc gcaatctcct gcccgaattt 5101 gtaaatgatg atggcgtcta aggacgcccc tcaaagcgct gatggcgcaa gcggcgcagg 5161 tcaactggtg ccggaggtta atacagctga ccccttaccc atggaacctg tggctgggcc 5221 aacaacagcc gtagccactg ctgggcaagt taatatgatt gatccctgga ttgttaataa 5281 ttttgtccag tcaccccaag gtgagttcac aatctctcct aataataccc ccggtgatat 5341 tttgtttgat ttacaattgg gtccacacct aaaccctttc ttgtcacatt tgtcccaaat 5401 gtacaatggc tgggttggga acatgagagt cagaattctc cttgctggga atgcattctc 5461 agctggaaag attatagttt gttgtgtccc ccctggcttt acatcttctt ctctcaccat 5521 agctcaggct acattgtttc cccatgtaat tgctgatgtg agaacccttg agccaataga 5581 aatgcccctc gaagatgtac gtaatgtcct ctatcacacc aatgataatc aaccaacaat 5641 gcggctggtg tgtatgctat acacgccgct ccgcactggt ggggggtctg gtagttctga 5701 ttcctttgta gtcgctggca gggttctcac agcccctagc agcgacttca gtttcttgtt 5761 ccttgtcccg cctaccatag agcagaagac tcgggctttc actgtgccta atatcccctt 5821 gcaaaccttg tccaactcta ggtttccttc cctcatccag gggatgattc tgtcccccga 5881 tgcatctcaa gtggtccaat tccaaaatgg gcgttgcctt atagatggtc aactcttagg 5941 cactacaccc gctacatcag gacagctgtt cagagtaaga ggaaagataa atcagggagc 6001 ccgtacactt aacctcacag aggtggatgg taaaccattc atggcatttg attcccctgc 6061 acctgtgggg ttccccgatt ttgggaaatg tgattggcac atgagaatta gcaaaacccc 6121 aaacaacaca agttcaggtg accccatgcg cagtgtcagc gtgcaaacca atgtgcaggg 6181 ttttgtgcca cacttgggaa gtatacaatt tgatgaagtg tttaatcatc ccacaggtga 6241 ctacattggc accattgaat ggatttccca gccatctaca ccccctggaa cagatattaa 6301 tctgtgggag atccccgatt atgggtcatc cctttcccaa gcagctaatc tggccccccc 6361 agtgttcccc cctggatttg gtgaggccct tgtatacttt gtttctgctt tcccgggccc 6421 caataaccgc tcagccccga atgatgtacc ctgtcttctc cctcaagagt acataaccca 6481 ctttgtcagt gaacaagccc caacgatggg tgacgcagct ttactgcatt atgtcgaccc 6541 tgataccaac aggaaccttg gggagttcaa gctataccct ggaggttacc tcacctgtgt 6601 accaaacggg gtaggtgccg ggcctcaaca gctccctctc aatggtgttt ttctctttgt 6661 ttcttgggtg tctcgttttt atcagcttaa gcctgtggga acagccagta cggcaagagg 6721 taggctcgga gtacgccgta tataatggcc caagccatca taggagcaat tgccgcgtca 6781 gctgcaggct cagcattggg tgcgggcatc caggctggtg ccgaggctgc gcttcagagt 6841 caaagatacc aacaagactt agccctgcaa aggaatactt ttgaacatga taaggatatg 6901 ctttcctacc aggtccaggc aagtaatgca cttttggcaa agaatctcaa tacccgctat 6961 tctatgcttg ttgcaggggg tctttctagt gctgatgctt ctcgggctgt tgctggggcc 7021 cctgtaacac aattgattga ttggaacggc actcgggttg ccgcccccag atcaagtgca 7081 acaactctga ggtctggtgg tttcatggca gtccccatgc ctgttcaacc caaatctaag 7141 gccctgcaat cctctgggtt ttctaatcct gcttatgaca cgtccacagt ttcttctagg 7201 acttcttctt gggtgcagtc acagaattcc ctgcgaagtg tgtcaccctt tcataggcag 7261 gcccttcaaa ctgtgtgggt tactccacct gggtctactt cctcttcttc tgtttcctca 7321 acaccttatg gtgtttttaa tacggatagg atgccgctat tcgcaaattt gcggcgttaa 7381 tgttgtaata taatgcagca gtgggcacta tattcaattt ggtttaatta gtg //