Typing tool
|
Complete norovirus genomes
MK907787 | GII.3 | ||
---|---|---|---|
GII.P21 |
ORF1: 1..4899 ORF2: 4880..6526 ORF3: 6526..7220LOCUS MK907787 7220 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_016 nonstructural polyprotein (ORF1) gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene, partial cds. ACCESSION MK907787 VERSION MK907787.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7220) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7220) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7220 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_016" /isolation_source="sewage" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="08-Jan-2014" /note="genotype: GII.3-GII.P21" gene <1..4899 /gene="ORF1" CDS <1..4899 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QCO93060.1" /translation="IPPPPPNGEDEPTISYNVKEGVSGLPELSTVTQLEESSTAFSVP PLSQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELTPL SLYWRPVYTPQYLISPDTLKKLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQR TTGFFRPYQDWNRKPLPTMDDSKVKKVANVVLCALSSLFTRPIKDIIGKLKPLNILNI LATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVV MGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEKNELA MVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDMEEEKARKLSTKSASPDIVG TINALLSRIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIASSLT GDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRI ENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPEVEKAKRDFPGQP DMWKDTFKPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTASSLVARASGLLHERLD EYELQGPTPTTFNFDRNKVLAFRQLAAENKYGLVDTMRVGSQLKNVKTMTELKQALKN ISVKKCQLVYGGGTYTLESDGKGNVHVEKVNNASVQTNNELSGALHHLRCARIRYYVK CVQEALYSILQIAGAAFITTRIAKRMNIQNLWSKPQAEDLEETGNEEGCPKPKSDEEF VISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQ DRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRN PDDFKPKGKLWADDDRSVDYNERLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHV IPQGAQEFFGVSIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKR PTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKR GNDYVVIGVHTAAARGGNTVICATQGSEGEAILEGGDNKGTYCGAPILGPGNAPKLST KTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPKPSV LEAAKKTIINVLEQTIDPPQKWTYAQACASLDKTTSSGHPYHMRKNDCWNGETFTGKL ADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGTIKKRLLWGSDLSTMVRCARA FGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWDSTQQRAVLA AALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCTSQWNSIAHW LLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLKEYGLKPTRP DKTEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHEDPFETM IPHSQRPIQLMSLLGEAALHGPSFYSKISKLVISELKEGGMDFYVPRQEPMFRWMRFS DLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..789 /gene="ORF1" /product="p48" mat_peptide 790..1887 /gene="ORF1" /product="NTPase" mat_peptide 1888..2424 /gene="ORF1" /product="p22" mat_peptide 2425..2823 /gene="ORF1" /product="VPg" mat_peptide 2824..3366 /gene="ORF1" /product="Pro" mat_peptide 3367..4896 /gene="ORF1" /product="RdRp" gene 4880..6526 /gene="ORF2" CDS 4880..6526 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93062.1" /translation="MKMASNDAAPSNDGAAGLVPEINSEAMALEPVAGAAIAAPLTGQ QNIIDPWIMNNFVQAPGGEFTVSPRNSPGEVLLNLELGPEINPYLAHLARMYNGYAGG FEVQVVLAGNAFTAGKIIFAAIPPNFPIDNLSAAQITMCPHVIVDVRQLEPVNLPMPD VRNNFFHYNQGSDSRLRLIAMLYTPLRANNSGDDVFTVSCRVLTRPSPDFSFNFLVPP TVESKTKPFSLPILTISEMSNSRFPVPIDSLHTSPTENIVVQCQNGRVTLDGELMGTT QLLPSQICAFRGTLTRSTSRASDQADTATPRLFNYYWHIQLDNLNGTPYDPAEDIPGP LGTPDFRGKVFGVASQRNPDNTTRAHEAKIDTTSGRFTPKLGSLEISTESGDFDQNQP TRFTPVGIGVDHEADFQQWTLPDYAGQFTHNMNLAPAVAPNFPGEQLLFFRSQLPSSG GRSNGILDCLVPQEWVQHFYQESAPSQTQVALVRYVNPDTGRVLFEAKLHKLGFMTIA KSGDSPITVPPNGYFRFESWVNPFYTLAPMGTGNGRRRIQ" gene 6526..>7220 /gene="ORF3" CDS 6526..>7220 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93061.1" /translation="MAGAFIAGLAGDMLTNTVGSLVNAGANAINQTIDFENNKYLQNA SFNHDKEMLNAQVEATKRLQADMIAIKQGVLTAGGFSPTDAARGAINAPMTKVLDWNG TRYWAPGATSTTSMSGGFTNQTVRRSTPNFKTNQAPKPAPSSGSSVRSNSTQITSLSS HSSGSSRSSGSTVVSSIPSSNRTRDWVNQQNFNLEPHMPGSLRTAFVTPPSSTASSSG TVSTVPKNVLDSWT" ORIGIN 1 atcccccctc ccccacccaa cggggaggac gaaccaacca tttcttacaa cgtcaaagag 61 ggtgtttctg gtttgcctga actctcaact gttacccaac tggaagagag ctctacagca 121 ttcagcgttc cccctcttag tcagagggag aacagagatg caaaggaacc attgactgga 181 accatcctgg agatgtggga cggggagatc taccattatg gcttgtatgt ggaacgaggg 241 ttagtgctag gtgtacacaa accaccagca gccatcagcc ttgctaaggt tgagctgacc 301 cccttgtcct tgtattggag accagtgtat accccacagt acctcatttc ccctgacact 361 ctcaaaaagt tgcatgggga gacgttccct tatacagcct ttgataacaa ctgttatgcc 421 ttctgctgtt gggtacttga cttgaacgat tcatggctga gcagaaggat gatacagagg 481 acaacaggct tcttccgacc ctaccaggac tggaacagga aaccccttcc cacgatggat 541 gactccaaag tgaaaaaggt ggccaatgtt gtcctctgtg cactctcgtc attgtttaca 601 agaccaatta aggacattat tgggaagttg aagcccctaa acatcctcaa catcctagcc 661 acatgtgatt ggacttttgc aggcatagtg gaatcactaa tccttttggc tgaactgttt 721 ggagttttct ggacaccccc agatgtgtct gcaatgatcg ctcccttact aggtgattat 781 gagctgcagg gacccgagga cctcgctgtg gaactcgtac ccgtagtaat gggagggata 841 ggtttggtgc tgggattcac caaagagaaa atcggcaaaa tgctttcatc tgctgcttca 901 accctgaggg catgcaaaga tctcggtgca tatgggttgg agatcctcaa gttggtcatg 961 aaatggtttt tcccaaagaa agaagagaaa aacgaattgg caatggtgag atccatcgag 1021 gatgcagtgc tagatcttga ggccattgag aacaaccaca tgacggctct actcaaggat 1081 aaagacagcc ttgcaaccta catgaggact cttgacatgg aggaggagaa ggcgaggaaa 1141 ctttccacca agtctgcctc gccggatatt gtgggcacga taaatgccct actgtcaagg 1201 attgcagctg cccggtctct agtgcacagg gctaaggagg aactgtcaag caggccccga 1261 ccggtcgttg tgatgatttc agggagacca ggcataggga aaacccacct agctagagag 1321 ctggcaaaga agatcgcctc ctcactcaca ggtgaccaga gggtgggtct tatcccacgt 1381 aacggggtcg accactggga tgcatacaaa ggtgaaaggg tcgttctctg ggacgactac 1441 gggatgagta accctatcca cgacgccctc agactccaag agctcgctga tacctgtcct 1501 cttacactta actgtgacag gattgagaac aaaggcaaag tctttgacag tgacgccata 1561 ataataacca ccaatctggc taacccagcg ccactggatt acgtcaactt tgaagcctgc 1621 tcaagacgca tagacttcct cgtctatgct gatgcccctg aagttgagaa ggctaaacga 1681 gacttcccag gccaaccaga catgtggaaa gacaccttta aacctgactt ctcacacata 1741 aaattggcat tagccccgca aggaggtttt gacaagaatg gtaacactcc tcatgggaag 1801 ggtgtcatga agaccctcac tgccagttcc ctcgttgccc gtgcatcggg actcctccac 1861 gagagattag acgaatatga gctgcagggc ccaactccca caacattcaa cttcgaccgc 1921 aacaaggtgc ttgcctttag gcagcttgct gctgaaaaca agtatggcct tgtcgacaca 1981 atgagggtcg gatcacagct caagaatgtc aaaactatga cagagctcaa gcaggctctt 2041 aagaacatct cagtcaagaa atgtcagctt gtgtatggtg ggggcacgta cacactcgaa 2101 tctgatggca aaggcaatgt gcatgtcgag aaggtgaaca acgccagtgt tcaaacaaac 2161 aacgagctct ccggggcttt gcaccatctc aggtgtgcca gaatcaggta ctatgttaag 2221 tgtgtccagg aagctctcta ctccatcttg caaattgctg gggctgcatt cattaccaca 2281 cgcattgcaa agcgcatgaa catacaaaac ctctggtcca aaccacaggc agaagatctg 2341 gaggagactg gtaatgagga gggttgtcca aagcctaaaa gtgatgagga gtttgtcatc 2401 tcctctgatg acatcaaaac tgaaggcaag aaaggaaaga ataagaccgg ccgtggcaaa 2461 aagcacacag ccttttccag caaaggactc agtgatgagg agtacgatga atataaaaga 2521 atcagggaag aaaggaatgg caagtactcc atagaggaat acctacagga tagagacaag 2581 tactatgagg aggtggccat agccagggca actgaggaag acttctgtga agaagaggag 2641 gcaaagatcc ggcaaaggat atttagacca acaaggaaac aacgcaaaga ggaaagggcc 2701 tctcttggtt tggttacagg ttccgagatc aggaagagaa acccagatga cttcaagccc 2761 aaagggaaac tgtgggccga tgatgacagg agtgttgact acaatgagag acttagtttc 2821 gaggctccac caagcatttg gtcacgaata gtcaactttg gctcagggtg gggcttttgg 2881 gtctcgccca gcctcttcat aacatcaacc catgttatcc cccaaggcgc acaggagttc 2941 tttggagtgt ccatcaaaca aatacaaatt cacaagtcag gtgagttctg ccggcttagg 3001 ttcccaaaac caatcaggac agatgtcaca ggtatgatcc tggaagaggg tgctccagaa 3061 ggcactgttg ccacacttct catcaagaga ccaactgggg aactcatgcc cttggcagcc 3121 agaatgggca cccacgcgac catgaaaatt caaggtcgca ctgttggtgg acagatgggc 3181 atgttactta cagggtctaa cgctaagagt atggacctgg gtacaactcc tggcgattgt 3241 ggttgtccct acatctacaa aagggggaac gactacgtgg tcattggggt tcacactgcc 3301 gccgctcgtg gaggaaacac cgtcatctgt gcaactcaag gaagcgaggg tgaggccata 3361 ttagaaggtg gtgacaacaa gggaacttac tgtggagcac caatattggg ccctggaaat 3421 gctcccaaac tcagcaccaa aaccaaattt tggagatctt ccaccacccc cctaccaccc 3481 ggaacctatg agccagccta tctgggtggc aaggacccta gagtgaaagg tggcccctca 3541 ctgcaacagg ttatgaggga ccaactaaaa ccattcactg agcccagagg taaaccaccc 3601 aaaccaagtg tgctagaagc cgccaagaag actataatca atgtgcttga acaaacaata 3661 gacccacctc agaaatggac atacgcacag gcgtgtgcat cactagacaa gaccacttcc 3721 agcggccacc cttaccacat gcggaagaac gattgctgga atggggagac tttcacagga 3781 aaactggcag atcaagcatc aaaggctaac ctaatgtttg aggaagggaa gaacatgacc 3841 ccagtataca caggggctct gaaagatgag ttagtcaaga ctgataagat ctatggaaca 3901 atcaagaaga gactcctttg gggttcagac ctatcgacca tggtacggtg tgcacgagcc 3961 ttcggtgggc taatggacga actcaaggcc cattgcgtca cactaccagt cagggttggt 4021 atgaacatga atgaggatgg acccataatc tttgagaaac attccagata taaatatcat 4081 tatgatgcag actactcccg ttgggattca acacaacaaa gagcagtttt ggctgcagcc 4141 ctggaaataa tggtcaaatt ctcaccagaa ccccacctgg cccaagtggt tgcagaagac 4201 ctcttatccc ccagtgtgat ggatgtgggt gacttcaaaa tatcaattaa cgaggggtta 4261 ccctctggtg ttccctgcac ctcacaatgg aactccattg cccactggct cctcacacta 4321 tgtgcactgt ctgaggttac agacctgtct cctgacatta tccaggcaaa ttcactgttc 4381 tccttttatg gtgatgatga aatagtgagt acagatatca aactggaccc agaaaaactg 4441 acaacaaaat tgaaggaata cgggctaaag ccaacccgtc ctgacaaaac agaaggaccc 4501 ttaatcatct ctgaagattt ggatggcctg accttcttac ggagaacggt gacccgtgat 4561 ccggccgggt ggtttggcaa attggaccaa agttcaatac tcaggcagat gtactggacc 4621 aggggaccaa accatgagga ccccttcgaa acaatgatac cacactccca aagacccata 4681 caactgatgt cactattggg tgaagctgcg ttgcatggcc catcattcta cagtaaaatc 4741 agcaaattgg tcatctcaga attgaaagag ggtggaatgg atttttacgt gcccagacaa 4801 gaaccaatgt tcaggtggat gagattctca gatttgagca cgtgggaggg cgatcgcaat 4861 ctggctccca gttttgtgaa tgaagatggc gtcgaatgac gccgctccat ctaatgatgg 4921 tgccgccggc ctcgtcccag agatcaacag tgaggcaatg gcgctagagc cagtggcggg 4981 tgcagcgata gcagcacccc tcactggcca gcaaaatata attgatccct ggattatgaa 5041 taactttgtg caagcacctg gtggtgagtt cacagtatcc cctaggaatt cccctggtga 5101 agttcttctc aatttggaat tgggtccaga aataaatccc tacttggctc atcttgctag 5161 gatgtataat ggttatgcag gtggatttga agtgcaggtg gtcctagctg gaaatgcgtt 5221 tacagcagga aagataatct ttgcagctat tccccctaat tttccaattg ataatctaag 5281 tgcagcacag atcacaatgt gtccacatgt gattgtggat gttaggcagt tggaaccagt 5341 caacctcccg atgcctgacg ttcgcaataa ctttttccat tataatcagg ggtctgattc 5401 gagattgcgc ctaattgcaa tgttatacac acctcttagg gcaaataact ctggggatga 5461 tgttttcact gtgtcttgca gagtgctaac tagacctagt cctgacttct catttaattt 5521 ccttgtgcca ccaactgtgg agtcaaagac aaaacccttt tccctcccca ttctgactat 5581 ctctgaaatg tccaattcca ggttcccagt accaattgat tctctgcaca ccagccctac 5641 tgagaacatt gttgtccagt gccagaatgg acgcgtcacc cttgatggtg agttgatggg 5701 caccactcaa cttttaccta gccaaatctg tgctttcagg ggcacgctta ccagatcaac 5761 aagcagagcc agtgaccagg ccgatacagc aacccctaga ttgtttaatt attattggca 5821 tatacaattg gataacctaa atggaactcc ttatgaccct gcagaagata taccaggccc 5881 cctagggaca ccagatttcc ggggcaaagt ctttggcgtg gctagccaga gaaatcctga 5941 taacacgact agggcacatg aagcaaagat agacacaaca tctggccgct tcaccccaaa 6001 attaggctca ttagagattt ccactgaatc tggagatttt gatcaaaacc aaccaacaag 6061 attcacccca gttggcattg gggttgacca tgaggcagat ttccaacaat ggactcttcc 6121 cgactacgct ggccagttca cccacaacat gaacttagcc ccagctgttg ctcccaactt 6181 ccctggtgag caactccttt tcttccgctc acaattacca tcttccggtg ggcggtccaa 6241 cggtattcta gactgcctgg tcccccaaga atgggtgcag cacttctatc aagaatcagc 6301 cccctcccaa actcaagtgg ccctggttag gtatgtcaac cccgacactg gtagagtgtt 6361 atttgaggcc aagctgcaca aactaggttt catgactata gccaagagtg gtgactctcc 6421 aataactgtc cccccaaatg gatacttcag gtttgaatct tgggtgaacc ccttttatac 6481 acttgccccc atgggaactg ggaatgggcg tagaaggatt caataatggc tggagctttt 6541 atagcaggat tggctggtga tatgctcaca aatactgtag gatctttagt taatgcaggg 6601 gctaatgcta ttaatcaaac aattgatttt gaaaataata aatatttgca aaatgcctct 6661 tttaatcatg ataaggagat gttgaatgca caagttgagg caacaaagag gttacaggct 6721 gacatgattg ctatcaaaca aggggttttg accgctggcg gcttctcccc tactgatgca 6781 gcccgcgggg caattaatgc tcctatgaca aaagtcctag attggaatgg aacgagatac 6841 tgggcgccag gtgccacctc cacaacctcg atgtcaggtg gcttcacaaa tcaaactgtg 6901 cgcagatcca caccaaattt taaaacgaac caggccccca aacccgcacc cagcagtggg 6961 tcttcagtga ggtcaaattc aacccaaatc actagcctga gctcacactc gtccgggtcg 7021 tctcggtcca gcgggtctac agttgtcagc tcaataccgt cctctaacag gactagggac 7081 tgggttaacc aacaaaattt taatctggaa ccacacatgc ctggatctct taggacagct 7141 tttgtcactc caccatctag tacagcctct agctcaggca cagtctcaac tgtgcccaaa 7201 aatgttttgg actcctggac //