Typing tool
|
Complete norovirus genomes
MK907791 | GII.17 | ||
---|---|---|---|
GII.P17 |
ORF1: 1..4411 ORF2: 4392..6014 ORF3: 6014..6509LOCUS MK907791 6509 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_022 nonstructural polyprotein (ORF1) gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene, partial cds. ACCESSION MK907791 VERSION MK907791.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 6509) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 6509) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..6509 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_022" /isolation_source="sewage" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="05-May-2014" /note="genotype: GII.17-GII.P17" gene <1..4411 /gene="ORF1" CDS <1..4411 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QCO93072.1" /translation="FFKPYQDWNRKPLPTMDEPKIKKAANAVLCALSSLFTRPIKDII GKLRPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYEMQGP EDLAVELVPVVMGGIGLVLGFTKEKIGKMLSSAATTLRACKDLGSYGLEILKLVMKWF FPKKEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARRL STKSASPDIVGTINALLARIAAARSLVHKAKEELSSRQRPVVVMISGRPGIGKTHLAR ELAKKVASTLSGDQRIGLVPRNGVDHWDAYKGERVVLWDDYGMSNPIQDALRLQELAD TCPLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPDI EKAKRDFPGQPDMWKDHFRPDFSHIKLQLAPQGGFDKNGNTPHGKGVVKSLTIGSLIA RASGLLHERMDEFELQGSDLPTFNFDRNKVAAFRQLAAENKYGLMDTLRVGNQLKSVK TLDELKQAIKNISIKKCQIVYNGCTYTMESDGRGNVVVEKVQNATVQTNNELVGALHH LRSARIRYYVKCFQEAIYSLLQIAGAAFVTSRIVRRMNISNLWSKPPIEEGDEPEDRG GCPKPRDEDDLTIDSRDIKVEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREER NGKYSIEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLG LVTGSEIRKRNPDDFKPKGKLWADDNRSVDYNERIDFEAPPSVWSRIVNFGTGWGFWV SPSLFITSTHVIPKGITEAFGVPINQIQIHKSGEFCRLRFPKPIRPDVSGMILEEGAP EGTVVSILIKRTTGELMPLAVRMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTP GDCGCPYIYKRGNDLVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPI LGPGNAPKLSTKTKFWRSSNAPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFT EPRGKPPNPNVLESAKKTIINVLEQTIDPPQKWSYAQACASLDKTTSSGYPHHVRKND YWSGESFTGKLADQASKANLMYEEGKHMQPVYTAALKDELVKTDKIYGKIKKRLLWGS DLSTMIRCARAFGGLMDEFKANCITLPIRVGMNMNEDGPIIFEKHSRYRYHYDADYSR WDSTQQRAVLEAALEIMVRFSAEPQLAQIVAEDLLSPSVVDVGDFKIAINEGLPSGVP CTSQWNSIAHWLLTLCALSEVTGLGPDIIQANSMYSFYGDDEIVSTDIKLDPEKLTAK LKEYGLKPTRPDKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLDQNSILRQLYWTR GPNHEDPSETMIPHAQRPVQLMALLGESSLHGPSFYSKVSKLVISELKEGGMDFYVPR QESMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..301 /gene="ORF1" /product="p48" mat_peptide 302..1399 /gene="ORF1" /product="NTPase" mat_peptide 1400..1936 /gene="ORF1" /product="p22" mat_peptide 1937..2335 /gene="ORF1" /product="VPg" mat_peptide 2336..2878 /gene="ORF1" /product="Pro" mat_peptide 2879..4408 /gene="ORF1" /product="RdRp" gene 4392..6014 /gene="ORF2" CDS 4392..6014 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93073.1" /translation="MKMASNDAAPSNDGAAGLVPEGNNETLPLEPVAGAAIAAPVTGQ NNIIDPWIRTNFVQAPNGEFTVSPRNSPGEILLNLELGPDLNPYLAHLSRMYNGYAGG VEVQVLLAGNAFTAGKILFAAVPPNFPVEFLSPAQITMLPHLIVDVRTLEPIMIPLPD VRNTFFHYSNQPNSRMRLVAMLYTPLRSNGSGDDVFTVSCRVLTRPTPDFEFTYLVPP SVESKTKPFSLPILTLSELTNSRFPVPIDSLFTAQNNVLQVQCQNGRCTLDGELQGTT QLLPSGICAFRGRVTAQINQRDRWHMQLQNLNGTTYDPTDDVPAPLGTPDFKGVVFGM VSQRNVGNDAPGSTRAQQAWVSTYSPQFVPKLGSVNLRISDNDDFQFQPTKFTPVGVN DDDDGHPFRQWELPNYSGELTLNMNLAPPVAPNFPGEQLLFFRSFVPCSGGYNQGIID CLIPQEWIQHFYQESAPSQSDVALIRYVNPDTGRTLFEAKLHRSGYITVAHSGDYPLV VPANGHFRFDSWVNQFYSLAPMGTGNGRRRAQ" gene 6014..>6509 /gene="ORF3" CDS 6014..>6509 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93074.1" /translation="MAGAFIAGLAGDMLTSSVGSLVNAGANAINQKIDFENNKQLQSA SFQHDKEMLQAQVRATKQLQSEMIALKQGVLAAGGFSPTDAARGSIGAPMTKVLDWSG TRYWAPNSTKTTGYSGQFTSSPVHMSSPNAPQSKPAKPRSLAPSSSSSSVYSMYTQST HLTSG" ORIGIN 1 attcttcaaa ccttatcagg attggaacag aaaaccctta ccaaccatgg atgaaccgaa 61 aatcaagaag gccgcgaatg ctgtcctatg cgctctctcc tcactcttca ctagacccat 121 taaggacatc attggaaagc ttagaccact gaacattctt aatatactgg caacctgcga 181 ttggactttt gcaggcatag tggaatcctt gatccttctt gctgaactct ttggggtgtt 241 ctggacaccc ccagatgtgt ctgcgatgat tgctccctta ctcggtgact acgagatgca 301 aggccccgaa gacctggccg tggaacttgt gcccgtggtg atgggaggaa taggtttggt 361 gttaggattc accaaagaaa agatcggcaa gatgctttca tcagctgcga ccacacttag 421 ggcctgcaag gatctaggat cttatggatt ggagatactc aaattggtca tgaagtggtt 481 cttccccaag aaggaggagg ctaacgaact agccatggtg agggccattg aggatgcagt 541 cctggatttg gaggcaatag aaaacaatca catgaccacc cttctcaagg ataaagacag 601 cctggccaca tacatgagga ctcttgactt ggaagaagag aaagcaagga gactgtcaac 661 caaatcggcg tcacctgaca tcgtgggtac aataaacgca ttattggcta ggatagcagc 721 cgccaggtca ctggtccaca aggctaagga ggagctctcg agcagacaga gacctgttgt 781 tgtaatgata tcaggtagac caggtatagg aaagacccat ctggccagag agctagccaa 841 gaaagttgcg tcaacactgt caggcgacca aaggattgga ctggtaccta gaaatggcgt 901 cgaccattgg gacgcctaca agggagaaag ggtggttctg tgggatgatt atggtatgag 961 caaccctata caggacgcac tgaggctcca agaattggct gacacctgtc ctctgaccct 1021 aaattgtgat aggattgaaa acaaaggaaa agtgttcgac agtgatgcca taattataac 1081 aaccaacctt gcaaacccag caccactaga ctatgtcaac tttgaggcct gctctagacg 1141 catagacttc ttggtgtatg cagacgcacc tgacattgag aaggctaagc gtgacttccc 1201 tggccaacca gatatgtgga aagaccattt cagaccagac ttctcacaca ttaaacttca 1261 gctggcacca cagggaggtt ttgacaaaaa tggcaacacc ccacatggca aaggtgtagt 1321 gaagtccctg acgattggtt ctctgatcgc cagggcctcc ggcttgctcc acgagagaat 1381 ggatgaattc gagctccaag gctctgacct gccaaccttc aattttgacc gcaacaaagt 1441 cgccgccttc agacaattag cagctgaaaa caaatatggt cttatggaca cactgagggt 1501 cggcaaccaa ctgaaaagtg ttaagaccct ggatgagctg aagcaggcca taaagaacat 1561 cagtatcaag aagtgtcaga tagtgtataa tggatgcacc tataccatgg aatctgatgg 1621 gaggggcaat gttgtggttg aaaaggtaca aaacgcaaca gttcaaacca acaatgaact 1681 tgtaggggcg ctccaccacc tgagatctgc aaggataaga tactatgtta aatgtttcca 1741 agaagccatt tactctctac tccaaattgc tggtgcagcc ttcgtcacct cacgcatcgt 1801 gaggcgcatg aacatatcaa acctctggtc gaagccaccc atcgaggagg gtgatgagcc 1861 tgaagacaga ggaggatgcc ccaagcccag ggatgaagat gacctcacta ttgactctag 1921 agacattaaa gtggaaggga agaagggcaa gaacaagtct ggccggggca agaaacacac 1981 agccttctct tccaagggcc tcagtgatga agagtacgat gagtacaaga gaatcagaga 2041 ggagaggaat ggcaaatact caatagagga atacctccaa gatagggaca gatactatga 2101 agagcttgcc attgctaagg ccactgagga ggacttctgt gaggaggagg agatcaaaat 2161 ccgccagaga atcttccgac ccaccaggaa gcagcggaaa gaggaaaggg ccacgcttgg 2221 gcttgtcaca ggatcggaaa tcagaaagag gaatccagat gactttaaac caaaaggaaa 2281 attgtgggct gatgacaaca ggagcgtgga ttacaatgag aggatagatt ttgaggctcc 2341 cccgagtgtt tggtctagga tagtcaactt tggcacaggc tggggattct gggtttctcc 2401 aagtctcttt ataacttcaa cacacgtcat accaaaagga atcactgagg cctttggagt 2461 gcccataaac caaatccaaa ttcacaaatc tggagaattt tgccgcttgc ggtttccaaa 2521 gccaattaga ccagacgtga gtgggatgat cttggaagag ggcgcccctg aaggcactgt 2581 tgtgtctatt ctcatcaaaa ggacaacagg ggagctgatg cctcttgcag tcagaatggg 2641 gactcatgcc acaatgaaaa tccaaggtag aacggtgggt ggtcagatgg gtatgcttct 2701 cactggctca aatgctaaaa gcatggatct aggcacaacg ccaggtgact gcggatgtcc 2761 atacatatac aaaaggggca atgaccttgt ggtcattggg gtgcacactg cagcagctcg 2821 tggaggcaac acggtcatat gtgccacaca gggaagtgaa ggtgaagcca ctcttgaagg 2881 aggtgacaac aagggtacct attgtggagc ccccattttg ggacctggca atgcaccaaa 2941 attgagcaca aagacaaaat tctggagatc ctctaacgca ccgcttcccc caggtactta 3001 tgaaccagca taccttggtg ggagagaccc gcgcgtgaag ggaggcccat ctctgcaaca 3061 agttatgaga gaccaactta aaccttttac agagcccaga ggtaagccac caaacccaaa 3121 tgtcttagag tcagcaaaga agactatcat caatgttctg gaacaaacca ttgacccccc 3181 tcaaaagtgg tcatatgccc aagcttgtgc atccctcgac aagacaacct ccagtggata 3241 cccgcaccac gtccggaaga atgactattg gagtggtgaa tccttcacag gaaaacttgc 3301 agatcaggct tcaaaagcaa atctcatgta tgaggaaggc aagcacatgc aaccagttta 3361 caccgcagca ctcaaagacg agctcgtgaa aactgacaaa atttatggca aaatcaagaa 3421 aagactcctg tggggttctg acctctccac aatgattcga tgcgccagag catttggtgg 3481 tctcatggat gaattcaaag caaattgtat tacactccct atcagagttg gtatgaatat 3541 gaatgaagat ggtcccataa tatttgagaa acactccagg tacaggtacc actatgatgc 3601 tgattactcc cgctgggact ccacacaaca gcgggcagtg ctggaagcgg cacttgaaat 3661 catggtgaga ttttctgctg agccacagct ggcacaaata gtggcagagg acctgctgtc 3721 accaagtgtg gttgatgtgg gcgatttcaa aatcgctatc aatgaaggcc taccatctgg 3781 cgtgccctgc acctcacaat ggaattctat tgcccactgg ttacttacct tgtgtgccct 3841 ttctgaagtg acaggattag gtcctgacat catacaagct aactccatgt actctttcta 3901 tggtgatgat gagattgtga gcacagacat aaaattggac ccagagaaat tgaccgcaaa 3961 gctcaaagaa tatggcctta aacccactcg gcccgacaaa actgaggggc cgttggtgat 4021 tagtgaggac ctgaatgggt tgactttcct ccgccgaaca gtcacccgtg atccagcagg 4081 ttggtttgga aagttggacc aaaactccat cctcaggcag ttgtactgga caagaggacc 4141 caaccatgaa gaccccagtg agaccatgat accacacgca caaagacctg tgcagctcat 4201 ggcactacta ggagaatcct ccctacatgg accctcattt tacagcaagg ttagcaaatt 4261 agtcatatct gaacttaaag agggaggaat ggatttttat gtgcccagac aagagtcaat 4321 gttcagatgg atgaggttct cagatctaag cacatgggag ggcgatcgca atctggctcc 4381 cagttttgtg aatgaagatg gcgtcgaatg acgccgctcc atctaatgat ggtgctgctg 4441 gtctcgtacc agagggcaac aacgagaccc ttcccctaga accagttgcg ggcgcagcta 4501 tagccgcacc cgtcactggc caaaataaca taattgaccc ctggattaga acaaattttg 4561 tgcaagcacc aaatggagag ttcacagtgt cacccagaaa ctctcctgga gaaattttat 4621 taaacttaga gttgggccct gatttgaacc cttatttggc tcatttgtca aggatgtaca 4681 atgggtatgc tggtggagtg gaagttcagg ttctcctggc agggaacgcg ttcactgccg 4741 gaaagatcct cttcgccgcc gtcccgccaa atttcccagt ggaattctta agcccagccc 4801 agatcacaat gctcccacat ttaatagtag atgttaggac tcttgaacca attatgatcc 4861 cactccctga tgttaggaat acattcttcc attatagtaa ccagcctaac agccgcatga 4921 gattagtggc tatgctctat accccactca gatctaatgg ctcaggtgat gatgtcttta 4981 ctgtctcttg cagggttttg actaggccta ctcctgattt tgagttcact tatttagtgc 5041 caccttctgt tgaatctaaa actaagcctt tttccttacc tattttaacc ctttctgagc 5101 tcacaaattc gaggttccca gtccccatcg attcgctttt caccgcccag aataatgtgt 5161 tgcaggtgca gtgtcaaaat ggcaggtgta cacttgatgg tgagttacaa ggcacaaccc 5221 agttgctccc atctggcatc tgtgcattca gaggacgggt gacagcacaa attaaccaac 5281 gtgacaggtg gcacatgcaa ctgcaaaacc tcaatggtac aacatatgac ccaactgatg 5341 atgtgccagc cccgctgggt acacctgact tcaagggcgt cgtgtttggg atggtaagcc 5401 aaagaaatgt gggtaatgat gcgcctggct caaccagagc ccaacaggcg tgggtttcaa 5461 cctatagccc ccaatttgtc cccaaattag gttctgtcaa tcttagaatt agtgataatg 5521 atgatttcca attccagccg acaaaattca caccagtggg cgtcaatgat gacgatgatg 5581 gccacccgtt cagacaatgg gaactaccaa actattcagg ggagcttacc ttgaatatga 5641 atcttgcccc cccagttgct ccaaattttc ctggtgaaca attgttattc ttcagatctt 5701 tcgtgccatg ctcaggaggt tacaaccaag gtattataga ttgtcttatt ccccaagaat 5761 ggatccaaca cttctatcag gaatcagcac cctcccagtc agacgtggcc ctaatcaggt 5821 atgtcaaccc cgatacggga cgtacactgt ttgaagcaaa attgcacaga tctggttaca 5881 ttactgtggc tcactctgga gactatcctc ttgttgttcc ggctaatgga cactttagat 5941 ttgattcttg ggtaaatcag ttttactcac tcgccccaat gggaactggg aatgggcgaa 6001 ggagggctca gtaatggctg gggcttttat tgcaggattg gcaggcgaca tgctcacgtc 6061 atctgtgggc tcccttgtga acgcaggggc aaacgccatc aaccaaaaga tagactttga 6121 aaacaacaaa caactccagt ctgcttcctt tcagcatgat aaagagatgc tccaagcgca 6181 ggtgagggca accaagcagc tgcaatctga aatgatagcc ctaaaacagg gggttttggc 6241 cgcaggcggc ttttccccca ctgatgcagc aaggggatcc attggtgcac ccatgacaaa 6301 ggtgcttgac tggtctggca ctcgatactg ggcgcccaat tccacaaaga caactggcta 6361 ttcgggacaa ttcacctctt cacctgtgca tatgtctagc ccaaatgctc cacaatcaaa 6421 acctgcaaag cctaggtctc tagctccttc ctcctcttct agcagtgtct atagtatgta 6481 tactcaatct actcatttaa catctggct //