Typing tool
|
Complete norovirus genomes
MK907800 | GII.17 | ||
---|---|---|---|
GII.P17 |
ORF1: 1..5100 ORF2: 5081..6703 ORF3: 6703..7482LOCUS MK907800 7528 bp RNA linear VRL 16-APR-2020 DEFINITION Norovirus GII isolate G19_036 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK907800 VERSION MK907800.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7528) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7528) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7528 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_036" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="20-Oct-2016" /note="genotype: GII.17-GII.P17" gene <1..5100 /gene="ORF1" CDS <1..5100 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QCO93097.1" /translation="ASNDASAAVAGKNNNNDKEKSSSDSLFANMSVTFKKALGARSKQ PPPGETKQIQTPPRPPTPELVKRIPPPPPNGEDEPGIVYKVGEGVSGLPDLTTVVQPD AQNTAYSVPPLSQREVGEAKEPLPGSILEMWDGEIYHYGLYVEQGHVLGVHKPPAAIS LAKIEITPLSLYWRVVYTPQYLIDPGTLKNLSGETFPYTAFDNNCYAFCCWVLDLNDS WLSRRMIQRTTGFFKPYQNWNRKPLPTMDEPKIKKAANAVLCALSSLFTRPIKDIIGK LRPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYEMQGPED LAVELVPVVMGGIGLVLGFTKEKIGKMLSSAATTLRACKDLGSYGLEILKLVMKWFFP KKEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST KSASPDIVGTINALLARIAAARSLVHKAKEELSSRQRPVVVMISGRPGIGKTHLAREL AKKVASTLSGDQRIGLVPRNGVDHWDAYKGERVVLWDDYGMSNPIQDALRLQELADTC PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPDIEK AKRDFPGQPDMWKDHFRPDFSHIKLQLAPQGGFDKNGNTPHGKGVVKSLTIGSLIARA SGLLHERMDEFELQGSDLPTFNFDRNKVAAFRQLAAENKYGLMDTLRVGNQLKSVKTL DELKQAIKNISIKKCQIVYNGCTYTMESDGRGKVVVEKVQNATVQTNNELVGALHHLR SARIRYYVKCFQEAIYSLLQIAGAAFVTSRIVRRMNISNLWSKPPIEEGDEPEDRGGC PKPRDEDDLTIDSRDIKVEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLV TGSEIRKRNPDDFKPKGKLWADDNRSVDYNERIDFEAPPSVWSRIVNFGTGWGFWVSP SLFITSTHVIPKGITEAFGVPINQIQIHKSGEFCRLRFPKPIRPDVSGMILEEGAPEG TVVSILIKRTTGELMPLAVRMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD CGCPYIYKRGNDLVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILG PGNAPKLSTKTKFWRSSNAPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEP RGKPPNPNVLESAKKTIINVLEQTIDPPQKWSYAQACASLDKTTSSGYPHHVRKNDYW SGESFTGKLADQASKANLMYEEGKHMQPVYTAALKDELVKTDKIYGKIKKRLLWGSDL STMIRCARAFGGLMDEFKANCITLPIRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD STQQRAVLEAALEIMVRFSAEPQLAQIVAEDLLSPSVVDVGDFKIAINEGLPSGVPCT SQWNSIAHWLLTLCALSEVTGLGPDIIQANSMYSFYGDDEIVSTDIKLDPEKLTAKLK EYGLKPTRPDKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLDQNSILRQLYWTRGP NHEDPSETMIPHAQRPVQLMALLGESSLHGPSFYSKVSKLVISELKEGGMDFYVPRQE SMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..990 /gene="ORF1" /product="p48" mat_peptide 991..2088 /gene="ORF1" /product="NTPase" mat_peptide 2089..2625 /gene="ORF1" /product="p22" mat_peptide 2626..3024 /gene="ORF1" /product="VPg" mat_peptide 3025..3567 /gene="ORF1" /product="Pro" mat_peptide 3568..5097 /gene="ORF1" /product="RdRp" gene 5081..6703 /gene="ORF2" CDS 5081..6703 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93098.1" /translation="MKMASNDAAPSNDGAAGLVPEGNNETLPLEPVAGAAIAAPVTGQ NNIIDPWIRTNFVQAPNGEFTVSPRNSPGEILLNLELGPDLNPYLAHLSRMYNGYAGG VEVQVLLAGNAFTAGKILFAAVPPNFPVEFLSPAQITMLPHLIVDVRTLEPIMIPLPD VRNTFFHYSNQPNSRMRLVAMLYTPLRSNGSGDDVFTVSCRVLTRPTPDFEFTYLVPP SVESKTKPFSLPILTLSELTNSRFPVPIDSLFTAQNNVLQVQCQNGRCTLDGELQGTT QLLPTGICAFRGRVTAQINQRDRWHMQLQNLNGTTYDPTDDVPAPLGTPDFKGVVFGM VSQRNVGNDAPGSTRAQQAWVSTYSPQFVPKLGSVNLRISDNDDFQFQPTKFTPVGVN DDDDGHPFRQWELPNYSGELTLNMNLAPPVAPNFPGEQLLFFRSFVPCSGGYNQGIID CLIPQEWIQHFYQESAPSQSDVALIRYVNPDTGRTLFEAKLHRSGYITVAHSGDYPLV VPANGHFRFDSWVNQFYSLAPMGTGNGRRRAQ" gene 6703..7482 /gene="ORF3" CDS 6703..7482 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93099.1" /translation="MAGAFIAGLAGDMLTSSVGSLVNAGANAINQKIDFENNKQLQSA SFQHDKEMLQAQVRATKQLQSEMIALKQGVLAAGGFSPTDAARGSIGAPMTKVLDWSG TRYWAPNSTKTTGYSGQFTSSPVHMSSPNAPQSKPAKPRSLAPSSSSSSVYSMYTQST HLTSGSSSNASSASTKLTNLSSGSSQNRTAEWVNQQRSLSPFMSGALNISHATPPSSR ASSSGTVSTVPKEVLDSWTSAFNTHRQPLFAHLRVRGESRV" ORIGIN 1 gcgtctaacg acgcttccgc tgccgttgct ggcaaaaaca acaacaacga caaggaaaaa 61 tcttcaagtg acagcttgtt tgctaacatg tctgtcactt ttaagaaagc cctcggggcg 121 cggtctaaac aaccgccccc gggagagaca aaacaaatac aaacaccacc aaggccaccg 181 acaccggaat tggtgaagag aatacctcca cccccaccca atggcgagga tgaaccaggg 241 atagtgtaca aggtgggaga gggtgtgtct gggctgcccg acctaacaac tgtggtacaa 301 cctgacgcac aaaacacagc ttatagtgtt cccccactta gccagagaga agtcggcgaa 361 gctaaagaac cgctgcctgg ctccattctg gaaatgtggg atggtgagat ctaccactat 421 gggctgtatg ttgagcaggg gcatgtgctc ggagtacaca aaccacctgc tgcaataagc 481 cttgccaaaa ttgaaataac accgctgtct ctctactgga gagtggtcta cactccccag 541 tatttgatag atccaggaac tcttaagaac ttgagtggag agactttccc atacacggcc 601 tttgacaaca actgttatgc cttctgttgt tgggtcctag acctcaacga ctcctggctt 661 agcaggagaa tgatccaaag aacaactgga ttcttcaaac cttatcagaa ttggaacaga 721 aaacccttac caaccatgga tgaaccgaaa atcaagaagg ccgcgaatgc tgtcctatgc 781 gctctctcct cactcttcac tagacccatt aaggacatca ttggaaagct tagaccactg 841 aacattctta atatactggc aacctgcgat tggacttttg caggcatagt ggaatccttg 901 atccttcttg ctgaactctt tggggtgttc tggacacccc cagatgtgtc tgcgatgatt 961 gctcccttac tcggtgacta cgagatgcaa ggccccgaag acctggccgt ggaacttgtg 1021 cccgtggtga tgggaggaat aggtttggtg ttaggattca ccaaagaaaa gatcggcaag 1081 atgctttcat cagctgcgac cacacttagg gcctgcaagg atctaggatc ttatggattg 1141 gagatactca aattggtcat gaagtggttc ttccccaaga aggaggaggc taacgaacta 1201 gccatggtga gggccattga ggatgcagtc ctggatttgg aggcaataga aaacaatcac 1261 atgaccaccc ttctcaagga taaagacagc ctggccacat acatgaggac tcttgacttg 1321 gaagaagaga aagcaaggaa actgtcaacc aaatcggcgt cacctgacat cgtgggtaca 1381 ataaacgcat tattggctag gatagcagcc gccaggtcac tggtccacaa ggctaaggag 1441 gagctctcga gcagacagag acctgttgtt gtaatgatat caggtagacc aggtatagga 1501 aagacccatc tggccagaga gctagccaag aaagttgcgt caacactgtc aggcgaccaa 1561 aggattggac tggtacctag aaatggcgtc gaccattggg acgcctacaa gggagaaagg 1621 gtggtcctgt gggatgatta tggtatgagc aaccctatac aggacgcact gaggctccaa 1681 gaattggctg acacctgtcc tctgacccta aattgtgata ggattgaaaa caaaggaaaa 1741 gtgttcgaca gtgatgccat aattataaca accaaccttg caaacccagc accactagac 1801 tatgtcaact ttgaggcctg ctctagacgc atagacttct tggtgtatgc agacgcacct 1861 gacattgaga aggctaagcg tgacttccct ggccaaccag atatgtggaa agaccatttc 1921 agaccagact tctcacacat taaacttcag ctggcaccac agggaggttt tgacaaaaat 1981 ggcaacaccc cacatggcaa aggtgtggtg aagtccctga cgattggttc tctgatcgcc 2041 agggcctccg gcttgctcca cgagagaatg gatgaattcg agctccaagg ctctgacctg 2101 ccaaccttca attttgaccg caacaaagtc gccgccttca gacaattagc agctgaaaac 2161 aaatatggtc ttatggacac attgagggtc ggcaaccaac tgaaaagtgt taagaccctg 2221 gatgagctga agcaggccat aaagaacatc agtatcaaga agtgtcagat agtgtataat 2281 ggatgcacct ataccatgga atctgatggg aggggcaagg ttgtggttga aaaggtacaa 2341 aacgcaacag ttcaaaccaa caatgaactt gtaggggcgc tccaccacct gagatctgca 2401 aggataagat actatgttaa atgtttccaa gaagccattt actctctact ccaaattgct 2461 ggtgcagcct tcgtcacctc acgcatcgtg aggcgcatga acatatcaaa cctctggtcg 2521 aagccaccca tcgaggaggg tgatgagcct gaagacagag gaggatgccc caagcccagg 2581 gatgaagatg acctcactat tgactctaga gacattaaag tggaagggaa gaagggcaag 2641 aacaagtctg gccggggcaa gaaacacaca gccttctctt ccaagggcct cagtgatgaa 2701 gagtacgatg agtacaagag aatcagagag gagaggaatg gcaaatactc aatagaggaa 2761 tacctccaag atagggacag atactatgaa gagcttgcca ttgctaaggc cactgaggag 2821 gacttctgtg aggaggagga gatcaaaatc cgccagagaa tcttccgacc caccaggaag 2881 cagcggaaag aggaaagggc cacgcttggg cttgtcacag gatcggaaat cagaaagagg 2941 aatccagatg actttaaacc aaaaggaaaa ttgtgggctg atgacaacag gagcgtggat 3001 tacaatgaga ggatagattt tgaggctccc ccgagtgttt ggtctaggat agtcaacttt 3061 ggcacaggct ggggattctg ggtttctcca agtctcttta taacttcaac acacgtcata 3121 ccaaaaggaa tcactgaggc ctttggagtg cccataaacc aaatccaaat tcacaaatct 3181 ggagaatttt gccgcttgcg gtttccaaag ccaattagac cagacgtgag tgggatgatc 3241 ttggaagagg gcgcccctga aggcactgtt gtgtctattc tcatcaaaag gacaacaggg 3301 gagctgatgc ctcttgcagt cagaatgggg actcatgcca caatgaaaat ccaaggtaga 3361 acggtgggtg gtcagatggg tatgcttctc actggctcaa atgctaaaag catggatcta 3421 ggcacaacgc caggtgactg cggatgtcca tacatataca aaaggggcaa tgaccttgtg 3481 gtcattgggg tgcacactgc agcagctcgt ggaggcaaca cggtcatatg tgccacacag 3541 ggaagtgaag gtgaagccac tcttgaagga ggtgacaaca agggaaccta ttgtggagcc 3601 cccattttgg gacctggcaa tgcaccaaaa ttgagcacaa agacaaaatt ctggagatcc 3661 tctaacgcac cgcttccccc aggtacttat gaaccagcat accttggtgg gagagacccg 3721 cgcgtgaagg gaggcccatc tctgcaacaa gttatgagag accaacttaa accttttaca 3781 gagcccagag ggaagccacc aaacccaaat gtcttagagt cagcaaagaa gactatcatc 3841 aatgttctgg aacaaaccat tgacccccct caaaagtggt catatgccca agcttgtgca 3901 tccctcgaca agacaacctc cagtggatac ccgcaccacg tccggaagaa tgactattgg 3961 agtggtgaat ccttcacagg aaaacttgca gatcaggctt caaaagcaaa tctcatgtat 4021 gaggaaggca agcacatgca accagtttac accgcagcac tcaaagacga gctcgtgaaa 4081 actgacaaaa tttatggcaa aatcaagaaa agactcctgt ggggttctga cctctccaca 4141 atgattcgat gcgccagagc atttggtggt ctcatggatg aattcaaagc aaattgtatt 4201 acactcccta tcagagttgg tatgaatatg aatgaagatg gtcccataat atttgagaaa 4261 cactccaggt acaggtacca ctatgatgct gattactccc gctgggactc cacacaacag 4321 cgggcagtgc tggaagcggc acttgaaatc atggtgagat tttctgctga gccacagctg 4381 gcacaaatag tggcagagga cctgctgtca ccaagtgtgg ttgatgtggg cgatttcaaa 4441 atcgctatca atgaaggcct accatctggc gtgccttgca cctcacaatg gaattctatt 4501 gcccactggt tacttacctt gtgtgccctt tctgaagtga caggattagg tcctgacatc 4561 atacaagcta actccatgta ctctttctat ggtgatgatg agattgtgag cacagacata 4621 aaattggacc cagagaaatt gaccgcaaag ctcaaagaat atggccttaa acccactcgg 4681 cccgacaaaa ctgaggggcc gttggtgatt agtgaggacc tgaatgggtt gactttcctc 4741 cgccgaacag tcacccgtga tccagcaggt tggtttggaa agttggacca aaactccatc 4801 ctcaggcagt tgtactggac aagaggaccc aaccatgaag accccagtga gaccatgata 4861 ccacacgcac aaagacctgt gcagctcatg gcactactag gagaatcctc cctacatgga 4921 ccctcatttt acagcaaggt tagcaaatta gtcatatctg aacttaaaga gggaggaatg 4981 gatttttatg tgcccagaca agagtcaatg ttcagatgga tgagattctc agatctaagc 5041 acatgggagg gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgaatga 5101 cgccgctcca tctaatgatg gtgctgctgg tctcgtacca gagggcaaca acgagaccct 5161 tcccctagaa ccagttgcgg gcgcagctat agccgcaccc gtcactggcc aaaataatat 5221 aattgacccc tggattagaa caaattttgt gcaagcacca aatggagagt tcacagtgtc 5281 acccagaaac tctcctggag aaattttatt aaatttagag ttgggccctg atttgaaccc 5341 ttatttggct catttgtcaa ggatgtacaa tgggtatgct ggtggagtgg aagttcaggt 5401 tctcctggca gggaacgcgt tcactgccgg aaagatcctc ttcgccgccg tcccgccaaa 5461 tttcccagtg gaattcttaa gcccagccca gatcacaatg ctcccacatt taatagtaga 5521 tgttaggact cttgaaccaa ttatgatccc actccctgat gttaggaata cattcttcca 5581 ttatagtaac cagcctaaca gccgcatgag attagtggct atgctctata ccccactcag 5641 atctaatggc tcaggtgatg atgtctttac tgtctcttgc agggttttga ctaggcctac 5701 tcctgatttt gagttcactt atttagtgcc accttctgtt gaatctaaaa ctaagccttt 5761 ttccttacct attttaaccc tttctgagct cacaaattcg aggtttccag tccccatcga 5821 ttcgcttttc accgcccaga ataatgtgtt gcaggtgcag tgtcaaaatg gcaggtgtac 5881 acttgatggt gagttacaag gcacaaccca gttgctccca actggcatct gtgcattcag 5941 aggacgggtg acagcacaaa ttaaccaacg tgacaggtgg cacatgcaac tgcagaacct 6001 caatggtaca acatatgacc caactgatga tgtgccagcc ccgctgggta cacctgactt 6061 caagggcgtc gtgtttggga tggtaagcca aagaaatgtg ggtaatgatg cgcctggctc 6121 aaccagagcc caacaggcgt gggtttcaac ctatagcccc caatttgtcc ccaaattagg 6181 ttctgtcaat cttagaatta gtgataatga tgatttccaa ttccagccga caaaattcac 6241 accagtgggc gtcaatgatg acgatgatgg ccacccgttc agacaatggg aactaccaaa 6301 ctattcaggg gaactcacct tgaatatgaa tcttgccccc ccagttgctc caaattttcc 6361 tggtgaacaa ttgttattct tcagatcttt cgtgccatgc tcaggaggtt acaaccaagg 6421 tattatagat tgtcttattc cccaagaatg gatccaacac ttctatcagg aatcagcacc 6481 ctcccagtca gacgtggccc taatcaggta tgtcaacccc gatacgggac gtacactgtt 6541 tgaagcaaaa ttgcacagat ctggttacat tactgtggct cactctggag actatcctct 6601 tgttgttccg gctaatggac actttagatt tgattcttgg gtaaatcagt tttactcact 6661 cgccccaatg ggaactggga atgggcgaag gagggctcag tgatggctgg ggctttcatt 6721 gcaggattgg caggcgacat gctcacgtca tctgtgggct cccttgtgaa cgcaggggca 6781 aacgccatca accaaaagat agactttgaa aacaacaaac aactccagtc tgcttccttt 6841 cagcatgata aagagatgct ccaagcgcag gtgagggcaa ccaagcagct gcaatctgaa 6901 atgatagccc taaaacaggg ggttttggcc gcaggcggct tttcccccac tgatgcagca 6961 aggggatcca ttggtgcacc catgacaaag gtgcttgact ggtctggcac tcgatactgg 7021 gcgcccaatt ccacaaagac aactggctat tcgggacaat tcacctcttc acctgtgcat 7081 atgtctagcc caaatgctcc acaatcaaaa cctgcaaagc ctaggtctct agctccttcc 7141 tcttcttcta gcagtgtcta tagtatgtat actcaatcta ctcatttaac atctggctct 7201 tctagtaatg cttcttctgc ctccacaaaa ttgacaaatt taagctctgg ctcctctcaa 7261 aacagaacag cagagtgggt aaatcaacag agaagtctta gccctttcat gagtggcgca 7321 cttaacatct cacatgccac gccaccctca agtagggctt ccagttctgg gacggtctcg 7381 accgtgccca aggaagtttt ggactcctgg acgtctgcgt tcaacacaca cagacagccg 7441 ctcttcgcac acctcagagt gaggggggag tcacgtgttt agtgaaaaga aataattggc 7501 tataatgtga tttctttcta aatttggc //