Typing tool
|
Complete norovirus genomes
MK789656 | GI.6 | ||
---|---|---|---|
GI.P11 |
ORF1: 1..5320 ORF2: 5304..6926 ORF3: 6926..7555LOCUS MK789656 7641 bp RNA linear VRL 01-NOV-2019 DEFINITION Norovirus GI isolate G19_021 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK789656 VERSION MK789656.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7641) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S.F. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7641) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (02-APR-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7641 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="G19_021" /isolation_source="sewage" /db_xref="taxon:122928" /country="France: Nantes" /collection_date="05-May-2014" /note="genotype: GI.6-GI.Pb" gene <1..5320 /gene="ORF1" CDS <1..5320 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QCA41819.1" /translation="NNANNTSATSRFLSRFKGLGGGASPPNPIKIKSTEMALGLIGKA TQEAVGASDLPPKQQRDRPPRTQEEVQYGMGWTERPVDQNVKSWEELDTSTKEEILDS HKEWFDAGGLGPCTMPSTCEQAKDDSPPGEQVRWSARDGVNLGVNRLTTVSGPEWNLC PLPPIDLRNMEPASEPTIGDMIEFYEGHIYHYSIYIGQGKTVGVHSPQAAFSVARVTI QPIAAWWRVCYIPQPKHRLSYDQLRELENEPWPYAAITNNCFEFCCQVMNLEDTWLQR RLITSGRFHHPSQSWSQQTPEFQQDSKLELVRDAILAAVNGLVSQPFKNFLGKLKPLN VLNILSNCDWTFMGVVEMVILLLELFGVFWNPPDVSNFIASLLPDFHLQGPEDLARDL VPVILGGIGLAIGFTRDKVTKVMKSAVDGLRAATQLGQYGLEIFSLLKKYFFGGDQTE RTLKGIEAAVIDMEVLSSTSVTQLVRDKQAAKAYMNILDNEEEKARKLSAKNADPHVI SSTNALISRIAMARSALAKAQAEMTSRMRPVVIMMCGPPGIGKTKAAEHLAKRLANEI RPGGKVGLVPREAVDHWDGYHGEEVMLWDDYGMTKIQDDCNKLQAIADSAPLTLNCDR IENKGMQFVSDAIVITTNAPGPAPVDFVNLGPVCRRVDFLVYCSAPEVEQIRRVSPGD TSALKDCFKSDFSHLKMELAPQGGFDNQGNTPFGKGVMKPTTINRLLIQAVALTMERQ DEFRLQGKMYDFDDDRVSAFTTMARDNGLGILSMASLGKKLRGVTSMDGLKNALKGYK IGACTIKWQAKVYSLESDGNSVNIREEKNVLTQQQQSVCAASIALTRLRAARAVAYAS CIQSAITSILQIAASALVVNRAVKRMFGTRTAALSLEGPPKEHKCRVHQAKAAGKGPI GHDDMVDKYGLCETEEDEEVVHTEIPSATMEGKNKGKNKKGRGRKNNYNAFSRRGLND EEYEEYKKIREEKGGNYSIQEYLEDRQRYEEELAEVQAGGDGGIGETEMEIRHRVFYK SKSKKHHQEERRQLGLVTGSDIRKRKPIDWTPPKSAWADDEREVDYNERINFEAPPTL WSRVTKFGSGWGFWVSPTVFITTTHVIPTSAKEFFGEPLASIAIHRAGEFTLFRFSKK IRPDLTGMILEEGCPEGTVCSVLIKRDSGELLPLAVRMGAIASMRIQGRLVHGQSGML LTGANAKGMDLGTIPGDCGAPYVYKRANDWVVCGVHAAATKSGNTVVCAVQASEGETT LEGGNKGHYAGHEIIKHGCGPALSTKTKFWKSSPEPLPPGVYEPAYLGGRDPRVSGGP SLQQVLRDQLKPFAEPRGRMPEPGLLEAAVETVTSSLEQVMDTPVPWSYSDACQSLDK TTSSGFPHHKKKNDDWNGTAFIRELGEQAAHANNMYEQAKSMKPMYTAALKDELVKPE KVYQKVKKRLLWGADLGTVIRAARAFGPFCDAIKSHTIKLPIKVGMNSIEDGPLIYAE HSKYKYHFDADYTAWDSTQNRQIMTESFSIMCRLTASPELASVVAQDLLAPSEMDVGD YVIRVKEGLPSGFPCTSQVNSINHWLITLCALSEVTGLSPDVIQSMSYFSFYGDDEIV STDIEFDPAKLTQVLKEYGLKPTRPDKSEGPIIVRKSVDGLVFLRRTISRDAAGFQGR LDRASIERQIYWTRGPNHSDPFETLVPHQQRKIQLISLLGEASLHGEKFYRKISSKVI QEIKTGGLEMYVPGWQAMFRWMRFHDLGLWTGDRNLLPEFVNDDGV" mat_peptide <1..1150 /gene="ORF1" /product="p48" mat_peptide 1151..2239 /gene="ORF1" /product="NTPase" mat_peptide 2240..2836 /gene="ORF1" /product="p22" mat_peptide 2837..3250 /gene="ORF1" /product="VPg" mat_peptide 3251..3793 /gene="ORF1" /product="Pro" mat_peptide 3794..5317 /gene="ORF1" /product="RdRp" gene 5304..6926 /gene="ORF2" CDS 5304..6926 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCA41820.1" /translation="MMMASKDAPTSPDGASGAGQLVPEANTAEQISMDPVAGASTAVA TAGQVNMIDPWIFNNFVQAPQGEFTISPNNTPGDILFDLQLGPHLNPFLAHLSQMYNG WVGNMRVRILLAGNAFTAGKIIICCVPPGFDARILTIAQATLFPHLIADVRTLEPVEL PLEDVRNVLFHNSSQPQPTMRLVAMLYTPLRTGGGSGGTDAFVVAGRVLTCPAPDFSF LFLVPPSVEQKTRVFSVPNIPLKDLSNSRVPVPVQGMFMSPDVNQSVQFQNGRCQIDG QLQGTTPVSLSQLCKIRGKTSSNARVLNLSEVDGTPFIPLESPAPVGFPDLGGCDWHV NFTFQAQDQDPSQSVTFATNDASFVPYLGSISPHNGGDFHAGDIIGSLGWISAPSDNS QLNVWTIPKYGSSLPDVTHLAPAVFPPGFGEVILYFYSTFPGSGQSSQLQVPCLLPQE FITHFCNEQAPIAGEAALLHYVDPDTGRNLGEFKLYPDGFMTCVPNSVSSGPQTLPIN GVFVFVSWVSRFYQLKPVGTASAARRLGLRRI" gene 6926..7555 /gene="ORF3" CDS 6926..7555 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCA41821.1" /translation="MAQAVIGAIAASAAGSILGAGIQAGAEAGLQAQRYQQDLQLQQN SFKHDKEMLGYQVQASNALLAKNLNTRYALLQAGGLSSADAARAVAGAPVTRIVDWNG TRIAAPTSSTTTLRSGGFMAVPIPLSSKTKQPVMSGQDNPNYAASSISRTASWVQSQN SMRSVSPFHSDALRTVWVTPPGSTSTSSVQSSFYGVFNTDRLPLFANRR" ORIGIN 1 caacaacgct aacaacacta gtgctacatc tcgatttttg tcgagattta agggtttagg 61 tggtggcgcg agccccccta accccataaa gatcaaaagc acagaaatgg ccttgggttt 121 gattggtaag gcaacccaag aggcagtagg tgccagtgac ctaccgccta aacagcaaag 181 agaccggccc cccagaaccc aagaggaagt tcagtacggc atggggtgga ctgaaaggcc 241 cgtggaccag aatgttaagt catgggagga acttgacacc tccaccaagg aagagatttt 301 ggacagccac aaagaatggt tcgatgccgg cggtttgggc ccgtgcacta tgccctcaac 361 ttgtgaacag gctaaagatg atagcccacc tggtgagcaa gtcagatggt cagcgcgtga 421 tggagtcaac cttggagtga accgtctcac gacggtgagt ggccccgagt ggaacctctg 481 ccctctaccc cccattgacc taaggaatat ggaaccagct agtgaaccca ccattggaga 541 catgatagag ttttatgaag gccatatcta ccactactcc atatacattg ggcaaggaaa 601 gacagttggt gtgcattccc cacaggcagc attctcagtg gctagagtaa ccatccaacc 661 tatagctgct tggtggaggg tttgttatat accccagccc aaacatagac tgagctatga 721 ccaacttagg gaattggaaa atgaaccttg gccatatgca gctatcacca acaattgttt 781 tgagttttgc tgtcaagtta tgaaccttga ggacacatgg ttgcagaggc gactaataac 841 atcaggtaga ttccaccacc cttcccagtc ttggtcacaa cagacccctg aatttcagca 901 ggacagtaaa ctggaactgg ttagggatgc catattggcc gcggtgaatg gccttgtttc 961 acaacccttc aagaatttct tgggcaagct caagcctctt aatgtattaa acatcctatc 1021 taactgtgac tggactttta tgggggtggt agagatggtt atactacttc ttgagctctt 1081 tggtgtgttc tggaacccgc ccgatgtgtc taattttata gcatctctcc tccctgactt 1141 ccacctccag ggaccagaag acctggcccg ggatttggtg ccagtcattc ttggtggcat 1201 aggactagcc attggattta ccagagataa agtcaccaag gtcatgaaga gtgctgtgga 1261 tggactccga gccgctacac agctggggca atacgggttg gaaatattct cactcctcaa 1321 gaagtatttc tttggtgggg atcagactga acgaaccctc aaaggcattg aagcagcagt 1381 catagatatg gaggttttgt cctctacatc agtgacacaa ttggtgagag ataagcaggc 1441 agctaaagct tacatgaaca tcctggataa tgaagaggaa aaagctagaa aactctctgc 1501 taagaatgct gacccccatg taatatcctc aacaaatgcc ctaatatcac gtatagccat 1561 ggcgcgatcc gctctggcta aggctcaagc tgagatgacc agccgaatga ggccagttgt 1621 catcatgatg tgtggacctc ctggaattgg taagactaaa gcagcggaac acttggcaaa 1681 acgcctagct aatgagatca ggcctggcgg caaagtggga ctggtaccac gtgaggctgt 1741 tgaccactgg gatggctacc atggtgagga agtgatgcta tgggatgact atggtatgac 1801 aaagatacaa gatgactgca acaagctcca ggctattgct gactctgccc cactcactct 1861 caattgtgat aggattgaaa ataaagggat gcagtttgtg tcagatgcaa tagtcatcac 1921 caccaacgct ccagggcctg cccctgtgga ctttgtcaac cttggccctg tgtgtagacg 1981 agttgacttc ctggtttact gttccgcccc agaggtagag cagataagga gagtcagccc 2041 tggagacacg tcggcactta aagactgctt caagtcagat ttctcccacc tgaagatgga 2101 gttagctcct caaggggggt ttgacaacca ggggaacaca ccattcggca agggtgtcat 2161 gaaaccaaca accatcaaca ggctcctcat acaagctgtg gctctcacca tggagagaca 2221 ggatgagttc cgtctccaag ggaagatgta tgattttgat gatgacaggg tgtcagcctt 2281 caccaccatg gcacgcgata atggattggg tatcctgagc atggcgagcc taggtaagaa 2341 gctgcgcggt gtcacatcga tggatggtct gaagaatgct ttgaaagggt ataaaattgg 2401 tgcgtgcaca atcaagtggc aggccaaggt gtattcacta gagtcagatg gcaacagtgt 2461 taacatcagg gaggagaaga atgtcttaac tcaacaacag cagtcggtgt gcgctgcctc 2521 tattgcactc acccgtctgc gggccgcacg cgcggtggcg tatgcgtcat gcattcaatc 2581 agccataacc tccatactac aaattgctgc ctctgcccta gtggtcaaca gggccgtgaa 2641 aagaatgttt ggcacacgca ctgctgcttt gtctctagag ggccccccca aagaacacaa 2701 atgcagagtc caccaggcta aagccgcagg aaaagggccc attggccatg atgacatggt 2761 tgataaatat ggtctatgtg agactgagga ggacgaagaa gtagtccaca ctgagatacc 2821 atctgccacc atggaaggta agaacaaagg aaagaacaag aaagggcgcg gccgaaagaa 2881 caactacaat gccttttccc gtagagggct caatgacgaa gagtacgagg agtacaagaa 2941 aatacgggaa gagaagggtg gaaattacag cattcaggag tacctagagg atagacaaag 3001 gtatgaagag gagcttgctg aggttcaagc aggtggagat ggaggaatcg gtgaaactga 3061 aatggaaatc cgccatagag tgttctacaa gtctaagagc aagaagcacc accaggaaga 3121 acgacgccaa ctggggttag tcacaggttc tgacatccgg aaaagaaaac caattgactg 3181 gaccccccct aagtcagcat gggcagatga tgagcgtgaa gtggactaca atgagaggat 3241 caactttgag gcgcccccca ctttgtggag ccgggtcaca aagttcggat ctgggtgggg 3301 cttctgggtc agtcccacag tcttcataac cacaacgcac gttataccaa ccagtgcaaa 3361 agagttcttt ggtgaacccc ttgccagcat agccatccac agagccggag aattcaccct 3421 cttcaggttc tctaagaaaa tcaggcccga tctcacgggt atgattcttg aggaaggctg 3481 cccagagggt acggtgtgct cagtattaat aaaaagggac tccggtgagc tactaccact 3541 agctgtaaga atgggcgcaa tagcatcaat gcgcatacag ggtcgccttg tccatggcca 3601 gtctggtatg ttgctcactg gggcgaatgc taagggcatg gacctcggaa ctattccagg 3661 ggattgtgga gctccctatg tctataagag agcaaatgac tgggtggttt gtggtgtgca 3721 cgctgctgct actaagtcag gcaacacggt agtgtgcgcc gttcaggcta gtgaagggga 3781 aacaacactt gagggaggta acaaaggtca ctacgccgga catgagataa tcaagcatgg 3841 atgtgggcca gccctgtcaa ccaaaacaaa gttctggaaa tcgtcccctg aaccactacc 3901 ccctggagtc tatgaacctg catacctcgg tggccgggat ccaagagtga gtggtggccc 3961 ctcgctccaa caggtgttac gggatcagtt aaagccattt gctgagccac gggggcgtat 4021 gccggaacca ggtctcctgg aggccgcagt tgagactgtg acctcatcac tggagcaggt 4081 tatggatacc ccagtgccgt ggagttacag cgatgcatgc caatccctag ataaaactac 4141 tagttctggt ttcccccatc acaagaagaa gaatgatgat tggaacggca ccgccttcat 4201 tagagagtta ggagagcagg cagcgcacgc caataatatg tacgaacaag ctaagagcat 4261 gaagcccatg tacacggcgg cgcttaagga tgaattagtg aaaccagaaa aagtgtacca 4321 aaaagtgaag aagcgcttgc tttggggggc agatctagga acagtgatcc gggccgcacg 4381 ggcctttggc ccattctgtg atgctataaa gtcccacaca attaaactac ctatcaaagt 4441 tggaatgaat tcaatcgagg atgggccatt aatttatgca gagcactcaa aatataaata 4501 tcactttgat gcagactata cggcttggga ctcaacacaa aataggcaaa ttatgactga 4561 atcattctca atcatgtgtc ggctaactgc ttctccagaa ttggcctcag tggtggcaca 4621 agatctgctt gcaccctcag aaatggacgt tggtgactac gtcataagag tgaaggaagg 4681 cctcccatct ggctttccat gcacatcaca ggtcaatagc ataaaccatt ggttgataac 4741 tttgtgtgcc ctctctgagg tgactggcct gtcaccagat gttatccagt ccatgtcata 4801 cttttctttc tatggtgatg atgaaatagt ttccactgac atagaatttg acccagcaaa 4861 gctgacacaa gtccttaaag agtatggcct caaacccacc cgccctgaca agagtgaagg 4921 tccaataatt gtgaggaaga gtgtggatgg cctggtcttc ttacgtcgca ccatttcccg 4981 cgacgccgcg gggttccagg ggcgactgga ccgagcatcc attgaaaggc agatctactg 5041 gaccagaggg cccaatcact cagacccctt tgaaaccttg gtgccccacc agcaaagaaa 5101 aatccagctg atatcactgt taggtgaagc ctcattgcat ggtgaaaaat tctacaggaa 5161 gatctcaagc aaagtcatcc aagagattaa gacagggggt cttgaaatgt atgtgccagg 5221 gtggcaagcc atgttccgct ggatgcggtt ccacgacctt ggcctgtgga caggagatcg 5281 caatctcctg cccgaattcg taaatgatga tggcgtctaa ggacgcccca acatcccctg 5341 atggcgccag tggcgccggc cagctggtac cggaggctaa tacagctgag caaatttcaa 5401 tggaccctgt cgcgggtgct tcaacagcag tcgcgacagc tgggcaagtt aatatgattg 5461 acccatggat tttcaataac tttgtccagg cacctcaagg agaattcact atttccccta 5521 ataatacccc cggtgacatt ttgtttgatt tacaactagg accccacctt aatccatttc 5581 tagcccatct ctcacagatg tacaatggtt gggtcggcaa tatgcgtgtg cgcatactgt 5641 tggccgggaa tgccttcaca gctggaaaga taatcatttg ctgtgtcccc cctggttttg 5701 atgctagaat actcacaata gctcaagcaa ctctctttcc acacttaatt gccgatgtaa 5761 ggacccttga gcctgtggaa cttcccttgg aggatgtgcg caatgttctc ttccacaaca 5821 gtagccaacc gcagccaaca atgcggctgg ttgctatgtt gtacactccc ctccgcactg 5881 gtggtggttc cggaggcact gatgcctttg tggtggcggg tagggtactt acgtgccccg 5941 cccccgactt tagttttctg tttctggtcc ccccttccgt tgaacaaaag accagagttt 6001 tcagtgtccc caatataccc ctgaaggatc tctcaaattc ccgtgtccct gtgcctgtac 6061 agggcatgtt tatgtctcca gatgttaatc agtcagtcca gtttcagaat ggacgctgcc 6121 aaattgatgg ccaactccaa ggcaccactc cagtctcgct cagccaactc tgcaagatta 6181 ggggtaaaac ttctagtaat gctagggtgc tcaacttaag tgaggtagat ggcacaccct 6241 tcatcccact tgaatcacca gcgccagttg gttttcccga cttgggaggt tgtgactggc 6301 atgtaaattt cactttccag gctcaagatc aggatccatc tcaaagtgta acctttgcaa 6361 ctaatgatgc cagctttgtc ccctatctag gtagtatctc tcctcacaat gggggagatt 6421 ttcatgcagg tgacatcata ggtagccttg gttggatttc agccccgtct gacaattcac 6481 aactcaatgt ttggacaata ccaaagtatg gatctagtct cccagatgtc actcatcttg 6541 cccctgctgt gttcccccca ggctttggtg aggtgatctt gtacttctac tctaccttcc 6601 caggttctgg acaatccagc caacttcaag tcccatgttt gttgcctcag gagttcatta 6661 cccacttctg taacgaacag gctcccatcg ctggggaggc tgccctcctc cactacgtgg 6721 accctgatac ggggcgaaac ttgggagaat ttaaacttta ccctgatggg tttatgacct 6781 gtgtccccaa tagtgttagt agtggccctc aaacccttcc tatcaatgga gtctttgttt 6841 ttgtttcatg ggtgtctaga ttctatcaac tcaagcctgt gggaacggcc tcagcggcta 6901 gaaggcttgg attgcggcgc atataatggc ccaagctgtc atcggtgcca tagccgcatc 6961 tgccgctggt agtatactag gggcaggcat acaggctggt gctgaggctg gtctccaggc 7021 tcaacggtat cagcaggatc tgcagttgca acaaaattcc ttcaagcatg ataaggaaat 7081 gttaggctac caagttcagg ctagtaatgc tcttttagct aagaatctta acactagata 7141 tgcccttctg caggcagggg gcctatctag tgctgatgct gctcgggcag tggctggtgc 7201 tcctgtcacc cgtatagtgg actggaatgg cacgcgcatt gcagcgccca cttcaagcac 7261 cactacactc agatctggtg gttttatggc tgtccctata ccattgtctt caaagaccaa 7321 gcaaccagtg atgtctgggc aggataaccc aaattatgct gcttcttcta tctctagaac 7381 tgcttcatgg gtgcaatctc aaaattctat gaggtctgtt tctccttttc acagtgatgc 7441 tttgagaacc gtttgggtta caccacccgg ttcaacatca acttcatctg tgcaatctag 7501 tttctatggt gtttttaata cagatagatt gcctctgttc gcaaacagaa ggtaataaaa 7561 ttttgtaata ggatgccagt gggcaccata ttcagatttg attttaattg gattgtttag 7621 atcggaatag cgtcgtttgt t //