Complete norovirus genomes
Accession | Genotype | P-type |
---|---|---|
MK789655 | GI.6 | GI.P11 |
ORF1: 1..5347 ORF2: 5331..6953 ORF3: 6953..7582
LOCUS MK789655 7679 bp RNA linear VRL 01-NOV-2019 DEFINITION Norovirus GI isolate G19_018 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK789655 VERSION MK789655.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7679) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S.F. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7679) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (02-APR-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7679 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="G19_018" /isolation_source="sewage" /db_xref="taxon:122928" /country="France: Nantes" /collection_date="30-Mar-2014" /note="genotype: GI.6-GI.Pb" gene <1..5347 /gene="ORF1" CDS <1..5347 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QCA41816.1" /translation="VVATNVASNNNANNTSATSRFLSRFKGLGGGASPPNPIKIKSTE MALGLIGKTTQEAVGASDLPPKQQRDRPPRTQEEVQYGMGWTERPVDQNVKSWEELDT STKEEILDSHKEWFDAGGLGPCTMPSTCEQAKDDSPPGEQVRWSARDGVNLGVNRLTT VSGPEWNLCPLPPIDLRNMEPASEPTIGDMIEFYEGHIYHYSIYIGQGKTVGVHSPQA AFSVARVTIQPIAAWWRVCYIPQPKHRLSYDQLRELENEPWPYAAITNNCFEFCCQVM NLEDTWLQRRLITSGRFHHPSQSWSQQTPEFQQDSKLELVRDAILAAVNGLVSQPFKN FLGKLKPLNVLNILSNCDWTFMGVVEMVILLLELFGVFWNPPDVSNFIASLLPDFHLQ GPEDLARDLVPVILGGIGLAIGFTRDKVTKVMKSAVDGLRAATQLGQYGLEIFSLLKK YFFGGDQTERTLKGIEAAVIDMEVLSSTSVTQLVRDKQAAKAYMNILDNEEEKARKLS AKNADPHVISSTNALISRIAMARSALAKAQAEMTSRMRPVVIMMCGPPGIGKTKAAEH 
LAKRLANEIRPGGKVGLVPREAVDHWDGYHGEEVMLWDDYGMTKIQDDCNKLQAIADS APLTLNCDRIENKGMQFVSDAIVITTNAPGPAPVDFVNLGPVCRRVDFLVYCSAPEVE QIRRVSPGDTSALKDCFKSDFSHLKMELAPQGGFDNQGNTPFGKGVMKPTTINRLLIQ AVALTMERQDEFRLQGKMYDFDDDRVSAFTTMARDNGLGILSMASLGKKLRGVTSMEG LKNALKGYKIGACTIKWQAKVYSLESDGNSVNIREEKNVLTQQQQSVCAASIALTRLR AARAVAYASCIQSAITSILQIAASALVVNRAVKRMFGTRTAALSLEGPPKEHKCRVHQ AKAAGKGPIGHDDMVDKYGLCETEEDEEVVHTEIPSATMEGKNKGKNKKGRGRKNNYN AFSRRGLNDEEYEEYKKIREEKGGNYSIQEYLEDRQRYEEELAEVQAGGDGGIGETEM EIRHRVFYKSKSKKHHQEERRQLGLVTGSDIRKRKPIDWTPPKSAWADDEREVDYNER INFEAPPTLWSRVTKFGSGWGFWVSPTVFITTTHVIPTSAKEFFGEPLASIAIHRAGE FTLFRFAKKIRPDLTGMILEEGCPEGTVCSVLIKRDSGELLPLAVRMGAIASMRIQGR LVHGQSGMLLTGANAKGMDLGTIPGDCGAPYVYKRANDWVVCGVHAAATKSGNTVVCA VQASEGETTLEGGDKGHYAGHEIIKHGCGPALSTKTKFWKSSPEPLPPGVYEPAYLGG RDPRVSGGPSLQQVLRDQLKPFAEPRGRMPEPGLLEAAVETVTSSLEQVMDTPVPWSY SDACQSLDKTTSSGFPHHKKKNDDWNGTAFIRELGEQAAHANNMYEQAKSMKPMYTAA LKDELVKPEKVYQKVKKRLLWGADLGTVIRAARAFGPFCDAIKSHTIKLPIKVGMNSI EDGPLIYAEHSKYKYHFDADYTAWDSTQNRQIMTESFSIMCRLTASPELASVVAQDLL APSEMDVGDYVIRVKEGLPSGFPCTSQVNSINHWLITLCALSEVTGLSPDVIQSMSYF SFYGDDEIVSTDIEFDPAKLTQVLKEYGLKPTRPDKSEGPIIVRKSVDGLVFLRRTIS RDAAGFQGRLDRASIERQIYWTRGPNHSDPFETLVPHQQRKIQLISLLGEASLHGEKF YRKISSKVIQEIKTGGLEMYVPGWQAMFRWMRFHDLGLWTGDRNLLPEFVNDDGV" mat_peptide <1..1177 /gene="ORF1" /product="p48" mat_peptide 1178..2266 /gene="ORF1" /product="NTPase" mat_peptide 2267..2863 /gene="ORF1" /product="p22" mat_peptide 2864..3277 /gene="ORF1" /product="VPg" mat_peptide 3278..3820 /gene="ORF1" /product="Pro" mat_peptide 3821..5344 /gene="ORF1" /product="RdRp" gene 5331..6953 /gene="ORF2" CDS 5331..6953 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCA41818.1" /translation="MMMASKDAPTSPDGASGAGQLVPEANTAEQISMDPVAGASTAVA TAGQVNMIDPWIFNNFVQAPQGEFTISPNNTPGDILFDLQLGPHLNPFLAHLSQMYNG WVGNMRVRILLAGNAFTAGKIIICCVPPGFDARILTIAQATLFPHLIADVRTLEPVEL PLEDVRNVLFHNSSQPQPTMRLVAMLYTPLRTGGGSGGTDAFVVAGRVLTCPAPDFSF LFLVPPSVEQKTRVFSVPNIPLKDLSNSRVPVPVQGMFMSPDVNQSVQFQNGRCQIDG 
QLQGTTPVSLSQLCKIRGKTSSNARVLNLSEVDGTPFIPLESPAPVGFPDLGGCDWHV NFTFQAQNQDPSQSVTFATNDASFVPYLGSISPHNGGDFYAGDIIGSLGWISAPSDNS QLNVWTIPKYGSSLPDVTHLAPAVFPPGFGEVILYFYSTFPGSGQSSQLQVPCLLPQE FITHFCNEQAPIAGEAALLHYVDPDTGRNLGEFKLYPDGFMTCVPNSVSSGPQTLPIN GVFVFVSWVSRFYQLKPVGTASAARRLGLRRI" gene 6953..7582 /gene="ORF3" CDS 6953..7582 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCA41817.1" /translation="MAQAVIGAIAASAAGSILGAGIQAGAEAGLQAQRYQQDLQLQQN SFKHDKEMLGYQVQASNALLAKNLNTRYALLQAGGLSSADAARAVAGAPVTRIVDWNG TRIAAPTSSTTTLRSGGFMAVPIPLSSKTKQPVMSGQDNPNYAASSISRTASWVQSQN SMRSVSPFHSDALRTVWVTPPGSSSTSSVQSSFYGVFNTDRLPLFANRR" ORIGIN 1 cgtcgttgcg actaatgttg caagcaataa caacgctaac aacactagtg ctacatctcg 61 atttttgtcg agatttaagg gtttaggtgg tggcgcgagc ccccctaacc ccataaagat 121 caaaagcaca gaaatggcct tgggtttgat tggcaagaca acccaagagg cagtaggggc 181 cagtgaccta ccgcctaaac agcaaagaga ccgacccccc agaacccaag aggaagttca 241 gtacggcatg ggatggactg aaaggcccgt ggaccagaat gttaagtcat gggaggaact 301 tgacacctcc accaaggaag agattttgga cagccacaaa gaatggttcg atgccggcgg 361 tttgggccca tgcactatgc cctcaacttg tgaacaggct aaagatgata gcccacctgg 421 tgagcaagtc agatggtcag cgcgtgatgg agtcaacctt ggagtgaacc gtctcacgac 481 ggtgagtggc cctgagtgga acctctgccc tctacccccc attgacctaa ggaatatgga 541 accagctagt gaacccacca ttggagacat gatagagttt tatgaaggtc acatctacca 601 ctactccata tatattgggc aaggaaaaac agttggtgtg cattccccac aggcagcatt 661 ctcagtggct agagtaacca tccaacctat agctgcttgg tggagggttt gttatatacc 721 ccagcccaaa catagactga gttatgacca gcttagggaa ttggaaaatg aaccttggcc 781 atacgcagct atcaccaaca attgttttga gttttgctgt caagttatga accttgagga 841 cacatggttg cagaggcggc taataacatc aggtagattc caccaccctt cccagtcttg 901 gtcacaacag acccctgaat ttcagcagga cagtaaactg gaactggtta gggatgctat 961 attggccgcg gtgaatggcc ttgtctcaca acccttcaag aatttcttgg gcaagctcaa 1021 gcctcttaat gtattaaaca tcctatctaa ctgtgactgg acttttatgg gggtggtaga 1081 gatggttata ctactccttg aactctttgg cgtgttctgg aacccgcccg atgtgtctaa 
1141 ttttatagca tctctcctcc ctgacttcca cctccagggg ccagaagacc tggcccggga 1201 tttggtgcca gtcattcttg gtggcatagg actagccatt ggatttacca gagacaaagt 1261 caccaaggtc atgaagagtg ctgtggatgg actccgggcc gctacacagc tggggcaata 1321 cgggttggaa atattctcac tcctcaagaa gtatttcttt ggtggggatc agactgaacg 1381 aaccctcaaa ggcattgaag cagcagtcat agatatggag gttctgtcct ctacatcagt 1441 gacacaattg gtgagagaca agcaggcagc taaagcttac atgaacatcc tggataatga 1501 agaggaaaaa gctagaaaac tctctgctaa gaatgctgac ccccatgtaa tatcctcaac 1561 aaatgcccta atatcacgta tagccatggc gcgatccgct ctggctaagg ctcaagctga 1621 gatgaccagc cgaatgaggc cagttgtcat catgatgtgt ggacctcctg gaattgggaa 1681 gactaaagca gcggaacact tggcaaaacg cctagctaat gagatcaggc ctggcggcaa 1741 agtgggactg gtaccacgtg aagctgttga ccactgggat ggctaccatg gtgaggaagt 1801 gatgctatgg gatgactatg gtatgacaaa gatacaagat gactgcaaca agctccaggc 1861 tattgctgac tctgccccac tcactctcaa ttgtgatagg attgaaaata aagggatgca 1921 gtttgtgtca gatgcaatag tcatcaccac caacgctcca gggcctgccc ctgtggactt 1981 tgtcaacctt ggccctgtgt gtagacgggt tgacttcctg gtctactgtt ccgccccaga 2041 ggtagagcag ataaggagag tcagccctgg agacacgtcg gcacttaaag actgcttcaa 2101 gtcagacttc tcccacctga agatggagtt agctcctcaa ggggggtttg acaaccaggg 2161 gaacacacca ttcggcaagg gtgtcatgaa accaacaacc atcaacaggc tcctcataca 2221 agctgtggct ctcaccatgg agagacagga tgagttccgt ctccaaggga agatgtatga 2281 ttttgatgat gatagggtgt cagcctttac caccatggca cgcgataatg gattgggcat 2341 cctgagcatg gcgagcctag gtaagaagct gcgcggtgtc acatcgatgg aaggtctgaa 2401 gaatgctctg aaagggtata aaattggtgc gtgcacaatt aagtggcagg ccaaggtgta 2461 ttcactagag tcagatggca acagtgttaa catcagggag gagaagaacg tcttaactca 2521 acaacagcag tcggtgtgcg ctgcctctat tgcactcacc cgtctgcggg ccgcacgcgc 2581 ggtggcgtat gcgtcatgca ttcaatcagc cataacctcc atactacaaa ttgctgcctc 2641 tgccctagtg gtcaacaggg ccgtgaaaag aatgtttggc acacgcactg ctgctttgtc 2701 tctagagggc ccccccaaag aacacaaatg cagagtccac caggctaaag ccgcaggaaa 2761 agggcccatt ggccatgatg acatggttga taaatatggt ctatgtgaga ctgaggagga 2821 
tgaagaagta gtccacactg agataccatc tgccaccatg gaaggtaaga acaaaggaaa 2881 gaacaagaaa gggcgcggcc gaaagaacaa ctacaacgcc ttttcccgta gagggctcaa 2941 tgacgaagag tatgaggagt acaagaaaat acgggaagag aagggtggaa attacagcat 3001 tcaggagtac ctagaggata gacaaaggta tgaagaggag cttgctgagg ttcaagcagg 3061 tggtgatgga ggaatcggtg aaactgaaat ggaaatccgc catagagtgt tctacaagtc 3121 taagagcaag aagcaccacc aggaagaacg acgccaactg ggattagtca caggttctga 3181 catccggaaa agaaaaccaa ttgactggac cccccctaag tcagcatggg cagatgatga 3241 gcgtgaagtg gactacaatg agaggatcaa ctttgaggcg ccccccactt tgtggagccg 3301 ggtcacaaag tttggatctg ggtggggctt ctgggtcagt cccacagtct tcataaccac 3361 aacgcacgtt ataccaacca gtgcaaaaga gttctttggt gaaccccttg ccagcatagc 3421 catccacaga gccggagaat tcaccctctt caggttcgct aagaaaatca ggcccgatct 3481 cacgggtatg attcttgagg aaggttgccc agagggtacg gtgtgctcag tattaataaa 3541 aagggactcc ggtgagctac taccactagc tgtaagaatg ggcgcaatag catcaatgcg 3601 catacagggt cgccttgtcc atggccagtc tggtatgttg ctcactgggg cgaatgctaa 3661 gggcatggac ctcggaacta ttccagggga ttgtggagct ccctatgtct ataagagagc 3721 aaatgactgg gtggtttgtg gtgtgcacgc tgctgccact aaatcaggca acacggtagt 3781 gtgtgccgtc caggctagtg aaggggaaac cacacttgag gggggtgaca aaggtcacta 3841 tgccggacat gagataatca agcatggatg tgggccagcc ctgtcaacca aaacaaagtt 3901 ctggaaatcg tcccctgaac cactaccccc tggagtctat gaacctgcat acctcggtgg 3961 ccgggatcca agagtaagtg gtggcccctc gctccaacag gtattacggg atcagttaaa 4021 gccatttgct gagccacggg ggcgtatgcc ggaaccaggt ctcctggagg ccgcagttga 4081 gactgtgacc tcatcactgg agcaggttat ggatacccca gtgccgtgga gttacagtga 4141 tgcatgtcag tccctagata aaactactag ttctggtttc ccccaccaca agaagaagaa 4201 tgatgattgg aacggcaccg ccttcattag agagttggga gagcaggcag cgcacgccaa 4261 taatatgtac gaacaagcta agagcatgaa gcccatgtac acggcggcgc ttaaggatga 4321 attagtgaaa ccagaaaaag tgtaccaaaa agtgaagaag cgcttgcttt ggggggcaga 4381 tctaggaaca gtgatccggg ccgcacgggc ctttggccca ttctgtgatg ccataaagtc 4441 ccacacaatt aaattaccta tcaaagttgg aatgaattca atcgaggatg ggccattaat 4501 ttatgcagag 
cattcaaaat ataaatatca ctttgatgca gactatacgg cttgggactc 4561 aacacaaaat aggcaaatta tgactgaatc attctcaatc atgtgtcggc taactgcttc 4621 tccagaattg gcctcagtgg tggcacaaga tctgcttgca ccctcagaaa tggacgttgg 4681 tgactacgtc ataagagtga aggaaggcct cccatccggc tttccatgca catcacaggt 4741 caatagcata aaccattggt tgataacttt gtgcgccctc tctgaggtga ctggcctgtc 4801 accagatgtt atccagtcca tgtcatactt ttctttctat ggtgatgatg aaatagtttc 4861 tactgacata gaatttgacc cagcaaagct aacacaagtc cttaaagagt atggcctcaa 4921 acccacccgc cctgacaaga gtgaaggtcc aataattgtg aggaagagtg tggatggcct 4981 ggtcttctta cgtcgcacca tttcccgcga cgccgcgggg ttccaggggc gactggaccg 5041 agcatccatt gaaaggcaga tctactggac cagagggccc aatcactcag acccctttga 5101 aaccttggtg ccccaccagc aaagaaaaat ccagctgata tcactgttag gtgaggcctc 5161 attgcatggt gaaaaattct acaggaagat ctcaagcaaa gtcatccaag agattaagac 5221 agggggcctt gaaatgtatg tgccagggtg gcaagccatg ttccgctgga tgcggttcca 5281 cgaccttggc ctgtggacag gagatcgcaa tcttctgccc gaattcgtaa atgatgatgg 5341 cgtctaagga cgccccaaca tcccctgatg gcgccagtgg cgccggccag ctggtaccgg 5401 aggctaatac agctgagcaa atttcaatgg accctgttgc gggtgcttca acagcagtcg 5461 cgacggctgg gcaagttaat atgattgacc catggatttt taataacttt gtccaggcac 5521 ctcaaggaga attcactatt tcccctaata atacccccgg tgacattttg tttgatttac 5581 aactaggacc ccaccttaat ccatttctag cccatctctc acagatgtac aatggttggg 5641 tcggcaatat gcgtgtgcgc atactgttgg ccgggaatgc cttcacagct ggaaagataa 5701 tcatttgctg tgtcccccct ggttttgatg ctagaatact cacaatagct caagcaactc 5761 tcttcccaca cttaattgct gatgtaagga cccttgagcc tgtggagctt cctttggagg 5821 atgtgcgcaa tgttctcttc cacaacagta gccaaccgca gccaacaatg cggctggttg 5881 ctatgttgta cactcccctc cgcactggtg gtggttccgg aggcactgat gcctttgtgg 5941 tggcgggtag ggtgcttacg tgccccgccc ccgactttag ttttctgttt cttgtccccc 6001 cttccgttga acaaaagacc agagttttca gtgtccccaa tatacccctg aaggatctct 6061 caaattcccg tgtccctgtg cctgtacagg gcatgtttat gtccccagat gttaatcagt 6121 cagtccagtt tcagaatgga cgctgccaaa ttgatggcca actccaaggc accactccag 6181 tctcgctcag ccaactctgc 
aagattaggg gtaaaacttc tagtaatgct agggtgctca 6241 acttaagtga ggtagatggc acacccttca tcccacttga atcaccagcg ccagttggtt 6301 ttcccgactt gggaggttgt gactggcatg taaacttcac tttccaggct caaaatcagg 6361 acccgtctca aagtgtaacc tttgcaacta atgatgccag ctttgtcccc tatctaggta 6421 gtatctctcc tcacaatggg ggagattttt atgcaggtga catcataggt agccttggtt 6481 ggatttcagc cccgtctgat aattcacaac ttaatgtttg gacaatacca aagtatggat 6541 ctagtctccc agatgtcact catcttgccc ctgctgtgtt ccccccaggc tttggggagg 6601 tgatcttgta cttctactct accttcccag gttctggaca atccagccaa cttcaagtcc 6661 catgtttgtt gcctcaggag ttcatcaccc acttctgtaa cgaacaggct cccatcgctg 6721 gggaggccgc cctcctccac tacgtggacc ctgatacggg gcgaaacttg ggggaattta 6781 aactttaccc tgatgggttt atgacctgtg tccccaatag tgttagtagt ggccctcaaa 6841 cccttcctat caatggagtc tttgtctttg tttcatgggt atctagattc tatcaactca 6901 agcctgtggg aacggcctca gcggctagaa ggcttggatt gcggcgcata taatggccca 6961 agctgtcatc ggtgccatag ccgcatctgc cgctggtagc atactagggg caggtataca 7021 ggctggtgct gaggctggtc tccaggctca acggtatcag caggatctgc agttgcaaca 7081 aaattcattc aagcatgata aggaaatgtt aggctaccaa gttcaggcta gtaatgctct 7141 tttagctaag aatcttaaca ctagatatgc tcttctgcag gcagggggcc tatctagtgc 7201 tgatgctgct cgggcagtgg ctggtgctcc tgtcacccgt atagtggact ggaatggcac 7261 gcgcattgca gcgcccacct caagcaccac tacacttaga tctggtggtt ttatggctgt 7321 ccctatacca ttgtcttcaa agaccaagca accagtgatg tctgggcagg ataatccaaa 7381 ttatgctgct tcttctatct ctagaactgc ttcatgggtg caatctcaaa attctatgag 7441 gtctgtttct ccctttcaca gtgatgctct gagaaccgta tgggttacac cacccggttc 7501 atcatcaact tcatctgtgc aatctagttt ctatggtgtt tttaatacag atagattgcc 7561 tctgttcgca aacagaaggt aatgaaattt tgtaatagga tgccagtggg caccatattc 7621 agatttgatt tttaattgga ttgttttaat taaaatttgg cttaattggt gttaaaaaa //
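The ORF coordinates listed for this genome (ORF1: 1..5347, ORF2: 5331..6953, ORF3: 6953..7582) can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the typing tool; the helper names `parse_orfs`, `span_length` and `overlap` are hypothetical and introduced only for illustration. It computes the length of each ORF span and how many nucleotides adjacent ORFs share, assuming 1-based inclusive coordinates as in the GenBank record.

```python
import re

# ORF coordinate summary copied from the record for MK789655
# (1-based, inclusive, as in GenBank feature locations).
ORF_SUMMARY = "ORF1: 1..5347 ORF2: 5331..6953 ORF3: 6953..7582"


def parse_orfs(text):
    """Return {name: (start, end)} for every 'ORFn: a..b' entry in text."""
    pattern = r"(ORF\d+):\s*(\d+)\.\.(\d+)"
    return {name: (int(a), int(b)) for name, a, b in re.findall(pattern, text)}


def span_length(start, end):
    """Length in nucleotides of an inclusive 1-based interval."""
    return end - start + 1


def overlap(a, b):
    """Number of nucleotides shared by two inclusive intervals (0 if none)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


orfs = parse_orfs(ORF_SUMMARY)
lengths = {name: span_length(*coords) for name, coords in orfs.items()}
orf1_orf2_overlap = overlap(orfs["ORF1"], orfs["ORF2"])
orf2_orf3_overlap = overlap(orfs["ORF2"], orfs["ORF3"])

for name, n in lengths.items():
    print(f"{name}: {n} nt")
print(f"ORF1/ORF2 overlap: {orf1_orf2_overlap} nt")
print(f"ORF2/ORF3 overlap: {orf2_orf3_overlap} nt")
```

ORF2 and ORF3 are complete coding sequences, so their span lengths (1623 nt and 630 nt) are multiples of three. ORF1 is 5'-partial in this record (`<1..5347` with `/codon_start=2`), so its span length does not correspond to a whole number of codons.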