Typing tool
|
Complete norovirus genomes
MK789654 | GI.3 | ||
---|---|---|---|
GI.P3 |
ORF1: 1..5356 ORF2: 5340..6977 ORF3: 6977..7624LOCUS MK789654 7648 bp RNA linear VRL 01-NOV-2019 DEFINITION Norovirus GI isolate G19_013 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK789654 VERSION MK789654.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7648) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S.F. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7648) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (02-APR-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7648 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="G19_013" /isolation_source="sewage" /db_xref="taxon:122928" /country="France: Nantes" /collection_date="08-Jan-2014" /note="genotype: GI.3-GI.P3" gene <1..5356 /gene="ORF1" CDS <1..5356 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QCA41813.1" /translation="NNNANTNNDNIGSRLMARIRGRMGPQRGETTTKITDANMALNLL RRSQTPSPSRQESPPKSQRDRPPRTASEVKKVLGWDVEPEHQESTAKAWCDLTQEEKE EIMRNNEKLFDAGGITPSTLPSTFERADPVDSPIEQQPVTWSASGGVDIGVNDLTTVS GPFWNMCPLPPLDARNNGPAKEPLIGDMIEFYEGHIFHYAIYIGQGKTIGVHSPQAAF SIPRITIHPLVAWWRVCYVPTNQQRLTYDQLKELENEPWPYASITNNCYEFCCRVMAL DDTWLERRLVSTGKFNHPTQDWSQDTPDFHQDSKLEMVRDAVLSAINGLVSQPFKNIL SKIKPLNVLNLLSNCDWTFMGVVELIVLLAELFDVFWTPPDISSFIASLLPEFHLQGP EDLARDLVPLILGGIGLAIGFTRDKVTKVMKSAVDGLRSATQLGQYGLEIFSIIKKYF FGGDQTEKTLRGIEDAVIDMEVLSSTNVTQLVKDKKLARTYMNVLDNEEEKARKLSVR SADPHIVTSVNNLISRISMARSALAKAQAEMTSRPRPVVIMMCGPPGIGKTKAAEHLA GRLANEIRPGGKVGLVPRESIDHWDGYHGEDVLLWDDYGMSKITEDCNKLQAIADTAP LSLNCDRIENKGMQFSSDAIIITTNAPGPAPVDFVNLGPVCRRVDFLVYCSAPEIEQM RRTHPGDANAIKDLYKRDYSHLKMELAPQGGFDSQGNTPFGKGIMKPTTLNRLLIQAT ALAMERQDEFQLQGAVYNFDEDRVSAFTNLARANGLGLLSMATLGKRLRNVKSMEGLR NALVGYKIGECDIIWNTRVYSIKSDGSIVTIKEKQTPTSPQYQAISTATLALSRLRAA RALAYASCLQSAVLSILQVAGSALVVSRAVKRMFGTRTEQPMLEGKHKEHNCRVHRAE AAGHGPIGHDGVIERYGLCESEQEEEGEQTVELPTANKEGKNKGKTKKGRGRKSNFNA FSRRGLSDEEYEEYKKIREEKSGNYSIQEYLEDRQRYEEELAEVQAGGDGGIGETEAE IRHRVFYKSKSGMRKQRQEERRQLGLVSGSEIRKRKPIDWTPPKNDWSEDTRTVNYDE HISFEAPPSIWSRVVKFGSGWGFWVSSTVFITTTHVIPPGAKEVFGEDLSNVAIHRVG EFTQFRFSKKMRPDLTGMVLEEGCPEGTVCTIMIKRDSGELLPLAVRMGAVASMKIQG KLMHGQSGMLLTGANAKGMDLGTIPGDCGAPYVHKRGNDWVVCGVHAAATKSGNTVVC AIQGGDGEATLEGGGQNKGHYAGHPILRYGNGPSLSTKTKFWKSTPQPLPPGTYEPAY LGGRDPRVEGGPSLQQVLRDQLKPFAEPRGRLPEAGLLEAAVETVTNAIEQVMDTPVA WSYSDACMSLDKTTSSGHPHHKKKNDDWNGNSFVRELGDQAAHANSMYELGKSMKPVY TAALKDELVKPDKVYTKIKKRLLWGADLGTVIRAARAFGPFCEAIKPHVIKLPIKVGM NAIEDGPLIYAEHSKYKFHYDADYTAWDSTQNREIMMESFNIMCKLTANPSLAAVVAQ DLLSPSEMDVGDYVISVKDGLPSGFPCTSQVNSINHWILTLCALSEVTGLSPDVIQSQ SYFSFYGDDEIVSTDIEFDPIRLTQILKEYGLKPTRPDKTDGPIIIRQQVDGLVFLRR TISKDAIGYQGRLDRNSIERQLWWTRGPNHEDPFETLVPHSQRKVQLISLLGEAALHG EKFYRKIAGRVIQEVKEGGLEIYIPGWQAMFRWMRFHDLSLWTGDRDLLPDYVNDDGV " mat_peptide <1..1171 /gene="ORF1" /product="p48" mat_peptide 1172..2260 /gene="ORF1" /product="NTPase" mat_peptide 2261..2860 /gene="ORF1" /product="p22" mat_peptide 2861..3280 /gene="ORF1" /product="VPg" mat_peptide 3281..3823 /gene="ORF1" /product="Pro" mat_peptide 3824..5353 /gene="ORF1" /product="RdRp" gene 5340..6977 /gene="ORF2" CDS 5340..6977 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCA41814.1" /translation="MMMASKDAPTNMDGTSGAGQLVPEANTAEPISMEPVAGAATAAA TAGQVNMIDPWIMNNYVQAPQGEFTISPNNTPGDVLFDLQLGPHLNPFLSHLAQMYNG WVGNMKVKVLLAGNAFTAGKIIISCIPPGFAAQNISIAQATMFPHVIADVRVLEPIEV PLEDVRNVLFHNNDNTPTMRLVCMLYTPLRASGSSSGTDPFVIAGRVLTCPSPDFSFL FLVPPNVEQKTKPFSVPNLPLNTLSNSRVPSLIKSMMVSRDHGQMVQFQNGRVTLDGQ LQGTTPTSASQLCKIRGSVFHANGGNGYNLTELDGSPYHAFESPAPIGFPDLGECDWH MEASPTTQFNTGDVIRQINVKQESAFAPHLGTVQVDGLSDVSVNTNMIAKLGWVSPVS DGHKGDVDPWVIPRYGSTLTEAAQLAPPIYPPGFGEAIVFFMSDFPIAHGTNGLSVPC TIPQEFVTHFVNEQAPTRGEAALLHYLDPDTNRNLGEFKLYPDGFMTCVPNSSGTGPQ TLPINGVFVFVSWVSRFYQLKPVGTAGPARRLGIRRS" gene 6977..7624 /gene="ORF3" CDS 6977..7624 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCA41815.1" /translation="MAQAILGAIAATAAGSAVGAGIQAGTEAALQHQRFQQDLTLQKN SFIHDKEMMGLQVEASTALLQNSLGTRYNMLTKAGMTSADAARMVVGAPATRVVDWNG TRIAAPMSTATTLRSGGFMTVPTVYRGSKNKQSIGGGFSNVNYDPSVSSSRTSQWVSS QNSMRSTLEPFHPGALRTTWVTPPGSTSTSTISTVSTVPKFFNTERLPLFANRGK" ORIGIN 1 caataacaat gctaacacca acaatgataa cattggatcc cgcctcatgg cgaggatccg 61 tgggcgcatg ggcccacaac gtggtgaaac aaccacaaaa ataacagacg ccaatatggc 121 gcttaatctg ctacgccggt cacaaacacc ttccccatcc cggcaagaga gccctcctaa 181 gagccagaga gaccgccccc ctcggactgc ttctgaggtg aagaaagtgt tggggtggga 241 tgttgaacca gaacaccagg aaagcacggc caaggcatgg tgcgacctca cgcaagaaga 301 gaaggaggaa ataatgcgca acaatgaaaa gctcttcgat gctgggggaa tcaccccatc 361 aactttgcct agcacgtttg aaagggccga cccagttgat agccccattg aacagcaacc 421 tgtgacatgg tctgctagtg gtggggtcga tattggtgtt aatgatttga ccaccgtcag 481 tggcccattt tggaatatgt gccccctccc gccactggat gctcggaata atggccccgc 541 taaggagccc ctcataggag atatgataga gttctatgag ggtcatatat tccattatgc 601 catctatatt ggacaaggta agaccattgg ggtccactca ccgcaagcag ctttctccat 661 accaaggata accatccatc ctctggtagc ctggtggcgc gtgtgctacg ttccaactaa 721 ccagcagcgc ctcacttatg accagctcaa agagctggag aatgagccat ggccctacgc 781 ttccattaca aacaattgct atgagttttg ctgcagagtg atggcacttg acgacacatg 841 gcttgagagg aggttagtca gcactggaaa attcaatcac cccacacaag attggtcaca 901 ggataccccg gattttcacc aggactcgaa acttgagatg gtaagagatg ccgtgctatc 961 agctattaat ggccttgtgt cacagccctt taaaaacatt ttgtcaaaga tcaaaccatt 1021 aaacgtcctc aatctcttgt caaattgtga ttggaccttc atgggggttg tggaactgat 1081 tgtgctccta gcggagttgt ttgatgtgtt ctggacccca ccagacatct caagtttcat 1141 tgcctcacta ctacctgaat ttcacctgca gggcccagaa gacctagcta gggacttggt 1201 ccctctcatc ctaggtggca taggcctggc tattggcttt acaagagaca aagtgacgaa 1261 agtgatgaag agtgccgtcg atgggttgag atcggcgaca cagctcgggc agtacggact 1321 ggaaatattc tcaatcatca agaaatattt ctttggtggg gatcaaacgg aaaagacctt 1381 gcggggtatt gaggatgcag tgatagacat ggaggtgtta tctagcacca atgtgacgca 1441 attggttaaa gataagaaat tggccagaac ctacatgaat gtgctagaca atgaagagga 1501 aaaagcaagg aaactctcag tccgtagtgc tgatccgcac atagtgacct cagtcaacaa 1561 tctgatatca cgtatatcta tggcaagatc agccttggcc aaagcccagg ctgagatgac 1621 atctcgccca cgccctgtgg tcataatgat gtgtggcccc cccggcatcg gaaaaaccaa 1681 ggccgccgag cacctggctg gccgattggc aaacgagatt agaccaggag gtaaggttgg 1741 acttgtgccc cgggaatcca tagatcactg ggacggatac catggcgagg atgtcctact 1801 atgggatgac tatggcatgt caaagataac agaggactgt aataagctac aagccatagc 1861 tgacactgca cccctatcac ttaattgtga caggatagag aacaaaggca tgcaattctc 1921 ctcagatgca ataatcataa ccaccaatgc ccctggcccg gccccggtgg actttgtcaa 1981 cctaggccct gtgtgcaggc gagtggactt tttagtttac tgttctgccc ctgaaattga 2041 gcaaatgaga aggacccatc ctggtgatgc caatgccatc aaagacctat acaaaaggga 2101 ctactcacac ttaaaaatgg aattggcccc acaaggtggt tttgatagtc agggtaacac 2161 accatttggg aagggcatca tgaaacccac gaccctcaat aggctactca ttcaggcaac 2221 agcattggct atggagcggc aagacgaatt ccagctccaa ggggctgtct acaactttga 2281 tgaagacagg gtgtccgcct tcacaaacct ggcccgagct aacggcttgg ggcttttaag 2341 catggcaaca ttgggtaaaa ggctcagaaa tgtcaaatct atggagggcc tgcgtaacgc 2401 attggtgggc tacaaaatag gggagtgtga tataatctgg aacactagag tgtattcaat 2461 taaaagtgat ggtagtattg tcacaatcaa ggaaaaacaa accccaactt caccccaata 2521 ccaagccatt agcacagcca ctcttgcact ctctaggttg cgggccgcta gagctctcgc 2581 gtacgcctcc tgcctgcaga gtgcagtgct gtcaatactt caagtagctg gctctgccct 2641 agtggtgagc agagccgtta agcgcatgtt tggcacgcgc actgaacaac ctatgttgga 2701 aggcaagcac aaagaacaca actgtagagt gcaccgcgca gaggcagctg gccatggtcc 2761 tattgggcac gatggagtca tcgaacgcta tggcctctgt gagagtgagc aagaagaaga 2821 aggcgagcaa acagttgagc tacccactgc caataaagaa ggcaaaaaca agggcaagac 2881 caaaaagggc cgtggacgaa aatccaattt taatgctttc tcaagacgag gcctcagtga 2941 tgaagagtat gaagaataca agaagatcag agaagaaaag agtggcaatt acagcatcca 3001 ggagtacctc gaggaccgcc aacgatatga ggaagaactt gctgaagtgc aggctggtgg 3061 ggatggtgga attggtgaga ctgaagcaga gattcgccat cgggttttct ataaatcaaa 3121 atccggcatg agaaagcaac gccaagaaga acggcgtcag ttgggtctcg ttagcggttc 3181 tgagatccgc aaacgcaagc ccatagactg gaccccccct aagaatgatt ggtcagaaga 3241 caccaggact gtcaactatg atgaacacat tagctttgaa gcccctccct caatttggag 3301 tagggtggtg aagtttggca gtggttgggg tttttgggtg agctcaacag tgttcataac 3361 cacgacccat gtcataccac ctggggccaa agaagtcttt ggtgaagacc tcagtaatgt 3421 ggccatccat agagttggtg agttcaccca gttccgtttc tccaaaaaga tgagacctga 3481 cctcaccggt atggttttag aagaaggttg cccagaggga acagtgtgca ccataatgat 3541 caagagagat tctggtgagt tgctgccgct cgccgttcgg atgggggcag tggcatcaat 3601 gaagatccaa ggcaaactta tgcatgggca atcaggaatg ttactcaccg gtgctaatgc 3661 caaaggcatg gaccttggga ctatacccgg tgattgtggg gcaccctatg tccacaagag 3721 gggaaatgac tgggtggtct gtggagtaca tgcagccgcc acaaaatcag gcaacactgt 3781 ggtgtgcgcc atacaaggtg gcgatggtga agccactcta gaaggcggcg gtcaaaacaa 3841 gggccactat gcagggcatc cgatactaag gtatgggaat ggcccttcat tatcaaccaa 3901 aacaaaattt tggaaatcaa ccccccagcc tttgccccct ggtacatatg agccagctta 3961 ccttggaggc agagacccca gagttgaggg gggcccatcc ttacagcagg tgttaaggga 4021 ccagttaaaa ccatttgccg agccccgtgg cagattaccc gaagcgggtt tgttggaagc 4081 tgcagttgaa acggtcacca atgctattga acaggtcatg gacacaccgg tggcatggag 4141 ctatagtgat gcttgcatgt cactagataa gacaaccagt tccggccatc cccatcacaa 4201 gaagaaaaat gatgattgga atgggaattc atttgtgaga gaattgggtg accaggcggc 4261 acacgctaac agcatgtatg agcttggtaa gtccatgaaa cctgtgtaca cagcagccct 4321 aaaggatgag ctagtcaaac cagataaggt gtacacaaag atcaaaaaga gactgctttg 4381 gggcgcggac ctcggtaccg tcattcgcgc tgccagagca tttggaccat tttgtgaggc 4441 cataaaaccc catgtaatta agttgcctat caaggtgggt atgaatgcca tagaggatgg 4501 ccccttgatt tatgctgagc actccaaata taaatttcat tatgatgctg actatacagc 4561 ttgggactca acacaaaaca gagaaatcat gatggaatct ttcaacatta tgtgcaagct 4621 tacagccaat ccctccttgg ccgcagtggt ggcacaggat ctactctccc catctgaaat 4681 ggatgttggt gactatgtga tcagtgtcaa agatggttta ccatctggct ttccatgcac 4741 ttcacaggtg aacagtatca accactggat actaaccctg tgtgcactgt cagaagtcac 4801 tggcttgtcc ccagatgtga tacagtcaca atcctacttc tcattttatg gtgatgatga 4861 aatagtctca acagatatag aatttgaccc aattagactg acacaaatat tgaaggaata 4921 tggtttgaag cctacaagac ctgacaaaac tgatggccca attattatta gacaacaggt 4981 tgatggcctg gtcttcctcc ggcgcactat ttctaaggat gccattggat accagggacg 5041 actcgaccgt aactctattg aaaggcagct ttggtggact cgtggaccaa accatgagga 5101 cccatttgag acactggtcc cacattcaca gaggaaggtc caattaatat ccttgctagg 5161 tgaagcagca ctccacggtg aaaagttcta caggaagata gctggcaggg tcatccaaga 5221 agtcaaagag ggagggcttg aaatctacat tcctggctgg caggccatgt tccgctggat 5281 gcgattccat gatctaagtt tgtggacagg agaccgcgat ctcctgcccg attatgtaaa 5341 tgatgatggc gtctaaggac gccccaacaa acatggatgg caccagtggt gccggccagc 5401 tggtaccaga ggcaaataca gctgagccta tatcaatgga gcccgtggct ggggcagcaa 5461 cagctgccgc aactgctggc caagttaata tgattgaccc ctggataatg aacaattatg 5521 tacaagcccc ccaaggtgaa tttaccatat cgcctaataa cacaccaggt gatgttttgt 5581 ttgatttaca gttaggcccc caccttaacc ctttcttatc tcatttggcc caaatgtata 5641 atggctgggt tggcaatatg aaagtgaagg ttctattggc tggtaatgct ttcacggctg 5701 gtaaaataat tattagttgc ataccccctg gctttgccgc acagaatatt tctattgccc 5761 aggccaccat gttcccccat gtcatagctg atgttagggt tttggaaccc attgaggtgc 5821 cattggaaga tgtgaggaat gtgctcttcc acaacaatga caacacgcca actatgcggt 5881 tggtgtgcat gctatatacc cccttgcggg ccagtggtag ctcgtctgga actgacccct 5941 ttgtgattgc tgggcgtgtt ttgacatgcc caagtcctga ttttagcttt ttgtttttgg 6001 tcccccccaa tgtagagcaa aagactaaac ctttcagtgt cccaaacctt ccattgaata 6061 ccctttcaaa ttcgagagtc ccttctctaa tcaaatcaat gatggtgtcc agggaccatg 6121 ggcagatggt tcagtttcaa aatggtaggg tcaccctaga tgggcagctg caaggcacca 6181 cacccacatc agctagtcag ctgtgcaaaa tcaggggcag tgtcttccat gccaatggtg 6241 ggaatgggta caacttaaca gaactggatg ggagcccata ccacgcattt gagagccccg 6301 caccaatagg gtttccggat ctaggtgaat gtgattggca catggaggct tctcctacca 6361 cccaatttaa tactggagat gttataagac aaattaatgt taaacaagag tcagcatttg 6421 ccccccacct tggcaccgta caggtagacg gcctgagtga tgtgagtgtc aacaccaaca 6481 tgatagctaa attaggatgg gtgtctcccg tcagtgatgg acacaaaggg gatgtcgacc 6541 cgtgggtcat tccacgctat ggttcaactc tgactgaggc cgcccaatta gcccccccaa 6601 tatatccccc aggctttggt gaggccattg tgtttttcat gtcagatttt cccattgccc 6661 atggcaccaa tggcttgagt gtgccttgca ctatacccca agaatttgtc acccattttg 6721 tcaatgaaca ggcccctact agaggggaag cagccttgct acactatctg gaccctgata 6781 ctaatagaaa tcttggtgag tttaaattat atcctgatgg cttcatgaca tgtgtgccta 6841 actccagtgg cactggtcca caaactcttc caatcaatgg tgtctttgtc tttgtgtcct 6901 gggtttctag attttatcaa ttaaagcctg tgggaacagc cggcccggct cgtaggcttg 6961 ggattaggag atcataatgg ctcaagccat acttggagcc atagcagcaa cagcagccgg 7021 tagtgctgtt ggtgctggta tacaagctgg aaccgaagct gccctacaac atcaaagatt 7081 tcagcaagat ttgaccttac aaaagaattc ctttattcat gataaagaga tgatgggcct 7141 acaggtagag gcttcaactg cactcctcca aaacagcttg ggaacgaggt acaatatgtt 7201 gaccaaagca gggatgacat ccgcagacgc ggcacgtatg gtagtggggg cacccgcgac 7261 tcgtgtcgtt gactggaacg ggactaggat cgccgcaccc atgtcaactg cgacgacact 7321 taggtctggt ggtttcatga ccgtaccaac tgtttatagg ggtagtaaaa ataaacaatc 7381 aattggtggt ggtttttcta atgtaaatta tgatccctca gtctcctctt cccgcacttc 7441 tcaatgggtc tcttctcaaa attcaatgcg ttccactttg gaaccatttc atccaggtgc 7501 tctgagaacc acatgggtca ccccacctgg gtcaacttct acttctacaa tttctacagt 7561 ttctactgtg ccaaaattct ttaatacaga aaggttaccc ttattcgcaa acaggggtaa 7621 gtgattttgt aatatgaatt agtgggca //