Typing tool
|
Complete norovirus genomes
MK907797 | GII.4 Sydney | ||
---|---|---|---|
GII.P31 |
ORF1: 1..4677 ORF2: 4658..6280 ORF3: 6280..6976LOCUS MK907797 6976 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_033 nonstructural polyprotein (ORF1) gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene, partial cds. ACCESSION MK907797 VERSION MK907797.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 6976) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 6976) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..6976 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_033" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="13-May-2014" /note="genotype: GII.4-GII.Pe" gene <1..4677 /gene="ORF1" CDS <1..4677 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QCO93088.1" /translation="LHVERGLILGVHKPPAAISLAKVELAPLSLFWRPVYTPQYLISP DTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWNRKPL PTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIVESLI LLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTKEKIG KMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELAMVRSIEDAVLDLEAIE NNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINSLLARIAAARSLV HRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNGVDHW DAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKVFDSDAIIITT NLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFSHIKL SLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQGPALTTFNFDR NKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNGGTYT LEADGKGSVKVDRVQSATVQTNNELAGALHHLRCARIRYYVKCVQEALYSIIQIAGAA FVTTRIAKRMNIQNLWSKPQVEDTEETANKDGCLKPKDDEEFVVSSDDIKTEGKKGKN KSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIARATE EDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWADDDR SVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVPIKQI QIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTLLIKRPTGELMPLAARMGTHA TMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTAAARG GNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWRSSTTPLPPGT YEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVLEQTI DPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASKANLMFEEGKN MTPVYTGALKDELVKTEKVYGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHCVTLP VRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPEPHLA QTVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTDLSPD IIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVISEDLDGL TFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMSLLGE AALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFV NEDGVE" mat_peptide <1..567 /gene="ORF1" /product="p48" mat_peptide 568..1665 /gene="ORF1" /product="NTPase" mat_peptide 1666..2202 /gene="ORF1" /product="p22" mat_peptide 2203..2601 /gene="ORF1" /product="VPg" mat_peptide 2602..3144 /gene="ORF1" /product="Pro" mat_peptide 3145..4674 /gene="ORF1" /product="RdRp" gene 4658..6280 /gene="ORF2" CDS 4658..6280 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93089.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPCQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEVNQNTKFTPVGVIQDG GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6280..>6976 /gene="ORF3" CDS 6280..>6976 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93090.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEYENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKPS NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI SFVTPPSSRSSSQG" ORIGIN 1 ctgcacgtgg aacgaggtct tatacttggt gtgcacaaac caccggcagc cattagcctt 61 gccaaggtcg agctagcacc gctctctttg ttctggagac ccgtatacac cccacagtat 121 ctcatctctc cagacactct taggagatta catggagagt cgttccccta cactgcattc 181 gacaacaatt gctacgcctt ttgttgttgg gtattagacc taaacgactc atggctgagc 241 aggagaatga ttcaaagaac aacaggcttc ttcaggccgt accaggattg gaacaggaaa 301 cccctcccca ctatggatga ttccaaatta aagaaggtag ccaacatatt cttgtgcact 361 ttgtcttcac tattcaccag acccattaag gacataatag ggaagttgaa acctcttaac 421 atccttaaca ttctggctac atgtgattgg accttcgcag gcatagtgga atccttaata 481 ctcttggcag aactctttgg agtcttctgg acacccccag atgtgtctgc gatgatcgcc 541 cccttgctag gtgattatga actgcaagga cctgaggacc ttgcagtgga attggtccca 601 atagtgatgg gggggatagg tttggtgcta ggatttacca aagagaaaat cggaaagatg 661 ctatcatccg ccgcatccac tttaagagct tgtaaagacc ttggtgcata cggactggaa 721 atcttaaaat tggtcatgaa gtggttcttc ccaaagaaag aggaagcaaa tgaactggct 781 atggtgagat ccatcgagga tgcagtacta gacctcgagg caattgaaaa caaccacatg 841 accaccctac tcaaagacaa agacagcttg gcaacctaca tgagaaccct tgaccttgag 901 gaggagaaag ccagaaaact ctcaaccaaa tctgcttcac ccgatattgt gggcacaatc 961 aactctcttc tggcaagaat cgctgctgca cgttccctag tgcatcgggc gaaagaagag 1021 ctctccagca ggccgagacc tgtcgttgtg atgatatcgg gaagaccagg gatagggaaa 1081 actcaccttg ccagggagct ggccaagaag atcgcggcct ctctcacagg ggaccagcgt 1141 gtgggtctta tcccacgcaa tggtgtcgac cactgggacg catacaaggg cgaaagagtt 1201 gtcctatggg acgactatgg aatgagcaac cccatccatg atgccctcag gttgcaggag 1261 cttgctgaca cttgccccct cacgctaaat tgtgacagaa ttgagaacaa ggggaaagtc 1321 tttgacagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat 1381 gtcaactttg aagcgtgctc gagacgcatt gatttcctcg tgtacgcaga agcccctgag 1441 gtggagaagg caaagcgcga cttcccaggt caacctgaca tgtggaagaa cgctttcagt 1501 cctgacttct cacacataaa actgtcattg gctccacagg gtggtttcga caagaacggc 1561 aacaccccgc atggaaaagg ggtcatgaag accctcacca ctggctccct catcgcccga 1621 gcatcagggt tactccatga gaggctagat gaatatgaac tgcaaggccc agccctcacc 1681 actttcaact ttgaccgcaa caagatactt gcttttagac agcttgctgc tgaaaacaag 1741 tatgggttga tggacacaat gagagttgga aaacagctca aggatgtcaa gaccatgtca 1801 gacctcaaac aagcactcaa gaacatcgcg atcaagaagt gccagatagt gtacaatggt 1861 ggcacctaca cacttgaagc tgatggcaag ggtagtgtga aagttgacag agtgcaaagt 1921 gccactgtgc agaccaacaa tgaactagcc ggtgccctac accacctaag gtgcgctaga 1981 atcagatact atgttaagtg cgtccaggag gcactgtatt ccatcatcca aatcgctggg 2041 gctgcattcg tcaccacgcg catcgctaag cgcatgaata tacaaaatct ctggtccaag 2101 ccacaggtgg aagacacaga agagacggcc aacaaagatg gttgcctgaa acccaaagat 2161 gatgaagagt ttgtcgtctc atccgacgac atcaaaactg agggcaagaa agggaagaac 2221 aagtccggcc gtggcaagaa gcacacggcc ttttcaagta aagggctcag tgatgaggag 2281 tacgatgagt acaagagaat cagagaagaa aggaatggta agtactccat agaagagtac 2341 cttcaggaca gagacaggta ctacgaggag gtggccattg ccagggcaac cgaagaggac 2401 ttctgtgaag aagaagaggc caaaatccgg cagagaattt ttaggccaac aaggaaacaa 2461 cgcaaagaag aaagggcctc tctcggcttg gtcacaggct ctgaaatcag gaagagaaac 2521 ccagaagact tcaaacccaa gggaaagttg tgggctgatg atgacagaag tgttgactac 2581 aatgagaaac tcaactttga agccccacca agcatctggt cgcggatagt caactttggc 2641 tcaggctggg gcttctgggt ctcccccagt ctgtttataa catcaaccca tgtcataccc 2701 caaggtgcaa aagagttctt cggagtccct atcaagcaaa tccagataca caagtcaggt 2761 gaattctgcc ggttgagatt cccaaagcca atcagaactg atgtgacggg catgattcta 2821 gaagaaggtg cgcccgaggg gaccgtggtc acactgctca tcaagagacc aactggagag 2881 ctcatgcccc tggcagccag aatggggacc catgcaacca tgaaaattca ggggcgcaca 2941 gttggagggc aaatgggtat gctcctgaca gggtccaacg ccaagagtat ggacctaggc 3001 acaacaccag gcgactgcgg ctgcccctac atctacaaga gggggaatga ctacgtggtc 3061 ataggagtcc atacggccgc tgcccgtgga ggaaacactg tcatatgtgc cacccagggg 3121 agtgagggag aagccacact cgaaggaggt gacagtaaag ggacatactg tggcgcacca 3181 atcttgggcc cagggagcgc tccgaagctc agcaccaaga ctaagttttg gagatcgtcc 3241 acaacaccac tcccacctgg cacctacgaa ccagcctatc tcggtggcaa agaccctaga 3301 gtcaaaggtg gcccttcatt gcaacaagtt atgagggacc agctgaagcc attcacagaa 3361 cccagaggta aaccaccaag accaaatgtg ttggaagctg ccaagaaaac catcatcaat 3421 gtccttgagc aaacaattga tccaccccaa aaatggtcat ttgcgcaagc ttgcgcatcc 3481 cttgacaaaa ccacctccag cggccacccg caccacatgc ggaaaaacga ctgttggaat 3541 ggggagtcct tcacaggaaa attggctgat caagcctcca aggccaacct aatgtttgaa 3601 gagggaaaga acatgactcc agtctacaca ggtgcactta aagatgagtt ggtaaagacc 3661 gaaaaagttt atggtaaggt caagaagagg cttctgtggg gttcagatct ggcgaccatg 3721 atacggtgcg cccgagcttt tggaggcctt atggatgaac tcaaggcaca ctgtgtcaca 3781 cttcctgtca gagttggtat gaacatgaat gaggatggcc ccatcatctt tgagaagcac 3841 tccagatata gatatcacta tgatgctgat tattcccggt gggactcaac acagcaaagg 3901 gatgtgctag cagcagcact agaaatcatg gttaagttct ctccagaacc acacctggcc 3961 cagacagttg cagaagacct cctttcccct agcgtgatgg atgtaggtga ctttcaaata 4021 tcaataagtg agggtctccc ctctggggta ccttgtacct cccagtggaa ttccatcgcc 4081 cactggctcc tcactctgtg tgcactctct gaagtcacgg acctgtcccc tgatatcatt 4141 caggccaact cccttttctc cttctatggt gatgatgaga ttgtaagcac agacataaag 4201 ttggacccag agaagctgac agcaaaactt aaggagtatg ggctgaaacc aacccgcccc 4261 gacaaaactg aaggacccct tgttatctct gaagacctgg atggcctgac attcctccgg 4321 agaactgtga cccgtgatcc agctggctgg tttggaaaat tggaacaaag ttcaattctc 4381 aggcaaatgt actggaccag gggtcccaac catgaagatc cttttgaaac aatgatacca 4441 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcgct ccacggcccg 4501 gcattctata gcaaaattag caaattagtc attgcagagt tgaaggaagg tggcatggat 4561 ttttacgtac ccagacaaga gccaatgttc agatggatga gattctcaga tctgagcacg 4621 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc 4681 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc 4741 tctggagccc gttgttggtg ccgccattgc ggcacccgta gcgggccaac aaaatgtaat 4801 tgacccctgg attagaaata attttgtaca agcccctggt ggagagttta cagtatcccc 4861 tagaaacgct ccaggtgaaa tactatggag cgcacccttg ggccctgatc taaatcccta 4921 cctatcccat ttggccagaa tgtacaatgg ttatgcaggt ggttttgaag tgcaggtaat 4981 tctcgcgggg aacgcgttca ccgccgggaa ggttatattt gcagcagtcc caccaaattt 5041 tccaactgaa ggcttgagcc cctgccaggt tactatgttc ccccatatag tagtagatgt 5101 taggcaacta gaacctgtgt tgattccctt acccgatgtt aggaataatt tctatcatta 5161 caatcaatca aatgacccca ccattaagtt gatagcaatg ttgtatacac cacttagggc 5221 taacaatgct ggggatgatg tcttcacagt ttcttgccga gttctcacga gaccttcccc 5281 cgattttgat ttcatatttc tagtgccacc cacagttgag tcaagaacta aaccattctc 5341 tgtcccagtt ttaactgttg aggagatgac caattcaaga ttccccattc ctttggaaaa 5401 gttgttcacg ggtcccagca gtgcctttgt tgtccaacca caaaacggta ggtgcacgac 5461 tgatggcgtg ctcctaggca ccacccaact gtctcccgtc aacatctgca ccttcagagg 5521 agatgtcacc catatcacag gtagtcataa ctacacaatg aatttggctt ctcaaaattg 5581 gagcaattat gacccaacag aagaaatccc agcccctcta ggaactccag actttgtggg 5641 gaagattcaa ggcgtgctca cccaaaccac aaggacagat ggctcaacac gcggccacaa 5701 agccacagtg tacactggga gcgccgactt tgctccaaaa ctgggtagag ttcaatttga 5761 aactgacaca aaccatgatt ttgaagttaa tcaaaacaca aagttcaccc cagttggtgt 5821 catccaagat ggtggcacca cccaccgaaa tgaaccccaa cagtgggtgc tcccaagtta 5881 ctcaggcaga aatactccta atgtgcatct ggcccccgct gtagccccca cttttccggg 5941 tgagcaactt ctcttcttca gatccaccat gcccggatgc agcgggtacc ccaacatgga 6001 tttggactgt ctgctccccc aggaatgggt gcagtacttc taccaagagg cagccccagc 6061 acaatctgat gtggctctgc taagatttgt gaatccagac acaggtaggg ttttgtttga 6121 gtgtaagctt cataaatcag gctatgttac agtggctcac actggccaac atgatttggt 6181 tatccccccc aatggttatt ttaggtttga ttcctgggtc aaccaatttt acacgcttgc 6241 ccccatggga aatggaacgg ggcgtagacg tgcattataa tggctggagc tttctttgct 6301 ggattggcat ctgatgtcct tggctctgga cttggttccc ttatcaatgc tggggctggg 6361 gccatcaatc aaaaagttga gtatgaaaat aacagaaaat tgcaacaagc atccttccaa 6421 tttagcagca atctacaaca ggcttctttt caacatgaca aagagatgct ccaagcacaa 6481 attgaggcca ccaaaaggct acaacaagaa atgatgaaag ttaagcaggc aatgctccta 6541 gagggtgggt tctctgagac agatgcagcc cgcggggcaa tcaacgcccc catgacaaaa 6601 gctttggact ggagcgggac aaggtactgg gctcccgatg ctaggactac aacatacaat 6661 gcaggccgct tttccacccc tcaaccatcg ggggcactgc caggaagagc taatcttagg 6721 gatgctgtcc ctgctcgggg ttcctccagc aagccttcta attcttctac tgccacttct 6781 gtgtactcaa atcaaactac ttcaacgaga cttggttcta cagctggttc tggtaccagt 6841 gtctcgagct tcccgtcaac tgcaaggact aggagctggg ttgaggatca aagtaggaat 6901 ttgtcacctt tcatgagggg ggcccacaac atatcgtttg tcaccccacc atctagcaga 6961 tcctctagcc aaggga //