Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| MK907785 | GII.4 Sydney | GII.P31 |

ORF1: 1..5038 ORF2: 5019..6641 ORF3: 6641..7447LOCUS MK907785 7497 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_014 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK907785 VERSION MK907785.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7497) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7497) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7497 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_014" /isolation_source="sewage" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="08-Jan-2014" /note="genotype: GII.4-GII.Pe" gene <1..5038 /gene="ORF1" CDS <1..5038 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QCO93054.1" /translation="KSSSDGVLSSMAVTFKRALGARPKQPPPKEIPPRPPRPPTPDLV KKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPEETNTAFSVPPLNQRESRDAKE PLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELTPLSLFWRPVYTPQY LISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWN RKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIV ESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTK EKIGKMLSSAASTLRACKDLGAYGLEILKLIMKWFFPKKEEANELAMVRSIEDAVLDL EAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINSLLARIAAA RSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNG VDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKVFDSDAI 
IITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFS HIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQGPALTTF NFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNG GSYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKCVQEALYSIIQI AGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGCLKPKDDEEFVVSSDDIKTEGK KGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIA RATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWA DDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVP IKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRPTGELMPLAARM GTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTA AARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWRSSTTPL PPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVL EQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASKANLMFE EGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHC VTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPE PHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTD LSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVISED LDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMS LLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLA PSFVNEDGVE" mat_peptide <1..928 /gene="ORF1" /product="p48" mat_peptide 929..2026 /gene="ORF1" /product="NTPase" mat_peptide 2027..2563 /gene="ORF1" /product="p22" mat_peptide 2564..2962 /gene="ORF1" /product="VPg" mat_peptide 2963..3505 /gene="ORF1" /product="Pro" mat_peptide 3506..5035 /gene="ORF1" /product="RdRp" gene 5019..6641 /gene="ORF2" CDS 5019..6641 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93055.1" /translation="MKMASSDVNPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG 
STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVL" gene 6641..7447 /gene="ORF3" CDS 6641..7447 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93056.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPTRGSSSKSS NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 aaaatcttca agtgacggtg tgctttctag catggctgtc acttttaagc gggccctcgg 61 ggcgcggcct aaacagccgc ccccgaagga gataccaccc agacccccgc gaccacccac 121 gccagacttg gttaaaaaga tccctcctcc cccacccaac ggggaggatg aactagtggt 181 ctcttacagc gccaaagatg gcgtttccgg actgcctgag ctcaccactg tcagacaacc 241 ggaagaaacc aacacggcgt tcagtgtccc cccactcaac caaagggaga gcagggacgc 301 caaggagcca ctaactggaa caattattga aatgtgggat ggagaaatct accattacgg 361 cctgtacgtg gaacgaggtc ttatacttgg tgtgcacaag ccaccggcag ccatcagcct 421 tgccaaggtc gagctaacac cgctctcttt gttctggaga cctgtataca ccccccagta 481 tctcatctct ccagacactc ttaggagatt acatggagag tcattcccct acactgcatt 541 tgacaacaat tgctacgcct tttgttgttg ggtattagac ctaaacgact catggctaag 601 caggagaatg attcagagaa caacaggctt cttcaggccg taccaggatt ggaacaggaa 661 acccctcccc actatggatg attccaaatt aaagaaggta gccaacatat tcttgtgcac 721 tttgtcttca ctattcacca gacccattaa ggacataata gggaagttga aacctcttaa 781 catactcaac attctggcca catgtgattg gaccttcgca ggcatagtgg aatccttaat 841 actcttggca gaactctttg gagttttctg gacaccccca gatgtgtctg cgatgatcgc 901 ccccttgcta ggtgattatg aactgcaagg acctgaggac cttgcagtgg aactggtccc 961 aatagtgatg ggggggatag gtttggtgct aggatttacc aaagagaaaa tcggaaagat 1021 gctatcatcc gctgcatcca ctttaagagc ttgtaaagac cttggtgcat acggactgga 1081 aatcttaaaa ttgatcatga agtggttctt cccaaagaag gaggaagcaa atgaactggc 1141 tatggtgaga tccatcgagg atgcagtact agacctcgag 
gcaattgaaa acaaccacat 1201 gaccaccctg ctcaaagaca aagacagctt ggcaacctac atgagaaccc ttgaccttga 1261 ggaggagaaa gccagaaaac tctcaaccaa atctgcttca cccgatattg tgggcacaat 1321 caactctctt ctggcaagaa tcgctgctgc acgctcccta gtgcatcggg cgaaagaaga 1381 gctctccagc aggccgagac ctgtcgttgt gatgatatcg ggaagaccag ggatagggaa 1441 aactcacctt gccagggagc tggccaagaa gatcgcggcc tccctcacag gggaccagcg 1501 tgtgggtctt atcccacgca atggtgtcga ccactgggac gcatacaagg gcgaaagagt 1561 tgtcctatgg gacgactatg gaatgagcaa ccccatccat gatgccctca ggctgcagga 1621 gcttgctgac acttgccccc tcacgctaaa ttgtgacaga attgagaaca aagggaaagt 1681 ctttgacagt gatgccataa ttatcaccac caacctggcc aacccagcac cactggatta 1741 tgtcaacttt gaagcgtgct cgagacgcat tgatttcctc gtgtacgcag aagcccctga 1801 ggtggagaag gcaaagcgcg acttcccagg tcaacctgac atgtggaaga acgctttcag 1861 tcctgacttc tcacacataa aactgtcatt ggctccacag ggtggttttg acaagaacgg 1921 caacaccccg catggaaaag gggtcatgaa gaccctcacc actggctccc tcatcgcccg 1981 agcatcaggg ttactccatg agaggctaga tgaatatgaa ctgcaaggcc cagccctcac 2041 cactttcaac tttgaccgca acaagatact tgcttttaga cagcttgctg ctgaaaacaa 2101 gtatgggctg atggacacaa tgagagttgg aaaacagctc aaggatgtca agaccatgtc 2161 agacctcaaa caagcactca agaacatcgc gatcaagaag tgccagatag tgtacaatgg 2221 tggctcctac acacttgagg ctgatggcaa gggtagtgtg aaagttgaca aagtgcaaag 2281 tgccactgtg cagaccaaca atgaactagc cggtgcccta caccacctaa ggtgcgctag 2341 aatcagatac tatgttaagt gcgtccagga ggcgctgtat tccatcatcc aaatcgctgg 2401 ggctgcgttc gtcaccacgc gcatcgctaa gcgcatgaat atacagaatc tctggtccaa 2461 gccacaggtg gaagacacag aagagatggc caacaaagat ggttgcctaa aacccaaaga 2521 tgatgaagag tttgtcgtct catccgacga catcaaaact gagggcaaga aagggaaaaa 2581 caagtccggc cgtggcaaga agcacacagc cttttcaagt aaagggctca gtgatgagga 2641 gtacgatgag tacaagagaa tcagagaaga aaggaatggt aagtactcca tagaagagta 2701 ccttcaggac agagacaggt actacgagga ggtggccatt gccagggcaa ccgaagagga 2761 cttctgtgaa gaagaagagg ccaaaatccg gcagagaatt ttcagaccaa caagaaaaca 2821 acgcaaagaa gagagggcct ctctcggctt ggtcacaggc tctgaaatca 
ggaagagaaa 2881 cccagaagac ttcaaaccca agggaaagct gtgggctgat gacgacagaa gtgttgacta 2941 taatgagaaa ctcaactttg aggccccacc aagcatctgg tcgcgaatag tcaactttgg 3001 ttcaggctgg ggcttctggg tctcccccag tctgtttata acatcaaccc atgtcatacc 3061 ccaaggtgca aaagagttct tcggagtccc tatcaagcaa atccagatac acaagtcagg 3121 tgaattctgc cggttgagat tcccaaagcc aatcagaact gatgtgacgg gcatgattct 3181 agaagaaggt gcgcccgagg ggaccgtggc cacactgctc atcaagagac caactggaga 3241 gctcatgcct ctggcagcca gaatggggac ccatgcaacc atgaaaattc aggggcgcac 3301 agttggaggg caaatgggta tgctcctgac aggatccaac gccaagagta tggacctagg 3361 cacaacgcca ggcgactgcg gctgccccta catctacaag agggggaatg actacgtggt 3421 cataggagtc catacggccg ctgcccgtgg aggaaacact gtcatatgtg ccacccaggg 3481 gagtgaggga gaagccacac ttgaaggagg tgacagtaaa gggacatact gtggcgcacc 3541 aatcttgggc ccagggagcg ctccgaagct cagtaccaaa actaagtttt ggagatcatc 3601 cacaacacca ctcccacctg gcacctacga accagcctac ctcggtggca aagaccctag 3661 agtcaaaggt ggcccttcat tgcaacaagt tatgagggac cagctgaagc cattcacaga 3721 acccagaggc aaaccaccaa gaccaaatgt gttggaagct gccaagaaaa ccatcatcaa 3781 tgtccttgag caaacaattg atccacccca aaaatggtca tttgcgcaag cttgcgcatc 3841 ccttgacaaa accacctcca gcggccaccc gcaccacatg cggaaaaacg actgttggaa 3901 tggggagtcc ttcacaggaa aattggctga tcaagcctcc aaggccaacc taatgtttga 3961 agagggaaag aacatgactc cagtctacac aggtgcactt aaagatgagt tggtgaagac 4021 cgataaagtt tatggtaagg tcaagaagag gcttctgtgg ggttcagatc tggcgaccat 4081 gatacggtgc gcccgagctt ttggaggcct tatggatgaa ctcaaggcgc actgtgtcac 4141 acttcctgtc agagttggta tgaacatgaa tgaggatggc cccatcatct ttgagaagca 4201 ctccagatat agatatcact atgatgctga ttattcccgg tgggactcaa cacaacaaag 4261 ggatgtgcta gcagcagcac tagaaatcat ggttaagttc tctccagaac cacacctggc 4321 ccagatagtt gcagaagacc tcctttcccc tagcgtgatg gatgtaggtg actttcaaat 4381 atcaataagt gagggtctcc cctctggggt accttgtacc tcccagtgga attccatcgc 4441 ccactggctc ctcaccctgt gtgcactctc tgaagtcacg gacctgtccc ccgatatcat 4501 tcaggccaac tcccttttct ccttctatgg tgatgatgag attgtaagca cagacataaa 
4561 attggaccca gagaagctga cagcaaagct caaggagtac gggctgaaac caacccgccc 4621 cgacaaaact gaaggacccc ttgttatctc tgaagacctg gatggcctga cattcctccg 4681 gagaactgtg acccgtgatc cagctggctg gtttggaaaa ttggaacaaa gttcaattct 4741 caggcaaatg tactggacca ggggtcccaa ccatgaagac ccatttgaaa caatgatacc 4801 acactcccaa agacccatac aattgatgtc cttgctgggc gaggctgcgc tccacggccc 4861 ggcattctat agcaaaatta gcaaattagt cattgcagag ttgaaggaag gtggcatgga 4921 tttttacgta cccagacaag agccaatgtt cagatggatg aggttctcag atctgagcac 4981 gtgggagggc gatcgcaatc tggctcccag ttttgtgaat gaagatggcg tcgagtgacg 5041 tcaacccatc tgatgggtcc gcagccaacc tcgtcccaga ggtcaacaat gaggttatgg 5101 ctctggagcc cgttgttggt gccgccattg cggcacctgt agcgggccaa caaaatgtaa 5161 ttgacccctg gattagaaat aattttgtac aagcccctgg tggagagttt acagtatccc 5221 ctagaaacgc tccaggtgaa atactatgga gcgcgccctt gggccctgat ctaaacccct 5281 acctatccca tttggccaga atgtacaatg gttatgcagg tggttttgaa gtgcaggtaa 5341 ttctcgcggg gaacgcgttc accgccggga aggtcatatt tgcagcagtc ccaccaaatt 5401 ttccaactga aggcttgagc cccagccagg tcactatgtt cccccatata gtagtagatg 5461 ttaggcaact agaacctgtg ttgattccct tacccgatgt taggaataat ttctatcatt 5521 acaatcaatc aaatgacccc accattaagt tgatagcaat gttgtacaca ccacttaggg 5581 ctaataatgc tggggatgat gtcttcacag tttcttgccg agttctcacg agaccatccc 5641 ccgattttga tttcatattt ctagtgccac ccacagttga gtcaagaact aaaccattct 5701 ctgtcccagt tttaactgtt gaggagatga ccaattcaag attccccatt cctttggaaa 5761 agttgttcac gggtcccagc agtgcctttg ttgtccaacc acaaaacggt aggtgcacga 5821 ctgatggcgt gctcctaggc accacccaac tgtctcctgt caacatctgc accttcagag 5881 gagatgtcac ccatatcaca ggtagtcgta actacacaat gaatttggct tctcaaaatt 5941 ggaacaacta tgacccaaca gaagaaatcc cagcccctct aggaactcca gactttgtgg 6001 ggaagattca aggcgtgctc acccaaacca caaggacaga tggctcaaca cgcggccaca 6061 aagccacagt gtacactggg agcgccgact ttgctccaaa actgggtaga gttcaatttg 6121 aaactgacac agaccatgat tttgaagcta accaaaacac aaagttcacc ccagttggtg 6181 tcatccaaga tggtagcacc acccaccgaa atgaacccca acagtgggtg ctcccaagtt 6241 
actcaggcag aaatactcct aatgtgcatc tggcccccgc tgtggccccc acttttccgg 6301 gcgagcaact tctcttcttc agatccacca tgcccggatg cagcgggtac cccaacatgg 6361 atttggactg tctgctcccc caggaatggg tgcagtactt ctaccaagag gcagccccag 6421 cacaatctga tgtggctctg ctaagatttg tgaatccaga cacaggtagg gttttgtttg 6481 agtgtaagct tcataaatca ggttatgtta cagtggctca cactggccaa catgatttgg 6541 ttatcccccc caatggttat tttaggtttg attcctgggt caaccagttt tacacgcttg 6601 cccccatggg aaatggaacg gggcgtagac gtgtactata atggctggag ctttctttgc 6661 tggattggca tctgatgtcc ttggctctgg acttggatcc cttatcaatg ctggggctgg 6721 ggccatcaac caaaaagttg agtttgaaaa taacagaaaa ttgcaacaag catccttcca 6781 atttagcagc aatctacaac aggcttcctt tcaacatgac aaagagatgc tccaagcaca 6841 aattgaggcc accaaaaggc tacaacagga aatgatgaaa gttaagcagg caatgctcct 6901 agagggtggg ttctctgaga cagatgcagc ccgcggggca atcaacgccc ccatgacaaa 6961 agctttggac tggagcggga caaggtactg ggctcccgat gctaggacta caacatacaa 7021 tgcaggccgc ttttccaccc ctcaaccatc gggggcactg ccaggaagag ctaatcttag 7081 ggatgctgtc cctactcggg gttcctccag taagtcttct aattcttcta ctgctacttc 7141 tgtgtactca aatcaaacca cttcaacgag acttggttct acagctggtt ctggtaccag 7201 tgtctcgagc ttcccgtcaa ctgcaaggac taggagctgg gttgaggatc aaagtaggaa 7261 tttgtcacct ttcatgaggg gggcccacaa catatcgttt gtcaccccac catctagcag 7321 atcctctagc caaggcacag tctcaaccgt gcctaaagag attttggact cctggactgg 7381 cgctttcaac acgcgcaggc agccactctt cgctcacatt cgtaagcgag gggagtcacg 7441 ggcgtaatga gaaaagacaa aattgattat ctttcttttc tttagtgtct tttaaaa //
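
The ORF coordinates listed at the top of the record (ORF1: 1..5038, ORF2: 5019..6641, ORF3: 6641..7447) can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive as in GenBank feature locations. The helper names `parse_orfs`, `span_length`, and `overlap` are hypothetical and are not part of any tool referenced by this page.

```python
import re

# Coordinate summary copied from the record (assumed 1-based, inclusive).
ORF_SUMMARY = "ORF1: 1..5038 ORF2: 5019..6641 ORF3: 6641..7447"


def parse_orfs(summary):
    """Return {name: (start, end)} from 'NAME: start..end' pairs."""
    pairs = re.findall(r"(\w+):\s*(\d+)\.\.(\d+)", summary)
    return {name: (int(start), int(end)) for name, start, end in pairs}


def span_length(start, end):
    """Length in nucleotides of an inclusive range."""
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if none)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


orfs = parse_orfs(ORF_SUMMARY)
lengths = {name: span_length(*span) for name, span in orfs.items()}
overlaps = {
    "ORF1/ORF2": overlap(orfs["ORF1"], orfs["ORF2"]),
    "ORF2/ORF3": overlap(orfs["ORF2"], orfs["ORF3"]),
}

print(lengths)   # nucleotide length of each ORF span
print(overlaps)  # shared nucleotides between neighbouring ORFs
```

Under these assumptions, the ORF2 and ORF3 spans (1623 and 807 nt) are both multiples of three, ORF1 and ORF2 share 20 nt, and ORF2 and ORF3 share a single nucleotide at position 6641. The ORF1 span is flagged as partial in the record (`<1..5038`, `codon_start=2`), so its length is not expected to be a whole number of codons.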