Typing tool
|
Complete norovirus genomes
MK907802 | GII.2 | ||
---|---|---|---|
GII.P16 |
ORF1: 1..2210 ORF2: 2191..3819 ORF3: 3819..4598LOCUS MK907802 4598 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_038 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK907802 VERSION MK907802.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 4598) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 4598) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..4598 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_038" /isolation_source="sewage" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="11-Dec-2016" /note="genotype: GII.2-GII.P16" gene <1..2210 /gene="ORF1" CDS <1..2210 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QCO93103.1" /translation="ERATLGLVTGSEIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFE APPSIWSRIVSFGSGWGFWVSPSLFITSTHVIPAGITEAFGVPIKQIQIHKSGEFCRF RFPKPIRPDVTGMILEEGAPEGTVATVLIKRPTGELMPLAARMGTHATMKIQGRMVGG QMGMLLTGSNAKGMDLGTTPGDCGCPYIYKRGNDYIVIGVHTAAARGGNTVICATQGS EGEATLEGGDDKGTYCGAPILGPGGAPKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDP RVKGGPSLQQVMRDQLKPFTEPRGKPPRPSVLEAAKQTIINVLEQTLDPPQKWTYAQA CASLDKTTSSGHPYHVRKNEFWNGETFTGKLADQASKANLMFEEGKHMTPVYTAALKD ELVKTEKIYGKIKKRLLWGSDLSTMIRCARSFGGLMDEMKAHCISLPVRVGMNMNEDG PIIFEKHSRYKYHYDADYSRWDSTQQRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPS VVDVGDFKITINEGLPSGVPCTSQWNSIAHWLLTLCALSEVTKLSPDIIQANSMFSFY GDDEIVSTDIKLDPEQLTAKLKEYGLKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDP AGWFGKLDQSSILRQMYWTRGPNHEDPNETMIPHSQRPIQLIVLLGEASLHGPSFYSK ISKLVITELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..134 /gene="ORF1" /product="VPg" mat_peptide 135..677 /gene="ORF1" /product="Pro" mat_peptide 678..2207 /gene="ORF1" /product="RdRp" gene 2191..3819 /gene="ORF2" CDS 2191..3819 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93104.1" /translation="MKMASNDAAPSTDGAAGLVPESNNEVMALEPVAGAALAAPVTGQ TNIIDPWIRANFVQAPNGEFTVSPRNAPGEVLLNLELGPELNPYLAHLARMYNGYAGG MEVQVMLAGNAFTAGKLVFAAVPPHFPVENLSPQQITMFPHVIIDVRTLEPVLLPLPD VRNNFFHYNQKDDPKMRIVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFTYLVPP TVESKTKPFTLPILTLGELSNSRFPVSIDQMYTSPNEVISVQCQNGRCTLDGELQGTT QLQVSGICAFKGEVTAHLHDNDHLYNVTITNLNGSPFDPSEDIPAPLGVPDFQGRVFG IISQRDKHNSPGHNEPANRGHDAVVPTYTAQYTPKLGQIQIGTWQTDDLTVNQPVKFT PVGLNDTEHFNQWVVPRYAGALNLNTNLAPSVAPVFPGERLLFFRSYIPLKGGYGNPA IDCLLPQEWVQHFYQEAAPSMSEVALVRYINPDTGRALFEAKLHRAGFMTVSSNTSAP VVVPANGYFRFDSWVNQFYSLAPMGTGNGRRRVQ" gene 3819..4598 /gene="ORF3" CDS 3819..4598 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93105.1" /translation="MAGAFVAGLAGDVLSNGLSSLINAGANAINQRAEFDFNQKLQQN SFNHDKEMLQAQIQATKQLQADMMAIKQGVLTAGGFSPADAARGAVNAPMTQALDWNG TRYWAPNSMRTTSYSGKFTSTAPVRQADFQYTQSRPSSGSSVSSFATQSSRPTLTTTT GSSHGTVSSNSTRSTSLSQSTVSRATSRTGEWVRDQNRNLEPYMHGALQTAFVTPPSS RASDGTVSTVPKGVLDSWTPAFNTRRQPLFAHLRKRGESQA" ORIGIN 1 aagagagggc cacattaggg ctagtaacag gttcagaaat cagaaaaaga aaccctgatg 61 acttcaaacc caaagggaag ctgtgggccg atgacaacag gagtgttgac tacaatgaga 121 aactggactt tgaggccccc ccaagcatat ggtctaggat tgtgagcttt ggttctggct 181 ggggcttctg ggtgtcacca agccttttca taacgtcaac tcatgtaatc cccgcaggca 241 taacagaagc atttggagtc cccatcaaac aaattcagat ccacaagtca ggtgaatttt 301 gccgattcag attcccaaaa ccaattagac cagatgtgac agggatgatc ttggaagaag 361 gtgcgcctga gggcaccgtg gcaactgtgc tcatcaaacg ccccaccgga gagctcatgc 421 ctcttgcagc cagaatggga acacacgcaa ccatgaaaat tcaaggccgc atggttggcg 481 gacagatggg tatgttgctc actggatcaa atgctaaagg aatggatttg ggaacaactc 541 ctggtgactg tggctgtcct tacatctaca aaaggggcaa tgactatata gtcattgggg 601 tgcacactgc agcagcccgt ggtggaaaca ccgtcatctg cgccacacag ggaagtgagg 661 gtgaggcgac tcttgagggt ggagatgaca aaggaacata ctgtggggca cccattctag 721 gccctggggg tgcaccaaaa ttgagcacca aaaccaaatt ttggaggtca tcgaacacgc 781 cccttccacc agggacatat gaacctgcct acctcggtgg ccgtgatccg cgtgttaagg 841 gtgggccctc cctgcagcag gtaatgagag accagttgaa gccattcact gaacccaggg 901 gcaaacctcc aagaccaagt gtattggaag cagccaaaca aaccatcatc aatgtcctcg 961 aacaaaccct ggaccctcca caaaaatgga catacgcaca ggcgtgtgcc tcacttgaca 1021 aaaccacctc cagcgggcat ccctatcacg tccgaaagaa tgaattctgg aatggtgaga 1081 ccttcactgg taaattggca gaccaagcat caaaagcaaa cctaatgttt gaggaaggga 1141 aacacatgac accagtgtac acagcagcac tcaaggacga gctagtcaag actgagaaaa 1201 tctatggaaa gatcaagaag agactgctct ggggctctga cttgtccacc atgatccggt 1261 gcgctaggtc atttggtggg ctcatggacg agatgaaggc acactgcata tcactcccag 1321 tccgagttgg catgaatatg aatgaagatg gcccaataat atttgagaaa cattccagat 1381 acaaatatca ctatgatgca gactactctc gttgggattc aacacaacag agggcagtac 1441 tagcagcagc cttggaaatt atggtcagat tctctgcaga accacaattg gcacaaatag 1501 tcgctgagga tctgctggcc cctagtgtag tagatgtagg agactttaaa attacaataa 1561 atgaagggct cccctctggt gtgccatgca cttctcaatg gaactccatc gcacactggc 1621 tgctaactct ctgtgccttg tctgaagtca ccaaactgtc ccctgacatt atacaggcaa 1681 attccatgtt ctcattttac ggtgatgacg agattgtcag caccgacata aaattggacc 1741 ctgaacagtt aaccgccaaa ttgaaggagt acggcctgaa accaacccgc ccagacaaaa 1801 ccgagggacc cctgattatc agtgaagatt tgaacggact cactttcctc cgaaggacag 1861 tgactcgtga cccagctggc tggtttggaa aactggacca aagctcaatt ttgaggcaga 1921 tgtactggac tagaggacca aatcatgaag accccaatga gacaatgata ccccattctc 1981 aaagacccat acagctcatt gtactgcttg gtgaagcctc tcttcacgga ccctctttct 2041 acagtaaaat cagtaaattg gtcataactg aactcaaaga aggtgggatg gacttttacg 2101 tgccaaggca ggaacccatg ttcaggtgga tgaggttttc tgacttgagc acgtgggagg 2161 gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgaatga cgccgctcca 2221 tctactgatg gtgcagccgg cctcgtgcca gaaagtaaca atgaggtcat ggctcttgaa 2281 cccgtggctg gtgccgcctt ggcagccccg gtcactggtc aaacaaatat tatagaccct 2341 tggattagag caaattttgt ccaggccccc aatggtgaat ttacagtctc tccccgtaat 2401 gcccctggtg aagtgctact gaatctagag ttgggtccag aattaaatcc ttatctggca 2461 catttagcaa gaatgtacaa tgggtatgcc ggtgggatgg aggtgcaggt catgttggct 2521 gggaacgcgt tcacagccgg caagttggtc ttcgccgccg tgccacctca cttcccagtt 2581 gaaaacctta gcccacagca aatcaccatg ttccctcatg tgatcataga tgtgagaacc 2641 ttggaacctg tcctgttacc actccctgat gttaggaata atttctttca ttataatcag 2701 aaagatgatc ccaagatgag aattgtggct atgctttata cccccctcag gtctaatggt 2761 tcaggtgatg atgtgtttac agtctcctgc agagtgttga ctagaccttc ccctgacttt 2821 gacttcacat acctggtgcc accaacagtg gagtctaaaa caaaaccatt caccctccca 2881 atcctcacac ttggggaact ttccaattcc aggttcccag tgtccataga ccagatgtat 2941 accagcccta atgaagttat atcagtgcag tgtcaaaatg gtaggtgcac actggacggg 3001 gagctccagg ggacaacaca actccaagtc agtggcattt gtgctttcaa aggtgaagtg 3061 accgcccact tgcatgacaa tgatcaccta tataatgtca ccatcacaaa cttgaatggg 3121 tccccttttg atccctccga ggacatccct gcccctctgg gtgtgcctga cttccaggga 3181 agggtttttg gtatcatctc ccaaagggac aaacacaata gtcctgggca taatgaacca 3241 gcaaacaggg gacacgacgc tgtggtcccc acttacacag cacagtacac tccaaaactg 3301 ggacaaattc aaattggcac atggcagact gacgacctta cagtcaacca accagtcaaa 3361 ttcaccccag ttggactcaa tgacactgaa cattttaacc aatgggtggt ccccaggtat 3421 gctggtgccc taaacctcaa tacaaacctt gccccttctg ttgctccagt gtttccagga 3481 gagcgcctgc tcttcttcag atcatacatt cccctcaagg gcggttatgg aaacccagcc 3541 attgattgcc tactaccaca agagtgggtg caacacttct atcaggaagc agccccttca 3601 atgagtgagg tggccctcgt cagatacatc aacccggaca ctggtcgggc actgtttgag 3661 gccaagctcc acagagctgg tttcatgaca gtctcgagca acaccagtgc cccggtggtt 3721 gtgcctgcca acggatactt cagatttgat tcttgggtga accaattcta ttctctcgcc 3781 cccatgggaa ctgggaatgg gcgtagaagg gttcaataat ggctggagct tttgtagctg 3841 gtcttgcagg ggacgtgctc agcaatgggc ttagttcatt gatcaatgca ggtgctaatg 3901 caataaatca gagggcagaa tttgatttta atcagaaatt gcagcaaaat tcttttaatc 3961 atgataagga gatgttgcag gctcagattc aggcaactaa gcagctgcag gcagacatga 4021 tggctataaa acaaggggtc ttgaccgctg gcggcttttc ccctgctgat gcagccagag 4081 gtgctgtgaa cgcgcccatg acacaagcgc tggattggaa tggtacaagg tattgggcac 4141 caaactccat gaggaccaca tcttattctg gaaaattcac atcaaccgcc ccagtgaggc 4201 aggccgactt ccagtacacc caaagccggc cttcgagtgg ctcctctgtg tcttcctttg 4261 ccactcagtc ttcaaggcca actctgacca caaccactgg ttcctcacat ggcacagtct 4321 catcaaattc aactcgcagc acaagcctct cccaatcaac ggtctccaga gctacatcta 4381 ggactggtga gtgggttaga gatcaaaata gaaatttgga accctacatg catggtgcct 4441 tacagacagc ctttgtcacc ccaccttcca gcagggcatc tgacgggaca gtctcaaccg 4501 tcccgaaagg tgttttggac tcctggacac ctgcgtttaa cacccgcagg cagccgcttt 4561 ttgcacacct ccgtaagagg ggggagtcac aagcttag //