Typing tool

Complete norovirus genomes

MK907802  GII.2  GII.P16

Length: 4,598 bp | 3 CDS

ORF1: <1..2210 (partial)
ORF2: 2191..3819
ORF3: 3819..4598
LOCUS       MK907802                4598 bp    RNA     linear   VRL 02-NOV-2019
DEFINITION  Norovirus GII isolate G19_038 nonstructural polyprotein (ORF1)
            gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
            cds.
ACCESSION   MK907802
VERSION     MK907802.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 4598)
  AUTHORS   Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
            Guyader,S.
  TITLE     Optimisation of agnostic metagenomic approaches to characterise
            human enteric viruses in sewage
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 4598)
  AUTHORS   Le Guyader,S. and Strubbia,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..4598
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_038"
                     /isolation_source="sewage"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="11-Dec-2016"
                     /note="genotype: GII.2-GII.P16"
     gene            <1..2210
                     /gene="ORF1"
     CDS             <1..2210
                     /gene="ORF1"
                     /codon_start=3
                     /product="nonstructural polyprotein"
                     /protein_id="QCO93103.1"
                     /translation="ERATLGLVTGSEIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFE
                     APPSIWSRIVSFGSGWGFWVSPSLFITSTHVIPAGITEAFGVPIKQIQIHKSGEFCRF
                     RFPKPIRPDVTGMILEEGAPEGTVATVLIKRPTGELMPLAARMGTHATMKIQGRMVGG
                     QMGMLLTGSNAKGMDLGTTPGDCGCPYIYKRGNDYIVIGVHTAAARGGNTVICATQGS
                     EGEATLEGGDDKGTYCGAPILGPGGAPKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDP
                     RVKGGPSLQQVMRDQLKPFTEPRGKPPRPSVLEAAKQTIINVLEQTLDPPQKWTYAQA
                     CASLDKTTSSGHPYHVRKNEFWNGETFTGKLADQASKANLMFEEGKHMTPVYTAALKD
                     ELVKTEKIYGKIKKRLLWGSDLSTMIRCARSFGGLMDEMKAHCISLPVRVGMNMNEDG
                     PIIFEKHSRYKYHYDADYSRWDSTQQRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPS
                     VVDVGDFKITINEGLPSGVPCTSQWNSIAHWLLTLCALSEVTKLSPDIIQANSMFSFY
                     GDDEIVSTDIKLDPEQLTAKLKEYGLKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDP
                     AGWFGKLDQSSILRQMYWTRGPNHEDPNETMIPHSQRPIQLIVLLGEASLHGPSFYSK
                     ISKLVITELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..134
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     135..677
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     678..2207
                     /gene="ORF1"
                     /product="RdRp"
     gene            2191..3819
                     /gene="ORF2"
     CDS             2191..3819
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCO93104.1"
                     /translation="MKMASNDAAPSTDGAAGLVPESNNEVMALEPVAGAALAAPVTGQ
                     TNIIDPWIRANFVQAPNGEFTVSPRNAPGEVLLNLELGPELNPYLAHLARMYNGYAGG
                     MEVQVMLAGNAFTAGKLVFAAVPPHFPVENLSPQQITMFPHVIIDVRTLEPVLLPLPD
                     VRNNFFHYNQKDDPKMRIVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFTYLVPP
                     TVESKTKPFTLPILTLGELSNSRFPVSIDQMYTSPNEVISVQCQNGRCTLDGELQGTT
                     QLQVSGICAFKGEVTAHLHDNDHLYNVTITNLNGSPFDPSEDIPAPLGVPDFQGRVFG
                     IISQRDKHNSPGHNEPANRGHDAVVPTYTAQYTPKLGQIQIGTWQTDDLTVNQPVKFT
                     PVGLNDTEHFNQWVVPRYAGALNLNTNLAPSVAPVFPGERLLFFRSYIPLKGGYGNPA
                     IDCLLPQEWVQHFYQEAAPSMSEVALVRYINPDTGRALFEAKLHRAGFMTVSSNTSAP
                     VVVPANGYFRFDSWVNQFYSLAPMGTGNGRRRVQ"
     gene            3819..4598
                     /gene="ORF3"
     CDS             3819..4598
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCO93105.1"
                     /translation="MAGAFVAGLAGDVLSNGLSSLINAGANAINQRAEFDFNQKLQQN
                     SFNHDKEMLQAQIQATKQLQADMMAIKQGVLTAGGFSPADAARGAVNAPMTQALDWNG
                     TRYWAPNSMRTTSYSGKFTSTAPVRQADFQYTQSRPSSGSSVSSFATQSSRPTLTTTT
                     GSSHGTVSSNSTRSTSLSQSTVSRATSRTGEWVRDQNRNLEPYMHGALQTAFVTPPSS
                     RASDGTVSTVPKGVLDSWTPAFNTRRQPLFAHLRKRGESQA"
ORIGIN      
        1 aagagagggc cacattaggg ctagtaacag gttcagaaat cagaaaaaga aaccctgatg
       61 acttcaaacc caaagggaag ctgtgggccg atgacaacag gagtgttgac tacaatgaga
      121 aactggactt tgaggccccc ccaagcatat ggtctaggat tgtgagcttt ggttctggct
      181 ggggcttctg ggtgtcacca agccttttca taacgtcaac tcatgtaatc cccgcaggca
      241 taacagaagc atttggagtc cccatcaaac aaattcagat ccacaagtca ggtgaatttt
      301 gccgattcag attcccaaaa ccaattagac cagatgtgac agggatgatc ttggaagaag
      361 gtgcgcctga gggcaccgtg gcaactgtgc tcatcaaacg ccccaccgga gagctcatgc
      421 ctcttgcagc cagaatggga acacacgcaa ccatgaaaat tcaaggccgc atggttggcg
      481 gacagatggg tatgttgctc actggatcaa atgctaaagg aatggatttg ggaacaactc
      541 ctggtgactg tggctgtcct tacatctaca aaaggggcaa tgactatata gtcattgggg
      601 tgcacactgc agcagcccgt ggtggaaaca ccgtcatctg cgccacacag ggaagtgagg
      661 gtgaggcgac tcttgagggt ggagatgaca aaggaacata ctgtggggca cccattctag
      721 gccctggggg tgcaccaaaa ttgagcacca aaaccaaatt ttggaggtca tcgaacacgc
      781 cccttccacc agggacatat gaacctgcct acctcggtgg ccgtgatccg cgtgttaagg
      841 gtgggccctc cctgcagcag gtaatgagag accagttgaa gccattcact gaacccaggg
      901 gcaaacctcc aagaccaagt gtattggaag cagccaaaca aaccatcatc aatgtcctcg
      961 aacaaaccct ggaccctcca caaaaatgga catacgcaca ggcgtgtgcc tcacttgaca
     1021 aaaccacctc cagcgggcat ccctatcacg tccgaaagaa tgaattctgg aatggtgaga
     1081 ccttcactgg taaattggca gaccaagcat caaaagcaaa cctaatgttt gaggaaggga
     1141 aacacatgac accagtgtac acagcagcac tcaaggacga gctagtcaag actgagaaaa
     1201 tctatggaaa gatcaagaag agactgctct ggggctctga cttgtccacc atgatccggt
     1261 gcgctaggtc atttggtggg ctcatggacg agatgaaggc acactgcata tcactcccag
     1321 tccgagttgg catgaatatg aatgaagatg gcccaataat atttgagaaa cattccagat
     1381 acaaatatca ctatgatgca gactactctc gttgggattc aacacaacag agggcagtac
     1441 tagcagcagc cttggaaatt atggtcagat tctctgcaga accacaattg gcacaaatag
     1501 tcgctgagga tctgctggcc cctagtgtag tagatgtagg agactttaaa attacaataa
     1561 atgaagggct cccctctggt gtgccatgca cttctcaatg gaactccatc gcacactggc
     1621 tgctaactct ctgtgccttg tctgaagtca ccaaactgtc ccctgacatt atacaggcaa
     1681 attccatgtt ctcattttac ggtgatgacg agattgtcag caccgacata aaattggacc
     1741 ctgaacagtt aaccgccaaa ttgaaggagt acggcctgaa accaacccgc ccagacaaaa
     1801 ccgagggacc cctgattatc agtgaagatt tgaacggact cactttcctc cgaaggacag
     1861 tgactcgtga cccagctggc tggtttggaa aactggacca aagctcaatt ttgaggcaga
     1921 tgtactggac tagaggacca aatcatgaag accccaatga gacaatgata ccccattctc
     1981 aaagacccat acagctcatt gtactgcttg gtgaagcctc tcttcacgga ccctctttct
     2041 acagtaaaat cagtaaattg gtcataactg aactcaaaga aggtgggatg gacttttacg
     2101 tgccaaggca ggaacccatg ttcaggtgga tgaggttttc tgacttgagc acgtgggagg
     2161 gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgaatga cgccgctcca
     2221 tctactgatg gtgcagccgg cctcgtgcca gaaagtaaca atgaggtcat ggctcttgaa
     2281 cccgtggctg gtgccgcctt ggcagccccg gtcactggtc aaacaaatat tatagaccct
     2341 tggattagag caaattttgt ccaggccccc aatggtgaat ttacagtctc tccccgtaat
     2401 gcccctggtg aagtgctact gaatctagag ttgggtccag aattaaatcc ttatctggca
     2461 catttagcaa gaatgtacaa tgggtatgcc ggtgggatgg aggtgcaggt catgttggct
     2521 gggaacgcgt tcacagccgg caagttggtc ttcgccgccg tgccacctca cttcccagtt
     2581 gaaaacctta gcccacagca aatcaccatg ttccctcatg tgatcataga tgtgagaacc
     2641 ttggaacctg tcctgttacc actccctgat gttaggaata atttctttca ttataatcag
     2701 aaagatgatc ccaagatgag aattgtggct atgctttata cccccctcag gtctaatggt
     2761 tcaggtgatg atgtgtttac agtctcctgc agagtgttga ctagaccttc ccctgacttt
     2821 gacttcacat acctggtgcc accaacagtg gagtctaaaa caaaaccatt caccctccca
     2881 atcctcacac ttggggaact ttccaattcc aggttcccag tgtccataga ccagatgtat
     2941 accagcccta atgaagttat atcagtgcag tgtcaaaatg gtaggtgcac actggacggg
     3001 gagctccagg ggacaacaca actccaagtc agtggcattt gtgctttcaa aggtgaagtg
     3061 accgcccact tgcatgacaa tgatcaccta tataatgtca ccatcacaaa cttgaatggg
     3121 tccccttttg atccctccga ggacatccct gcccctctgg gtgtgcctga cttccaggga
     3181 agggtttttg gtatcatctc ccaaagggac aaacacaata gtcctgggca taatgaacca
     3241 gcaaacaggg gacacgacgc tgtggtcccc acttacacag cacagtacac tccaaaactg
     3301 ggacaaattc aaattggcac atggcagact gacgacctta cagtcaacca accagtcaaa
     3361 ttcaccccag ttggactcaa tgacactgaa cattttaacc aatgggtggt ccccaggtat
     3421 gctggtgccc taaacctcaa tacaaacctt gccccttctg ttgctccagt gtttccagga
     3481 gagcgcctgc tcttcttcag atcatacatt cccctcaagg gcggttatgg aaacccagcc
     3541 attgattgcc tactaccaca agagtgggtg caacacttct atcaggaagc agccccttca
     3601 atgagtgagg tggccctcgt cagatacatc aacccggaca ctggtcgggc actgtttgag
     3661 gccaagctcc acagagctgg tttcatgaca gtctcgagca acaccagtgc cccggtggtt
     3721 gtgcctgcca acggatactt cagatttgat tcttgggtga accaattcta ttctctcgcc
     3781 cccatgggaa ctgggaatgg gcgtagaagg gttcaataat ggctggagct tttgtagctg
     3841 gtcttgcagg ggacgtgctc agcaatgggc ttagttcatt gatcaatgca ggtgctaatg
     3901 caataaatca gagggcagaa tttgatttta atcagaaatt gcagcaaaat tcttttaatc
     3961 atgataagga gatgttgcag gctcagattc aggcaactaa gcagctgcag gcagacatga
     4021 tggctataaa acaaggggtc ttgaccgctg gcggcttttc ccctgctgat gcagccagag
     4081 gtgctgtgaa cgcgcccatg acacaagcgc tggattggaa tggtacaagg tattgggcac
     4141 caaactccat gaggaccaca tcttattctg gaaaattcac atcaaccgcc ccagtgaggc
     4201 aggccgactt ccagtacacc caaagccggc cttcgagtgg ctcctctgtg tcttcctttg
     4261 ccactcagtc ttcaaggcca actctgacca caaccactgg ttcctcacat ggcacagtct
     4321 catcaaattc aactcgcagc acaagcctct cccaatcaac ggtctccaga gctacatcta
     4381 ggactggtga gtgggttaga gatcaaaata gaaatttgga accctacatg catggtgcct
     4441 tacagacagc ctttgtcacc ccaccttcca gcagggcatc tgacgggaca gtctcaaccg
     4501 tcccgaaagg tgttttggac tcctggacac ctgcgtttaa cacccgcagg cagccgcttt
     4561 ttgcacacct ccgtaagagg ggggagtcac aagcttag
//
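
The CDS coordinates in the FEATURES table above can be sanity-checked with a little arithmetic. The following is a minimal sketch, not part of the record or of the typing tool: the coordinates and `codon_start` values are copied by hand from the CDS features, and the names `CDS`, `coding_length`, `codon_count`, and `overlap` are hypothetical helpers introduced here for illustration. It assumes 1-based inclusive coordinates and that each CDS ends with a stop codon, which is why every codon count is one more than the number of residues in the matching `/translation`.

```python
# Coordinates copied by hand from the CDS features of MK907802 (1-based, inclusive).
# codon_start is the 1-based position, within the feature, of the first complete codon.
CDS = {
    "ORF1": {"start": 1, "end": 2210, "codon_start": 3},     # 5' partial (<1..2210)
    "ORF2": {"start": 2191, "end": 3819, "codon_start": 1},
    "ORF3": {"start": 3819, "end": 4598, "codon_start": 1},
}


def coding_length(start, end, codon_start):
    """Nucleotides from the first complete codon to the end of the CDS."""
    return end - (start + codon_start - 1) + 1


def codon_count(start, end, codon_start):
    """Number of codons (stop codon included); fails if the frame is broken."""
    length = coding_length(start, end, codon_start)
    if length % 3 != 0:
        raise ValueError(f"{length} nt is not a whole number of codons")
    return length // 3


def overlap(a, b):
    """Nucleotides shared by two inclusive ranges (0 if they do not overlap)."""
    return max(0, min(a["end"], b["end"]) - max(a["start"], b["start"]) + 1)


codons = {name: codon_count(**feature) for name, feature in CDS.items()}

print(codons)                               # {'ORF1': 736, 'ORF2': 543, 'ORF3': 260}
print(overlap(CDS["ORF1"], CDS["ORF2"]))    # 20
print(overlap(CDS["ORF2"], CDS["ORF3"]))    # 1
```

Under these assumptions all three CDS are in frame, ORF1 and ORF2 share 20 nt (2191..2210), and ORF2 and ORF3 share the single nucleotide at position 3819.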