Complete norovirus genomes

MK907785  GII.4 Sydney | GII.P31

Length: 7,497 bp | 3 CDS

ORF1: <1..5038
ORF2: 5019..6641
ORF3: 6641..7447
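
The coordinates above imply that neighbouring ORFs overlap. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates as used in GenBank feature locations. The names `ORFS`, `span_length` and `overlap` are illustrative and not part of the typing tool.

```python
# ORF coordinates copied from the summary above (1-based, inclusive).
ORFS = {
    "ORF1": (1, 5038),
    "ORF2": (5019, 6641),
    "ORF3": (6641, 7447),
}


def span_length(start, end):
    """Number of nucleotides covered by an inclusive start..end span."""
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive spans (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


lengths = {name: span_length(*coords) for name, coords in ORFS.items()}
orf1_orf2_overlap = overlap(ORFS["ORF1"], ORFS["ORF2"])
orf2_orf3_overlap = overlap(ORFS["ORF2"], ORFS["ORF3"])

if __name__ == "__main__":
    for name, length in lengths.items():
        print(f"{name}: {length} nt")
    print(f"ORF1/ORF2 overlap: {orf1_orf2_overlap} nt")
    print(f"ORF2/ORF3 overlap: {orf2_orf3_overlap} nt")
```

Because the record annotates ORF1 as partial at the 5' end, the ORF1 figure is the length of the sequenced portion only, not of the full gene.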
LOCUS       MK907785                7497 bp    RNA     linear   VRL 02-NOV-2019
DEFINITION  Norovirus GII isolate G19_014 nonstructural polyprotein (ORF1)
            gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
            cds.
ACCESSION   MK907785
VERSION     MK907785.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7497)
  AUTHORS   Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
            Guyader,S.
  TITLE     Optimisation of agnostic metagenomic approaches to characterise
            human enteric viruses in sewage
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7497)
  AUTHORS   Le Guyader,S. and Strubbia,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7497
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_014"
                     /isolation_source="sewage"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="08-Jan-2014"
                     /note="genotype: GII.4-GII.Pe"
     gene            <1..5038
                     /gene="ORF1"
     CDS             <1..5038
                     /gene="ORF1"
                     /codon_start=2
                     /product="nonstructural polyprotein"
                     /protein_id="QCO93054.1"
                     /translation="KSSSDGVLSSMAVTFKRALGARPKQPPPKEIPPRPPRPPTPDLV
                     KKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPEETNTAFSVPPLNQRESRDAKE
                     PLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELTPLSLFWRPVYTPQY
                     LISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWN
                     RKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIV
                     ESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTK
                     EKIGKMLSSAASTLRACKDLGAYGLEILKLIMKWFFPKKEEANELAMVRSIEDAVLDL
                     EAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINSLLARIAAA
                     RSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNG
                     VDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKVFDSDAI
                     IITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFS
                     HIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQGPALTTF
                     NFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNG
                     GSYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKCVQEALYSIIQI
                     AGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGCLKPKDDEEFVVSSDDIKTEGK
                     KGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIA
                     RATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWA
                     DDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVP
                     IKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRPTGELMPLAARM
                     GTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTA
                     AARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWRSSTTPL
                     PPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVL
                     EQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASKANLMFE
                     EGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHC
                     VTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPE
                     PHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTD
                     LSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVISED
                     LDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMS
                     LLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLA
                     PSFVNEDGVE"
     mat_peptide     <1..928
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     929..2026
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2027..2563
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2564..2962
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     2963..3505
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3506..5035
                     /gene="ORF1"
                     /product="RdRp"
     gene            5019..6641
                     /gene="ORF2"
     CDS             5019..6641
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCO93055.1"
                     /translation="MKMASSDVNPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVL"
     gene            6641..7447
                     /gene="ORF3"
     CDS             6641..7447
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCO93056.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPTRGSSSKSS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN      
        1 aaaatcttca agtgacggtg tgctttctag catggctgtc acttttaagc gggccctcgg
       61 ggcgcggcct aaacagccgc ccccgaagga gataccaccc agacccccgc gaccacccac
      121 gccagacttg gttaaaaaga tccctcctcc cccacccaac ggggaggatg aactagtggt
      181 ctcttacagc gccaaagatg gcgtttccgg actgcctgag ctcaccactg tcagacaacc
      241 ggaagaaacc aacacggcgt tcagtgtccc cccactcaac caaagggaga gcagggacgc
      301 caaggagcca ctaactggaa caattattga aatgtgggat ggagaaatct accattacgg
      361 cctgtacgtg gaacgaggtc ttatacttgg tgtgcacaag ccaccggcag ccatcagcct
      421 tgccaaggtc gagctaacac cgctctcttt gttctggaga cctgtataca ccccccagta
      481 tctcatctct ccagacactc ttaggagatt acatggagag tcattcccct acactgcatt
      541 tgacaacaat tgctacgcct tttgttgttg ggtattagac ctaaacgact catggctaag
      601 caggagaatg attcagagaa caacaggctt cttcaggccg taccaggatt ggaacaggaa
      661 acccctcccc actatggatg attccaaatt aaagaaggta gccaacatat tcttgtgcac
      721 tttgtcttca ctattcacca gacccattaa ggacataata gggaagttga aacctcttaa
      781 catactcaac attctggcca catgtgattg gaccttcgca ggcatagtgg aatccttaat
      841 actcttggca gaactctttg gagttttctg gacaccccca gatgtgtctg cgatgatcgc
      901 ccccttgcta ggtgattatg aactgcaagg acctgaggac cttgcagtgg aactggtccc
      961 aatagtgatg ggggggatag gtttggtgct aggatttacc aaagagaaaa tcggaaagat
     1021 gctatcatcc gctgcatcca ctttaagagc ttgtaaagac cttggtgcat acggactgga
     1081 aatcttaaaa ttgatcatga agtggttctt cccaaagaag gaggaagcaa atgaactggc
     1141 tatggtgaga tccatcgagg atgcagtact agacctcgag gcaattgaaa acaaccacat
     1201 gaccaccctg ctcaaagaca aagacagctt ggcaacctac atgagaaccc ttgaccttga
     1261 ggaggagaaa gccagaaaac tctcaaccaa atctgcttca cccgatattg tgggcacaat
     1321 caactctctt ctggcaagaa tcgctgctgc acgctcccta gtgcatcggg cgaaagaaga
     1381 gctctccagc aggccgagac ctgtcgttgt gatgatatcg ggaagaccag ggatagggaa
     1441 aactcacctt gccagggagc tggccaagaa gatcgcggcc tccctcacag gggaccagcg
     1501 tgtgggtctt atcccacgca atggtgtcga ccactgggac gcatacaagg gcgaaagagt
     1561 tgtcctatgg gacgactatg gaatgagcaa ccccatccat gatgccctca ggctgcagga
     1621 gcttgctgac acttgccccc tcacgctaaa ttgtgacaga attgagaaca aagggaaagt
     1681 ctttgacagt gatgccataa ttatcaccac caacctggcc aacccagcac cactggatta
     1741 tgtcaacttt gaagcgtgct cgagacgcat tgatttcctc gtgtacgcag aagcccctga
     1801 ggtggagaag gcaaagcgcg acttcccagg tcaacctgac atgtggaaga acgctttcag
     1861 tcctgacttc tcacacataa aactgtcatt ggctccacag ggtggttttg acaagaacgg
     1921 caacaccccg catggaaaag gggtcatgaa gaccctcacc actggctccc tcatcgcccg
     1981 agcatcaggg ttactccatg agaggctaga tgaatatgaa ctgcaaggcc cagccctcac
     2041 cactttcaac tttgaccgca acaagatact tgcttttaga cagcttgctg ctgaaaacaa
     2101 gtatgggctg atggacacaa tgagagttgg aaaacagctc aaggatgtca agaccatgtc
     2161 agacctcaaa caagcactca agaacatcgc gatcaagaag tgccagatag tgtacaatgg
     2221 tggctcctac acacttgagg ctgatggcaa gggtagtgtg aaagttgaca aagtgcaaag
     2281 tgccactgtg cagaccaaca atgaactagc cggtgcccta caccacctaa ggtgcgctag
     2341 aatcagatac tatgttaagt gcgtccagga ggcgctgtat tccatcatcc aaatcgctgg
     2401 ggctgcgttc gtcaccacgc gcatcgctaa gcgcatgaat atacagaatc tctggtccaa
     2461 gccacaggtg gaagacacag aagagatggc caacaaagat ggttgcctaa aacccaaaga
     2521 tgatgaagag tttgtcgtct catccgacga catcaaaact gagggcaaga aagggaaaaa
     2581 caagtccggc cgtggcaaga agcacacagc cttttcaagt aaagggctca gtgatgagga
     2641 gtacgatgag tacaagagaa tcagagaaga aaggaatggt aagtactcca tagaagagta
     2701 ccttcaggac agagacaggt actacgagga ggtggccatt gccagggcaa ccgaagagga
     2761 cttctgtgaa gaagaagagg ccaaaatccg gcagagaatt ttcagaccaa caagaaaaca
     2821 acgcaaagaa gagagggcct ctctcggctt ggtcacaggc tctgaaatca ggaagagaaa
     2881 cccagaagac ttcaaaccca agggaaagct gtgggctgat gacgacagaa gtgttgacta
     2941 taatgagaaa ctcaactttg aggccccacc aagcatctgg tcgcgaatag tcaactttgg
     3001 ttcaggctgg ggcttctggg tctcccccag tctgtttata acatcaaccc atgtcatacc
     3061 ccaaggtgca aaagagttct tcggagtccc tatcaagcaa atccagatac acaagtcagg
     3121 tgaattctgc cggttgagat tcccaaagcc aatcagaact gatgtgacgg gcatgattct
     3181 agaagaaggt gcgcccgagg ggaccgtggc cacactgctc atcaagagac caactggaga
     3241 gctcatgcct ctggcagcca gaatggggac ccatgcaacc atgaaaattc aggggcgcac
     3301 agttggaggg caaatgggta tgctcctgac aggatccaac gccaagagta tggacctagg
     3361 cacaacgcca ggcgactgcg gctgccccta catctacaag agggggaatg actacgtggt
     3421 cataggagtc catacggccg ctgcccgtgg aggaaacact gtcatatgtg ccacccaggg
     3481 gagtgaggga gaagccacac ttgaaggagg tgacagtaaa gggacatact gtggcgcacc
     3541 aatcttgggc ccagggagcg ctccgaagct cagtaccaaa actaagtttt ggagatcatc
     3601 cacaacacca ctcccacctg gcacctacga accagcctac ctcggtggca aagaccctag
     3661 agtcaaaggt ggcccttcat tgcaacaagt tatgagggac cagctgaagc cattcacaga
     3721 acccagaggc aaaccaccaa gaccaaatgt gttggaagct gccaagaaaa ccatcatcaa
     3781 tgtccttgag caaacaattg atccacccca aaaatggtca tttgcgcaag cttgcgcatc
     3841 ccttgacaaa accacctcca gcggccaccc gcaccacatg cggaaaaacg actgttggaa
     3901 tggggagtcc ttcacaggaa aattggctga tcaagcctcc aaggccaacc taatgtttga
     3961 agagggaaag aacatgactc cagtctacac aggtgcactt aaagatgagt tggtgaagac
     4021 cgataaagtt tatggtaagg tcaagaagag gcttctgtgg ggttcagatc tggcgaccat
     4081 gatacggtgc gcccgagctt ttggaggcct tatggatgaa ctcaaggcgc actgtgtcac
     4141 acttcctgtc agagttggta tgaacatgaa tgaggatggc cccatcatct ttgagaagca
     4201 ctccagatat agatatcact atgatgctga ttattcccgg tgggactcaa cacaacaaag
     4261 ggatgtgcta gcagcagcac tagaaatcat ggttaagttc tctccagaac cacacctggc
     4321 ccagatagtt gcagaagacc tcctttcccc tagcgtgatg gatgtaggtg actttcaaat
     4381 atcaataagt gagggtctcc cctctggggt accttgtacc tcccagtgga attccatcgc
     4441 ccactggctc ctcaccctgt gtgcactctc tgaagtcacg gacctgtccc ccgatatcat
     4501 tcaggccaac tcccttttct ccttctatgg tgatgatgag attgtaagca cagacataaa
     4561 attggaccca gagaagctga cagcaaagct caaggagtac gggctgaaac caacccgccc
     4621 cgacaaaact gaaggacccc ttgttatctc tgaagacctg gatggcctga cattcctccg
     4681 gagaactgtg acccgtgatc cagctggctg gtttggaaaa ttggaacaaa gttcaattct
     4741 caggcaaatg tactggacca ggggtcccaa ccatgaagac ccatttgaaa caatgatacc
     4801 acactcccaa agacccatac aattgatgtc cttgctgggc gaggctgcgc tccacggccc
     4861 ggcattctat agcaaaatta gcaaattagt cattgcagag ttgaaggaag gtggcatgga
     4921 tttttacgta cccagacaag agccaatgtt cagatggatg aggttctcag atctgagcac
     4981 gtgggagggc gatcgcaatc tggctcccag ttttgtgaat gaagatggcg tcgagtgacg
     5041 tcaacccatc tgatgggtcc gcagccaacc tcgtcccaga ggtcaacaat gaggttatgg
     5101 ctctggagcc cgttgttggt gccgccattg cggcacctgt agcgggccaa caaaatgtaa
     5161 ttgacccctg gattagaaat aattttgtac aagcccctgg tggagagttt acagtatccc
     5221 ctagaaacgc tccaggtgaa atactatgga gcgcgccctt gggccctgat ctaaacccct
     5281 acctatccca tttggccaga atgtacaatg gttatgcagg tggttttgaa gtgcaggtaa
     5341 ttctcgcggg gaacgcgttc accgccggga aggtcatatt tgcagcagtc ccaccaaatt
     5401 ttccaactga aggcttgagc cccagccagg tcactatgtt cccccatata gtagtagatg
     5461 ttaggcaact agaacctgtg ttgattccct tacccgatgt taggaataat ttctatcatt
     5521 acaatcaatc aaatgacccc accattaagt tgatagcaat gttgtacaca ccacttaggg
     5581 ctaataatgc tggggatgat gtcttcacag tttcttgccg agttctcacg agaccatccc
     5641 ccgattttga tttcatattt ctagtgccac ccacagttga gtcaagaact aaaccattct
     5701 ctgtcccagt tttaactgtt gaggagatga ccaattcaag attccccatt cctttggaaa
     5761 agttgttcac gggtcccagc agtgcctttg ttgtccaacc acaaaacggt aggtgcacga
     5821 ctgatggcgt gctcctaggc accacccaac tgtctcctgt caacatctgc accttcagag
     5881 gagatgtcac ccatatcaca ggtagtcgta actacacaat gaatttggct tctcaaaatt
     5941 ggaacaacta tgacccaaca gaagaaatcc cagcccctct aggaactcca gactttgtgg
     6001 ggaagattca aggcgtgctc acccaaacca caaggacaga tggctcaaca cgcggccaca
     6061 aagccacagt gtacactggg agcgccgact ttgctccaaa actgggtaga gttcaatttg
     6121 aaactgacac agaccatgat tttgaagcta accaaaacac aaagttcacc ccagttggtg
     6181 tcatccaaga tggtagcacc acccaccgaa atgaacccca acagtgggtg ctcccaagtt
     6241 actcaggcag aaatactcct aatgtgcatc tggcccccgc tgtggccccc acttttccgg
     6301 gcgagcaact tctcttcttc agatccacca tgcccggatg cagcgggtac cccaacatgg
     6361 atttggactg tctgctcccc caggaatggg tgcagtactt ctaccaagag gcagccccag
     6421 cacaatctga tgtggctctg ctaagatttg tgaatccaga cacaggtagg gttttgtttg
     6481 agtgtaagct tcataaatca ggttatgtta cagtggctca cactggccaa catgatttgg
     6541 ttatcccccc caatggttat tttaggtttg attcctgggt caaccagttt tacacgcttg
     6601 cccccatggg aaatggaacg gggcgtagac gtgtactata atggctggag ctttctttgc
     6661 tggattggca tctgatgtcc ttggctctgg acttggatcc cttatcaatg ctggggctgg
     6721 ggccatcaac caaaaagttg agtttgaaaa taacagaaaa ttgcaacaag catccttcca
     6781 atttagcagc aatctacaac aggcttcctt tcaacatgac aaagagatgc tccaagcaca
     6841 aattgaggcc accaaaaggc tacaacagga aatgatgaaa gttaagcagg caatgctcct
     6901 agagggtggg ttctctgaga cagatgcagc ccgcggggca atcaacgccc ccatgacaaa
     6961 agctttggac tggagcggga caaggtactg ggctcccgat gctaggacta caacatacaa
     7021 tgcaggccgc ttttccaccc ctcaaccatc gggggcactg ccaggaagag ctaatcttag
     7081 ggatgctgtc cctactcggg gttcctccag taagtcttct aattcttcta ctgctacttc
     7141 tgtgtactca aatcaaacca cttcaacgag acttggttct acagctggtt ctggtaccag
     7201 tgtctcgagc ttcccgtcaa ctgcaaggac taggagctgg gttgaggatc aaagtaggaa
     7261 tttgtcacct ttcatgaggg gggcccacaa catatcgttt gtcaccccac catctagcag
     7321 atcctctagc caaggcacag tctcaaccgt gcctaaagag attttggact cctggactgg
     7381 cgctttcaac acgcgcaggc agccactctt cgctcacatt cgtaagcgag gggagtcacg
     7441 ggcgtaatga gaaaagacaa aattgattat ctttcttttc tttagtgtct tttaaaa
//