Typing tool

Complete norovirus genomes

MK907797  GII.4 Sydney
 GII.P31

Length: 6,976 | 3 CDS

ORF1: 1..4677
ORF2: 4658..6280
ORF3: 6280..6976
LOCUS       MK907797                6976 bp    RNA     linear   VRL 02-NOV-2019
DEFINITION  Norovirus GII isolate G19_033 nonstructural polyprotein (ORF1)
            gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3)
            gene, partial cds.
ACCESSION   MK907797
VERSION     MK907797.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 6976)
  AUTHORS   Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
            Guyader,S.
  TITLE     Optimisation of agnostic metagenomic approaches to characterise
            human enteric viruses in sewage
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 6976)
  AUTHORS   Le Guyader,S. and Strubbia,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..6976
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_033"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="13-May-2014"
                     /note="genotype: GII.4-GII.Pe"
     gene            <1..4677
                     /gene="ORF1"
     CDS             <1..4677
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QCO93088.1"
                     /translation="LHVERGLILGVHKPPAAISLAKVELAPLSLFWRPVYTPQYLISP
                     DTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWNRKPL
                     PTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIVESLI
                     LLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTKEKIG
                     KMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELAMVRSIEDAVLDLEAIE
                     NNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINSLLARIAAARSLV
                     HRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNGVDHW
                     DAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKVFDSDAIIITT
                     NLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFSHIKL
                     SLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQGPALTTFNFDR
                     NKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNGGTYT
                     LEADGKGSVKVDRVQSATVQTNNELAGALHHLRCARIRYYVKCVQEALYSIIQIAGAA
                     FVTTRIAKRMNIQNLWSKPQVEDTEETANKDGCLKPKDDEEFVVSSDDIKTEGKKGKN
                     KSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIARATE
                     EDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWADDDR
                     SVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVPIKQI
                     QIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTLLIKRPTGELMPLAARMGTHA
                     TMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTAAARG
                     GNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWRSSTTPLPPGT
                     YEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVLEQTI
                     DPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASKANLMFEEGKN
                     MTPVYTGALKDELVKTEKVYGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHCVTLP
                     VRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPEPHLA
                     QTVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTDLSPD
                     IIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVISEDLDGL
                     TFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMSLLGE
                     AALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFV
                     NEDGVE"
     mat_peptide     <1..567
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     568..1665
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     1666..2202
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2203..2601
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     2602..3144
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3145..4674
                     /gene="ORF1"
                     /product="RdRp"
     gene            4658..6280
                     /gene="ORF2"
     CDS             4658..6280
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCO93089.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPCQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEVNQNTKFTPVGVIQDG
                     GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
     gene            6280..>6976
                     /gene="ORF3"
     CDS             6280..>6976
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCO93090.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEYENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKPS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQG"
ORIGIN      
        1 ctgcacgtgg aacgaggtct tatacttggt gtgcacaaac caccggcagc cattagcctt
       61 gccaaggtcg agctagcacc gctctctttg ttctggagac ccgtatacac cccacagtat
      121 ctcatctctc cagacactct taggagatta catggagagt cgttccccta cactgcattc
      181 gacaacaatt gctacgcctt ttgttgttgg gtattagacc taaacgactc atggctgagc
      241 aggagaatga ttcaaagaac aacaggcttc ttcaggccgt accaggattg gaacaggaaa
      301 cccctcccca ctatggatga ttccaaatta aagaaggtag ccaacatatt cttgtgcact
      361 ttgtcttcac tattcaccag acccattaag gacataatag ggaagttgaa acctcttaac
      421 atccttaaca ttctggctac atgtgattgg accttcgcag gcatagtgga atccttaata
      481 ctcttggcag aactctttgg agtcttctgg acacccccag atgtgtctgc gatgatcgcc
      541 cccttgctag gtgattatga actgcaagga cctgaggacc ttgcagtgga attggtccca
      601 atagtgatgg gggggatagg tttggtgcta ggatttacca aagagaaaat cggaaagatg
      661 ctatcatccg ccgcatccac tttaagagct tgtaaagacc ttggtgcata cggactggaa
      721 atcttaaaat tggtcatgaa gtggttcttc ccaaagaaag aggaagcaaa tgaactggct
      781 atggtgagat ccatcgagga tgcagtacta gacctcgagg caattgaaaa caaccacatg
      841 accaccctac tcaaagacaa agacagcttg gcaacctaca tgagaaccct tgaccttgag
      901 gaggagaaag ccagaaaact ctcaaccaaa tctgcttcac ccgatattgt gggcacaatc
      961 aactctcttc tggcaagaat cgctgctgca cgttccctag tgcatcgggc gaaagaagag
     1021 ctctccagca ggccgagacc tgtcgttgtg atgatatcgg gaagaccagg gatagggaaa
     1081 actcaccttg ccagggagct ggccaagaag atcgcggcct ctctcacagg ggaccagcgt
     1141 gtgggtctta tcccacgcaa tggtgtcgac cactgggacg catacaaggg cgaaagagtt
     1201 gtcctatggg acgactatgg aatgagcaac cccatccatg atgccctcag gttgcaggag
     1261 cttgctgaca cttgccccct cacgctaaat tgtgacagaa ttgagaacaa ggggaaagtc
     1321 tttgacagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat
     1381 gtcaactttg aagcgtgctc gagacgcatt gatttcctcg tgtacgcaga agcccctgag
     1441 gtggagaagg caaagcgcga cttcccaggt caacctgaca tgtggaagaa cgctttcagt
     1501 cctgacttct cacacataaa actgtcattg gctccacagg gtggtttcga caagaacggc
     1561 aacaccccgc atggaaaagg ggtcatgaag accctcacca ctggctccct catcgcccga
     1621 gcatcagggt tactccatga gaggctagat gaatatgaac tgcaaggccc agccctcacc
     1681 actttcaact ttgaccgcaa caagatactt gcttttagac agcttgctgc tgaaaacaag
     1741 tatgggttga tggacacaat gagagttgga aaacagctca aggatgtcaa gaccatgtca
     1801 gacctcaaac aagcactcaa gaacatcgcg atcaagaagt gccagatagt gtacaatggt
     1861 ggcacctaca cacttgaagc tgatggcaag ggtagtgtga aagttgacag agtgcaaagt
     1921 gccactgtgc agaccaacaa tgaactagcc ggtgccctac accacctaag gtgcgctaga
     1981 atcagatact atgttaagtg cgtccaggag gcactgtatt ccatcatcca aatcgctggg
     2041 gctgcattcg tcaccacgcg catcgctaag cgcatgaata tacaaaatct ctggtccaag
     2101 ccacaggtgg aagacacaga agagacggcc aacaaagatg gttgcctgaa acccaaagat
     2161 gatgaagagt ttgtcgtctc atccgacgac atcaaaactg agggcaagaa agggaagaac
     2221 aagtccggcc gtggcaagaa gcacacggcc ttttcaagta aagggctcag tgatgaggag
     2281 tacgatgagt acaagagaat cagagaagaa aggaatggta agtactccat agaagagtac
     2341 cttcaggaca gagacaggta ctacgaggag gtggccattg ccagggcaac cgaagaggac
     2401 ttctgtgaag aagaagaggc caaaatccgg cagagaattt ttaggccaac aaggaaacaa
     2461 cgcaaagaag aaagggcctc tctcggcttg gtcacaggct ctgaaatcag gaagagaaac
     2521 ccagaagact tcaaacccaa gggaaagttg tgggctgatg atgacagaag tgttgactac
     2581 aatgagaaac tcaactttga agccccacca agcatctggt cgcggatagt caactttggc
     2641 tcaggctggg gcttctgggt ctcccccagt ctgtttataa catcaaccca tgtcataccc
     2701 caaggtgcaa aagagttctt cggagtccct atcaagcaaa tccagataca caagtcaggt
     2761 gaattctgcc ggttgagatt cccaaagcca atcagaactg atgtgacggg catgattcta
     2821 gaagaaggtg cgcccgaggg gaccgtggtc acactgctca tcaagagacc aactggagag
     2881 ctcatgcccc tggcagccag aatggggacc catgcaacca tgaaaattca ggggcgcaca
     2941 gttggagggc aaatgggtat gctcctgaca gggtccaacg ccaagagtat ggacctaggc
     3001 acaacaccag gcgactgcgg ctgcccctac atctacaaga gggggaatga ctacgtggtc
     3061 ataggagtcc atacggccgc tgcccgtgga ggaaacactg tcatatgtgc cacccagggg
     3121 agtgagggag aagccacact cgaaggaggt gacagtaaag ggacatactg tggcgcacca
     3181 atcttgggcc cagggagcgc tccgaagctc agcaccaaga ctaagttttg gagatcgtcc
     3241 acaacaccac tcccacctgg cacctacgaa ccagcctatc tcggtggcaa agaccctaga
     3301 gtcaaaggtg gcccttcatt gcaacaagtt atgagggacc agctgaagcc attcacagaa
     3361 cccagaggta aaccaccaag accaaatgtg ttggaagctg ccaagaaaac catcatcaat
     3421 gtccttgagc aaacaattga tccaccccaa aaatggtcat ttgcgcaagc ttgcgcatcc
     3481 cttgacaaaa ccacctccag cggccacccg caccacatgc ggaaaaacga ctgttggaat
     3541 ggggagtcct tcacaggaaa attggctgat caagcctcca aggccaacct aatgtttgaa
     3601 gagggaaaga acatgactcc agtctacaca ggtgcactta aagatgagtt ggtaaagacc
     3661 gaaaaagttt atggtaaggt caagaagagg cttctgtggg gttcagatct ggcgaccatg
     3721 atacggtgcg cccgagcttt tggaggcctt atggatgaac tcaaggcaca ctgtgtcaca
     3781 cttcctgtca gagttggtat gaacatgaat gaggatggcc ccatcatctt tgagaagcac
     3841 tccagatata gatatcacta tgatgctgat tattcccggt gggactcaac acagcaaagg
     3901 gatgtgctag cagcagcact agaaatcatg gttaagttct ctccagaacc acacctggcc
     3961 cagacagttg cagaagacct cctttcccct agcgtgatgg atgtaggtga ctttcaaata
     4021 tcaataagtg agggtctccc ctctggggta ccttgtacct cccagtggaa ttccatcgcc
     4081 cactggctcc tcactctgtg tgcactctct gaagtcacgg acctgtcccc tgatatcatt
     4141 caggccaact cccttttctc cttctatggt gatgatgaga ttgtaagcac agacataaag
     4201 ttggacccag agaagctgac agcaaaactt aaggagtatg ggctgaaacc aacccgcccc
     4261 gacaaaactg aaggacccct tgttatctct gaagacctgg atggcctgac attcctccgg
     4321 agaactgtga cccgtgatcc agctggctgg tttggaaaat tggaacaaag ttcaattctc
     4381 aggcaaatgt actggaccag gggtcccaac catgaagatc cttttgaaac aatgatacca
     4441 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcgct ccacggcccg
     4501 gcattctata gcaaaattag caaattagtc attgcagagt tgaaggaagg tggcatggat
     4561 ttttacgtac ccagacaaga gccaatgttc agatggatga gattctcaga tctgagcacg
     4621 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc
     4681 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc
     4741 tctggagccc gttgttggtg ccgccattgc ggcacccgta gcgggccaac aaaatgtaat
     4801 tgacccctgg attagaaata attttgtaca agcccctggt ggagagttta cagtatcccc
     4861 tagaaacgct ccaggtgaaa tactatggag cgcacccttg ggccctgatc taaatcccta
     4921 cctatcccat ttggccagaa tgtacaatgg ttatgcaggt ggttttgaag tgcaggtaat
     4981 tctcgcgggg aacgcgttca ccgccgggaa ggttatattt gcagcagtcc caccaaattt
     5041 tccaactgaa ggcttgagcc cctgccaggt tactatgttc ccccatatag tagtagatgt
     5101 taggcaacta gaacctgtgt tgattccctt acccgatgtt aggaataatt tctatcatta
     5161 caatcaatca aatgacccca ccattaagtt gatagcaatg ttgtatacac cacttagggc
     5221 taacaatgct ggggatgatg tcttcacagt ttcttgccga gttctcacga gaccttcccc
     5281 cgattttgat ttcatatttc tagtgccacc cacagttgag tcaagaacta aaccattctc
     5341 tgtcccagtt ttaactgttg aggagatgac caattcaaga ttccccattc ctttggaaaa
     5401 gttgttcacg ggtcccagca gtgcctttgt tgtccaacca caaaacggta ggtgcacgac
     5461 tgatggcgtg ctcctaggca ccacccaact gtctcccgtc aacatctgca ccttcagagg
     5521 agatgtcacc catatcacag gtagtcataa ctacacaatg aatttggctt ctcaaaattg
     5581 gagcaattat gacccaacag aagaaatccc agcccctcta ggaactccag actttgtggg
     5641 gaagattcaa ggcgtgctca cccaaaccac aaggacagat ggctcaacac gcggccacaa
     5701 agccacagtg tacactggga gcgccgactt tgctccaaaa ctgggtagag ttcaatttga
     5761 aactgacaca aaccatgatt ttgaagttaa tcaaaacaca aagttcaccc cagttggtgt
     5821 catccaagat ggtggcacca cccaccgaaa tgaaccccaa cagtgggtgc tcccaagtta
     5881 ctcaggcaga aatactccta atgtgcatct ggcccccgct gtagccccca cttttccggg
     5941 tgagcaactt ctcttcttca gatccaccat gcccggatgc agcgggtacc ccaacatgga
     6001 tttggactgt ctgctccccc aggaatgggt gcagtacttc taccaagagg cagccccagc
     6061 acaatctgat gtggctctgc taagatttgt gaatccagac acaggtaggg ttttgtttga
     6121 gtgtaagctt cataaatcag gctatgttac agtggctcac actggccaac atgatttggt
     6181 tatccccccc aatggttatt ttaggtttga ttcctgggtc aaccaatttt acacgcttgc
     6241 ccccatggga aatggaacgg ggcgtagacg tgcattataa tggctggagc tttctttgct
     6301 ggattggcat ctgatgtcct tggctctgga cttggttccc ttatcaatgc tggggctggg
     6361 gccatcaatc aaaaagttga gtatgaaaat aacagaaaat tgcaacaagc atccttccaa
     6421 tttagcagca atctacaaca ggcttctttt caacatgaca aagagatgct ccaagcacaa
     6481 attgaggcca ccaaaaggct acaacaagaa atgatgaaag ttaagcaggc aatgctccta
     6541 gagggtgggt tctctgagac agatgcagcc cgcggggcaa tcaacgcccc catgacaaaa
     6601 gctttggact ggagcgggac aaggtactgg gctcccgatg ctaggactac aacatacaat
     6661 gcaggccgct tttccacccc tcaaccatcg ggggcactgc caggaagagc taatcttagg
     6721 gatgctgtcc ctgctcgggg ttcctccagc aagccttcta attcttctac tgccacttct
     6781 gtgtactcaa atcaaactac ttcaacgaga cttggttcta cagctggttc tggtaccagt
     6841 gtctcgagct tcccgtcaac tgcaaggact aggagctggg ttgaggatca aagtaggaat
     6901 ttgtcacctt tcatgagggg ggcccacaac atatcgtttg tcaccccacc atctagcaga
     6961 tcctctagcc aaggga
//