Typing tool

Complete norovirus genomes

MK907796  GII.4 Sydney  GII.P31

Length: 7,557 nt | 3 CDS

ORF1: 1..5092
ORF2: 5073..6695
ORF3: 6695..7501
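
The ORF coordinates above are 1-based and inclusive, so span lengths and overlaps between neighbouring ORFs follow from simple arithmetic. The following is a minimal sketch of that calculation; the names ORFS, span_length, overlap and summarize are illustrative and not part of the typing tool. Note that ORF1 is annotated as a partial CDS in the record below (location <1..5092, codon_start=2), so its span here is a lower bound on the full ORF1 length.

```python
# Coordinates copied from the summary above (1-based, inclusive).
# The dictionary and helper names are hypothetical, for illustration only.
ORFS = {
    "ORF1": (1, 5092),
    "ORF2": (5073, 6695),
    "ORF3": (6695, 7501),
}


def span_length(start, end):
    """Number of nucleotides in a 1-based inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Nucleotides shared by two 1-based inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def summarize(orfs):
    """Return per-ORF span lengths and overlaps between consecutive ORFs."""
    names = list(orfs)
    lengths = {name: span_length(*orfs[name]) for name in names}
    overlaps = {
        (x, y): overlap(orfs[x], orfs[y]) for x, y in zip(names, names[1:])
    }
    return lengths, overlaps


lengths, overlaps = summarize(ORFS)
for name, n in lengths.items():
    print(f"{name}: {n} nt")
for (x, y), n in overlaps.items():
    print(f"{x}/{y} overlap: {n} nt")
```

With these coordinates, ORF1 and ORF2 share 20 nt and ORF2 and ORF3 share a single nucleotide at position 6695.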
LOCUS       MK907796                7557 bp    RNA     linear   VRL 02-NOV-2019
DEFINITION  Norovirus GII isolate G19_032 nonstructural polyprotein (ORF1)
            gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
            cds.
ACCESSION   MK907796
VERSION     MK907796.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7557)
  AUTHORS   Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
            Guyader,S.
  TITLE     Optimisation of agnostic metagenomic approaches to characterise
            human enteric viruses in sewage
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7557)
  AUTHORS   Le Guyader,S. and Strubbia,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7557
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_032"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="12-May-2014"
                     /note="genotype: GII.4-GII.Pe"
     gene            <1..5092
                     /gene="ORF1"
     CDS             <1..5092
                     /gene="ORF1"
                     /codon_start=2
                     /product="nonstructural polyprotein"
                     /protein_id="QCO93085.1"
                     /translation="ASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPKQP
                     PPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRHPEEAN
                     TAFSVPPLSQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAK
                     VELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLS
                     RRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKP
                     LNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAV
                     ELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKE
                     EANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSA
                     SPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKK
                     IAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLT
                     LNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKR
                     DFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGL
                     LHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDL
                     KQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDRVQSATVQTNNELAGALHHLRCAR
                     IRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETANKDGCLKP
                     KDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYS
                     IEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGS
                     EIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLF
                     ITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVV
                     TLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGC
                     PYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGS
                     APKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGK
                     PPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGE
                     SFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTEKVYGKVKKRLLWGSDLATM
                     IRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQ
                     QRDVLAAALEIMVKFSPEPHLAQTVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQW
                     NSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYG
                     LKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHE
                     DPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMF
                     RWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..982
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     983..2080
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2081..2617
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2618..3016
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3017..3559
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3560..5089
                     /gene="ORF1"
                     /product="RdRp"
     gene            5073..6695
                     /gene="ORF2"
     CDS             5073..6695
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCO93086.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEVNQNTKFTPVGVIQDG
                     GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
     gene            6695..7501
                     /gene="ORF3"
     CDS             6695..7501
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCO93087.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEYENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKPS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN      
        1 ggcgtctaac gacgcttccg ctgccgctgt tgccaacagc aacaacgaca tcgcaaaatc
       61 ttcaagtgac ggtgtgtttt ctaacatggc tgtcactttt aagcgggccc tcggggcgcg
      121 gcctaaacag ccgcccccga aggaaatacc acccagaccc ccgcgaccac ccacaccaga
      181 attggtcaaa aagatccctc ctcccccacc caacggggag gatgaactag tggtctctta
      241 cagcgccaaa gatggcgttt ccggactgcc tgagctcacc actgtcagac atccggaaga
      301 agccaacacg gcgttcagtg tccccccact cagccaaagg gaaagcaggg acgccaagga
      361 gccactaact gggacaatca ttgaaatgtg ggatggagaa atctaccatt acggcctgta
      421 cgtggaacga ggtcttatac ttggtgtgca caaaccaccg gcagccatta gccttgccaa
      481 ggtcgagcta gcaccgctct ctttgttctg gagacccgta tacaccccac agtatctcat
      541 ctctccagac actcttagga gattacatgg agagtcgttc ccctacactg catttgacaa
      601 caattgctac gccttttgtt gttgggtatt agacctaaac gactcatggc tgagcaggag
      661 aatgattcaa agaacaacag gcttcttcag gccgtaccag gattggaaca ggaaacccct
      721 ccccactatg gatgattcca aattaaagaa ggtagccaac atattcttgt gcactttgtc
      781 ttcactattc accagaccca ttaaggacat aatagggaag ttgaaacctc ttaacatcct
      841 taacattctg gctacatgtg attggacctt cgcaggcata gtggaatcct taatactctt
      901 ggcagaactc tttggagtct tctggacacc cccagatgtg tctgcgatga tcgccccctt
      961 gctaggtgat tatgaactgc aaggacctga ggaccttgca gtggaattgg tcccaatagt
     1021 gatggggggg ataggtttgg tgctaggatt taccaaagag aaaatcggaa agatgctatc
     1081 atccgccgca tccactttaa gagcttgtaa agaccttggt gcatacggac tggaaatctt
     1141 aaaattggtc atgaagtggt tcttcccaaa gaaagaggaa gcaaatgaac tggctatggt
     1201 gagatccatc gaggatgcag tactagacct cgaggcaatt gaaaacaacc acatgaccac
     1261 cctactcaaa gacaaagaca gcttggcaac ctacatgaga acccttgacc ttgaggagga
     1321 gaaagccaga aaactctcaa ccaaatctgc ttcacccgat attgtgggca caatcaactc
     1381 tcttctggca agaatcgctg ctgcacgttc cctagtgcat cgggcgaaag aagagctctc
     1441 cagcaggccg agacctgtcg ttgtgatgat atcgggaaga ccagggatag ggaaaactca
     1501 ccttgccagg gagctggcca agaagatcgc ggcctctctc acaggggacc agcgtgtggg
     1561 tcttatccca cgcaatggtg tcgaccactg ggacgcatac aagggcgaaa gagttgtcct
     1621 atgggacgac tatggaatga gcaaccccat ccatgatgcc ctcaggttgc aggagcttgc
     1681 tgacacttgc cccctcacgc taaattgtga cagaattgag aacaagggga aagtctttga
     1741 cagtgatgcc ataattatca ccaccaatct ggccaaccca gcaccactgg attatgtcaa
     1801 ctttgaagcg tgctcgagac gcattgattt cctcgtgtac gcagaagccc ctgaggtgga
     1861 gaaggcaaag cgcgacttcc caggtcaacc tgacatgtgg aagaacgctt tcagtcctga
     1921 cttctcacac ataaaactgt cattggctcc acagggtggt ttcgacaaga acggcaacac
     1981 cccgcatgga aaaggggtca tgaagaccct caccactggc tccctcatcg cccgagcatc
     2041 agggttactc catgagaggc tagatgaata tgaactgcaa ggcccagccc tcaccacttt
     2101 caactttgac cgcaacaaga tacttgcttt tagacagctt gctgctgaaa acaagtatgg
     2161 gttgatggac acaatgagag ttggaaaaca gctcaaggat gtcaagacca tgtcagacct
     2221 caaacaagca ctcaagaaca tcgcgatcaa gaagtgccag atagtgtaca atggtggcac
     2281 ctacacactt gaagctgatg gcaagggtag tgtgaaagtt gacagagtgc aaagtgccac
     2341 tgtgcagacc aacaatgaac tagccggtgc cctacaccac ctaaggtgcg ctagaatcag
     2401 atactatgtt aagtgcgtcc aggaggcact gtattccatc atccaaatcg ctggggctgc
     2461 attcgtcacc acgcgcatcg ctaagcgcat gaatatacaa aatctctggt ccaagccaca
     2521 ggtggaagac acagaagaga cggccaacaa agatggttgc ctgaaaccca aagatgatga
     2581 agagtttgtc gtctcatccg acgacatcaa aactgagggc aagaaaggga agaacaagtc
     2641 cggccgtggc aagaagcaca cggccttttc aagtaaaggg ctcagtgatg aggagtacga
     2701 tgagtacaag agaatcagag aagaaaggaa tggtaagtac tccatagaag agtaccttca
     2761 ggacagagac aggtactacg aggaggtggc cattgccagg gcaaccgaag aggacttctg
     2821 tgaagaagaa gaggccaaaa tccggcagag aatttttagg ccaacaagga aacaacgcaa
     2881 agaagaaagg gcctctctcg gcttggtcac aggctctgaa atcaggaaga gaaacccaga
     2941 agacttcaaa cccaagggaa agttgtgggc tgatgatgac agaagtgttg actacaatga
     3001 gaaactcaac tttgaagccc caccaagcat ctggtcgcgg atagtcaact ttggctcagg
     3061 ctggggcttc tgggtctccc ccagtctgtt tataacatca acccatgtca taccccaagg
     3121 tgcaaaagag ttcttcggag tccctatcaa gcaaatccag atacacaagt caggtgaatt
     3181 ctgccggttg agattcccaa agccaatcag aactgatgtg acgggcatga ttctagaaga
     3241 aggtgcgccc gaggggaccg tggtcacact gctcatcaag agaccaactg gagagctcat
     3301 gcccctggca gccagaatgg ggacccatgc aaccatgaaa attcaggggc gcacagttgg
     3361 agggcaaatg ggtatgctcc tgacagggtc caacgccaag agtatggacc taggcacaac
     3421 accaggcgac tgcggctgcc cctacatcta caagaggggg aatgactacg tggtcatagg
     3481 agtccatacg gccgctgccc gtggaggaaa cactgtcata tgtgccaccc aggggagtga
     3541 gggagaagcc acactcgaag gaggtgacag taaagggaca tactgtggcg caccaatctt
     3601 gggcccaggg agcgctccga agctcagcac caagactaag ttttggagat cgtccacaac
     3661 accactccca cctggcacct acgaaccagc ctatctcggt ggcaaagacc ctagagtcaa
     3721 aggtggccct tcattgcaac aagttatgag ggaccagctg aagccattca cagaacccag
     3781 aggtaaacca ccaagaccaa atgtgttgga agctgccaag aaaaccatca tcaatgtcct
     3841 tgagcaaaca attgatccac cccaaaaatg gtcatttgcg caagcttgcg catcccttga
     3901 caaaaccacc tccagcggcc acccgcacca catgcggaaa aacgactgtt ggaatgggga
     3961 gtccttcaca ggaaaattgg ctgatcaagc ctccaaggcc aacctaatgt ttgaagaggg
     4021 aaagaacatg actccagtct acacaggtgc acttaaagat gagttggtaa agaccgaaaa
     4081 agtttatggt aaggtcaaga agaggcttct gtggggttca gatctggcga ccatgatacg
     4141 gtgcgcccga gcttttggag gccttatgga tgaactcaag gcacactgtg tcacacttcc
     4201 tgtcagagtt ggtatgaaca tgaatgagga tggccccatc atctttgaga agcactccag
     4261 atatagatat cactatgatg ctgattattc ccggtgggac tcaacacagc aaagggatgt
     4321 gctagcagca gcactagaaa tcatggttaa gttctctcca gaaccacacc tggcccagac
     4381 agttgcagaa gacctccttt cccctagcgt gatggatgta ggtgactttc aaatatcaat
     4441 aagtgagggt ctcccctctg gggtaccttg tacctcccag tggaattcca tcgcccactg
     4501 gctcctcact ctgtgtgcac tctctgaagt cacggacctg tcccctgata tcattcaggc
     4561 caactccctt ttctccttct atggtgatga tgagattgta agcacagaca taaagttgga
     4621 cccagagaag ctgacagcaa aacttaagga gtatgggctg aaaccaaccc gccccgacaa
     4681 aactgaagga ccccttgtta tctctgaaga cctggatggc ctgacattcc tccggagaac
     4741 tgtgacccgt gatccagctg gctggtttgg aaaattggaa caaagttcaa ttctcaggca
     4801 aatgtactgg accaggggtc ccaaccatga agatcctttt gaaacaatga taccacactc
     4861 ccaaagaccc atacaattga tgtccttgct gggcgaggct gcgctccacg gcccggcatt
     4921 ctatagcaaa attagcaaat tagtcattgc agagttgaag gaaggtggca tggattttta
     4981 cgtacccaga caagagccaa tgttcagatg gatgagattc tcagatctga gcacgtggga
     5041 gggcgatcgc aatctggctc ccagttttgt gaatgaagat ggcgtcgagt gacgccaacc
     5101 catctgatgg gtccgcagcc aacctcgtcc cagaggtcaa caatgaggtt atggctctgg
     5161 agcccgttgt tggtgccgcc attgcggcac ccgtagcggg ccaacaaaat gtaattgacc
     5221 cctggattag aaataatttt gtacaagccc ctggtggaga gtttacagta tcccctagaa
     5281 acgctccagg tgaaatacta tggagcgcac ccttgggccc tgatctaaat ccctacctat
     5341 cccatttggc cagaatgtac aatggttatg caggtggttt tgaagtgcag gtaattctcg
     5401 cggggaacgc gttcaccgcc gggaaggtta tatttgcagc agtcccacca aattttccaa
     5461 ctgaaggctt gagccccagc caggttacta tgttccccca tatagtagta gatgttaggc
     5521 aactagaacc tgtgttgatt cccttacccg atgttaggaa taatttctat cattataatc
     5581 aatcaaatga ccccaccatt aagttgatag caatgttgta tacaccactt agggctaaca
     5641 atgctgggga tgatgtcttc acagtttctt gccgagttct cacgagacct tcccccgatt
     5701 ttgatttcat atttctagtg ccacccacag ttgagtcaag aactaaacca ttctctgtcc
     5761 cagttttaac tgttgaggag atgaccaatt caagattccc cattcctttg gaaaagttgt
     5821 tcacgggtcc cagcagtgcc tttgttgtcc aaccacaaaa cggtaggtgc acgactgatg
     5881 gcgtgctcct aggcaccacc caactgtctc ccgtcaacat ctgcaccttc agaggagatg
     5941 tcacccatat cacaggtagt cataactaca caatgaattt ggcttctcaa aattggagca
     6001 attatgaccc aacagaagaa atcccagccc ctctaggaac tccagacttt gtggggaaga
     6061 ttcaaggcgt gctcacccaa accacaagga cagatggctc aacacgcggc cacaaagcca
     6121 cagtgtacac tgggagcgcc gactttgctc caaaactggg tagagttcaa tttgaaactg
     6181 acacaaacca tgattttgaa gttaatcaaa acacaaagtt caccccagtt ggtgtcatcc
     6241 aagatggtgg caccacccac cgaaatgaac cccaacagtg ggtgctccca agttactcag
     6301 gcagaaatac tcctaatgtg catctggccc ccgctgtagc ccccactttt ccgggtgagc
     6361 aacttctctt cttcagatcc accatgcccg gatgcagcgg gtaccccaac atggatttgg
     6421 actgtctgct cccccaggaa tgggtgcagt acttctacca agaggcagcc ccagcacaat
     6481 ctgatgtggc tctgctaaga tttgtgaatc cagacacagg tagggttttg tttgagtgta
     6541 agcttcataa atcaggctat gttacagtgg ctcacactgg ccaacatgat ttggttatcc
     6601 cccccaatgg ttattttagg tttgattcct gggtcaacca attttacacg cttgccccca
     6661 tgggaaatgg aacggggcgt agacgtgcat tataatggct ggagctttct ttgctggatt
     6721 ggcatctgat gtccttggct ctggacttgg ttcccttatc aatgctgggg ctggggccat
     6781 caatcaaaaa gttgagtatg aaaataacag aaaattgcaa caagcatcct tccaatttag
     6841 cagcaatcta caacaggctt cttttcaaca tgacaaagag atgctccaag cacaaattga
     6901 ggccaccaaa aggctacaac aagaaatgat gaaagttaag caggcaatgc tcctagaggg
     6961 tgggttctct gagacagatg cagcccgcgg ggcaatcaac gcccccatga caaaagcttt
     7021 ggactggagc gggacaaggt actgggctcc cgatgctagg actacaacat acaatgcagg
     7081 ccgcttttcc acccctcaac catcgggggc actgccagga agagctaatc ttagggatgc
     7141 tgtccctgct cggggttcct ccagcaagcc ttctaattct tctactgcca cttctgtgta
     7201 ctcaaatcaa actacttcaa cgagacttgg ttctacagct ggttctggta ccagtgtctc
     7261 gagcttcccg tcaactgcaa ggactaggag ctgggttgag gatcaaagta ggaatttgtc
     7321 acctttcatg aggggggccc acaacatatc gtttgtcacc ccaccatcta gcagatcctc
     7381 tagccaaggc acagtctcaa ccgtgcctaa agagattttg gactcctgga ctggcgcttt
     7441 caacacgcgc aggcagccac tcttcgctca cattcgtaag cgaggggagt cacgggcgta
     7501 atgtgaaaag acaaaattga ttatctttct tttctttagt gtcttttaaa aaaaaaa
//