Typing tool

Complete norovirus genomes

MK907799  GII.4 Sydney
 GII.P4 New Orleans

Length: 7,390 | 3 CDS

ORF1: 1..4899
ORF2: 4880..6502
ORF3: 6502..7308
LOCUS       MK907799                7390 bp    RNA     linear   VRL 02-NOV-2019
DEFINITION  Norovirus GII isolate G19_035 nonstructural polyprotein (ORF1)
            gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
            cds.
ACCESSION   MK907799
VERSION     MK907799.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7390)
  AUTHORS   Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
            Guyader,S.
  TITLE     Optimisation of agnostic metagenomic approaches to characterise
            human enteric viruses in sewage
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7390)
  AUTHORS   Le Guyader,S. and Strubbia,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7390
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_035"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="01-Oct-2008"
                     /note="genotype: GII.4-GII.P4_New Orleans 2009"
     gene            <1..4899
                     /gene="ORF1"
     CDS             <1..4899
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QCO93096.1"
                     /translation="IPPPPPNGEDEIVVSYSVKDGVSGLPDLSTVRQPEESNTAFSVP
                     PLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELAPL
                     SLYWRPVYTPQYLISPDTLKKLSGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQR
                     TTGFFRPYQDWNRKPLPTMDDSKIKKVANIFLCALSSLFTRPIKDIIGKIRPLNILNI
                     LASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVV
                     MGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELA
                     IVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVG
                     TINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREVAKRIAASLT
                     GDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRI
                     ENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPDVEKAKRDFPGQP
                     DMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKSLTTGSLIARASGLLHERLD
                     EFELQGPALTTFNFDRNKVLAFRQLAAENKYGLLDTMRVGKQLKDVKTMPELKQALKN
                     VSIKKCQIVYSGCTYMLDSDGKGNVKVDRIQSATVQTNNELVGALHHLRCARIRYYVK
                     CVQEALYSIIQIAGAAFVTTRIAKRMNIQDLWSKPQVENTEETTSKDGCPRPKDDEEF
                     VISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQ
                     DRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERVSLGLVTGSEIRKRN
                     PDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHV
                     IPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTLLIKR
                     STGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKR
                     GNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILGPGSAPTLST
                     KTKFWRSSTASLPPGTYEPAYLGGKDPRVKGGPSLQQVMREQLKPFTEPRGKPPKPSV
                     LEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKL
                     ADQASKANLMFEEGKNMTPVYTAALKDELVKTDKIYGKIKKRLLWGSDLATMIRCARA
                     FGGLMDELKAHCVTLPIRVGMNMNEDGPIIFERHSRYTYHYDADYSRWDSTQQRAVLA
                     AALEIMVKFSPEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCTSQWNSIAHW
                     LLTLCALSEVTNLSPDTIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLREYGLKPTRP
                     DKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHGDPSETM
                     IPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFS
                     DLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..789
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     790..1887
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     1888..2424
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2425..2823
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     2824..3366
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3367..4896
                     /gene="ORF1"
                     /product="RdRp"
     gene            4880..6502
                     /gene="ORF2"
     CDS             4880..6502
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCO93094.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGAPDFVGKIQGML
                     TQTTRADGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRVV"
     gene            6502..7308
                     /gene="ORF3"
     CDS             6502..7308
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCO93095.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVLARGSSSKSY
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPLMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEVLNSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN      
        1 attccccccc ccccacccaa cggagaggat gaaatagtgg tctcttatag tgtcaaagat
       61 ggtgtttccg gcttgcctga cctttccacc gtcaggcagc cggaagaatc taacacggcc
      121 ttcagtgtcc ctccactcaa ccagagggag aatagagatg ctaaggaacc actcactgga
      181 acaattctgg aaatgtggga cggggaaatc taccattatg gcctgtatgt ggagcgaggt
      241 cttgtactag gcgtgcacaa accgccagct gccattagcc tcgctaaggt tgagttagca
      301 ccactctcat tgtactggag acctgtgtac actcctcagt acctcatctc tccagacact
      361 ctcaagaaat tgtccggaga aacgttcccc tacacagcct ttgacaacaa ctgctatgcc
      421 ttttgttgct gggtcctgga cctaaatgac tcgtggctga gcaggagaat gatccagaga
      481 acaactggtt tcttcaggcc ctaccaagac tggaatagga aaccccttcc cactatggat
      541 gactccaaaa taaagaaggt ggccaacata tttctgtgtg ctctgtcctc gctattcacc
      601 aggcccataa aagatataat agggaagata aggcctctta acatcctcaa catcttagcc
      661 tcatgtgatt ggacctttgc gggtatagtg gagtccctga tactcttggc agaactcttt
      721 ggagttttct ggacaccccc agatgtgtct gcgatgattg cccccttact tggtgattac
      781 gagctacaag ggcctgagga ccttgcagtg gagctcgtcc ccgtggtgat ggggggaatt
      841 ggtttggtgc taggattcac caaagagaag attgggaaga tgttgtcatc agctgcgtcc
      901 accttaagag cttgcaaaga ccttggtgca tatgggctag agatcctaaa gttagtcatg
      961 aagtggttct tcccgaagaa ggaggaggcg aatgagctgg ctatagtgag gtccatcgag
     1021 gatgcagtcc tggatctcga agcaattgaa aacaatcata tgaccacctt gcttaaagat
     1081 aaagacagtc tggcaaccta catgagaaca cttgaccttg aagaggagaa agccaggaaa
     1141 ctctcaacca aatctgcctc acccgacatc gtgggcacaa tcaacgctct cctggcgaga
     1201 atcgctgccg cacgttctct ggtgcatcga gcgaaggagg agctttccag cagaccaaga
     1261 cctgtggtgt tgatgatatc aggcaggcca ggaataggga agactcacct cgctagggaa
     1321 gtggctaaga gaatcgcagc ctcccttaca ggagaccaac gtgttggtct catcccacgc
     1381 aatggcgtcg accattggga tgcgtacaag ggggagaggg tcgtcctatg ggacgattat
     1441 ggaatgagca accctattca cgatgccctc aggctgcaag aactcgctga cacttgcccc
     1501 ctcactctga actgtgacag gattgaaaat aaaggaaagg tctttgacag cgatgtcatc
     1561 attatcacca ccaatctggc caacccagcc ccactggact atgtcaactt tgaagcatgc
     1621 tcgaggcgca ttgacttcct cgtgtatgca gaagcccctg atgtcgaaaa ggcgaagcgt
     1681 gacttcccag gccagcctga catgtggaag aacgctttca gttctgattt ctcacacata
     1741 aaactagcac tggccccaca gggtggtttc gacaagaacg ggaacacccc acatggaaag
     1801 ggcgtcatga agtctctcac cactggctcc cttattgccc gggcatcagg gctgctccat
     1861 gagaggttag atgaatttga actgcagggc ccagctctca ccaccttcaa tttcgatcgc
     1921 aataaagtgc tagcctttag acagcttgct gctgaaaata aatatggatt gttggacaca
     1981 atgagggttg ggaaacagct caaggacgtc aaaaccatgc cagaactcaa acaagcactc
     2041 aagaatgtct caatcaagaa gtgtcaaata gtgtatagtg gttgcaccta catgcttgat
     2101 tctgatggca agggcaatgt gaaagttgac aggatccaaa gcgccaccgt gcagaccaac
     2161 aatgagctgg ttggtgccct gcaccacttg aggtgcgcca gaatcagata ctatgtcaag
     2221 tgtgtccagg aagccctgta ttccatcatt caaattgctg gggctgcatt tgtcaccacg
     2281 cgcattgcca agcgcatgaa catacaagac ctatggtcca agccacaagt ggaaaacaca
     2341 gaggagacta ccagcaagga cgggtgccca agacccaagg atgatgagga gtttgtcatt
     2401 tcgtccgacg acatcaaaac tgagggaaaa aaagggaaga acaagactgg ccgtggcaag
     2461 aagcacacag cattttcaag caaaggcctc agtgatgaag agtacgatga atacaagagg
     2521 atcagagaag aaaggaatgg caagtactct atagaagagt accttcagga cagggacaaa
     2581 tactatgaag aggtggccat tgccagagcg actgaggaag acttctgtga agaggaggaa
     2641 gccaagatcc gacaaaggat ctttaggcca acaaggaaac aacgcaagga ggaaagagtc
     2701 tctctcggtt tggtcacggg ttctgaaatt aggaaaagaa acccagatga cttcaaaccc
     2761 aaggggaaat tgtgggctga cgatgacagg agtgtggact acaatgagaa actcagtttt
     2821 gaggccccgc caagcatttg gtcaagaata gtcaactttg gttcaggctg gggattctgg
     2881 gtctccccca gcttgttcat aacatcaacc catgttatac cccagggcgc aaaggagttc
     2941 tttggagtcc ccatcaaaca aatacaggta cacaagtcag gcgagttctg tcgcttgaga
     3001 ttccctaaac caattaggac tgatgtgacg ggtatgatct tagaagaagg cgcacctgag
     3061 ggcaccgtgg tcacactact catcaaaagg tccactgggg aacttatgcc cctagcagct
     3121 aggatgggga cccatgcgac catgaagatc caagggcgca ctgttggggg ccagatgggc
     3181 atgcttctga caggatccaa cgccaagagt atggacctgg gtaccacacc aggtgattgt
     3241 ggctgcccct acatctacaa gagaggtaat gactatgtgg tcattggagt ccacacggct
     3301 gccgcacgtg gggggaacac tgtcatatgt gccacccaag ggagtgaagg agaggctaca
     3361 cttgagggtg gtgacaacaa ggggacatac tgtggtgcac caatcctagg cccagggagt
     3421 gccccaacac ttagcaccaa gaccaaattc tggaggtcgt ccacagcatc actcccacct
     3481 ggcacctatg aaccagccta tcttggtggc aaggacccta gagttaaggg tggcccttca
     3541 ctgcagcaag tcatgaggga acagttgaag ccattcacag agcccagggg taagccacca
     3601 aagccaagtg tgttagaagc tgccaagaaa accatcatta atgtccttga gcaaacaatt
     3661 gatccacctg agaaatggtc gttcgcacaa gcttgcgcgt ctcttgacaa gaccacttcc
     3721 agtggtcatc cgcaccacat gcggaaaaac gactgctgga acggggagtc cttcacaggc
     3781 aagctggcag accaggcttc taaggccaac ctgatgtttg aagaagggaa gaacatgacc
     3841 ccagtctaca cagctgcgct caaggatgag ttagttaaaa ctgacaaaat ttatggtaag
     3901 atcaagaaga ggcttctttg gggctcggac ttggcgacca tgatccggtg tgctcgagca
     3961 ttcggaggcc taatggatga actcaaagcg cactgtgtca cacttcccat tagagttggc
     4021 atgaatatga atgaggatgg ccccatcatc ttcgagaggc attccaggta cacgtaccac
     4081 tatgatgctg attactctcg atgggattca acacaacaga gagccgtgtt ggcagcagcc
     4141 ctagaaatca tggtaaaatt ctccccagaa ccacatttgg ctcaggtagt tgctgaagac
     4201 cttctttctc ctagcgtggt ggacgtgggc gacttcacaa tatcaatcaa cgagggcctt
     4261 ccctctgggg tgccttgcac ctcccaatgg aactccatcg cccactggct tctcactctc
     4321 tgtgcgctct ccgaagtcac aaacctgtct cctgatacca tacaggctaa ttctctcttc
     4381 tctttttatg gtgatgatga aattgttagc acagacataa aattggaccc agagaaattg
     4441 acagcaaagc tcagagaata tgggttaaag ccaacccgcc ctgacaaaac tgaagggccc
     4501 cttgtcatct ctgaagacct gaatggtcta actttcctgc ggagaactgt gacccgcgac
     4561 ccagctggtt ggtttggaaa actggagcag agttcaatac tcaggcaaat gtactggact
     4621 aggggtccca accatggaga cccatctgaa actatgattc cacactccca aaggcccata
     4681 caattgatgt ccctactggg ggaggccgct ctccacggcc cagcatttta cagtaaaatt
     4741 agcaaattgg tcattgcaga gctaaaagaa ggtggtatgg atttttacgt gcccagacaa
     4801 gagccaatgt tcagatggat gagattctca gatctgagca cgtgggaggg cgatcgcaat
     4861 ctggctccca gttttgtgaa tgaagatggc gtcgagtgac gccaacccat ctgatgggtc
     4921 cgcagccaac ctcgtcccag aggtcaacaa tgaggttatg gctctggagc ccgttgttgg
     4981 tgccgccatt gcggcacctg tagcgggcca acaaaatgta attgacccct ggattagaaa
     5041 taattttgta caagcccctg gtggagagtt tacagtgtcc cccagaaacg ctccaggtga
     5101 aatactatgg agcgcgccct taggccctga tctaaatccc tacctatccc atttggccag
     5161 aatgtataat ggttatgcag gtggttttga agtgcaggta attctcgcgg ggaacgcgtt
     5221 caccgccggg aaagtcatat ttgcagcagt cccaccaaat ttcccaactg aaggcttgag
     5281 ccccagccag gtcactatgt tcccccatat agtagtagat gttaggcaac tagaacctgt
     5341 gttgattccc ttacccgatg ttaggaataa tttctaccat tacaatcaat caaatgaccc
     5401 caccattaag ttgatagcaa tgttgtacac accacttagg gctaataatg ctggggacga
     5461 tgtcttcaca gtttcttgcc gagttctcac gagaccatcc cccgattttg atttcatatt
     5521 tctagtgcca cccacagttg agtctagaac taaaccattc tctgtcccag ttttaactgt
     5581 tgaggagatg accaattcaa gattccccat ccctctggaa aagttgttca cgggtcccag
     5641 cagtgccttt gttgttcaac cacaaaatgg taggtgcacg actgatggcg tgctcctagg
     5701 caccacccaa ttgtctcctg tcaacatctg caccttcaga ggggatgtca cccacattac
     5761 aggtagtcgt aactacacaa tgaatttggc ttctcaaaat tggaacaatt atgacccaac
     5821 agaagaaatc ccagcccctc taggagctcc agattttgtg gggaagattc aaggcatgct
     5881 cacccaaacc acaagggcag atggctcaac acgcggccac aaagccacgg tgtacactgg
     5941 gagcgccgac tttgctccaa aactgggcag agttcaattt gaaactgaca cagaccatga
     6001 ttttgaagct aaccaaaaca caaagttcac cccagtcggt gtcatccaag atggcagcac
     6061 cacccaccga aatgaacccc aacagtgggt gctcccaagt tactcaggca gaaatactca
     6121 caatgttcat ctggcccccg ctgtagcccc tacttttccg ggtgagcaac ttctcttctt
     6181 taggtccact atgcccggat gtagcgggta ccccaacatg gatttggact gtctgctccc
     6241 ccaggaatgg gtgcagtact tctaccaaga ggcagcccca gcacaatctg atgtggctct
     6301 gctaagattt gtgaatccag acacaggtag ggttttgttt gagtgtaaac ttcataaatc
     6361 aggctatgtt acagtggctc acactggcca acatgatttg gttatccccc ccaatggtta
     6421 ttttaggttt gattcctggg tcaaccagtt ctacacgctt gcccccatgg gaaatggagc
     6481 ggggcgtaga cgtgtagtat aatggctgga gctttctttg ctggattagc atctgatgtc
     6541 cttggctctg gacttggctc ccttatcaat gctggggctg gggccatcaa ccaaaaagtt
     6601 gagtttgaaa ataacagaaa attgcaacaa gcatccttcc aatttagcag caatctacaa
     6661 caggcttcct ttcaacatga caaagaaatg ctccaagcac aaattgaggc caccaaaaag
     6721 ctacaacagg aaatgatgag agttaagcag gcaatgctcc tagagggtgg gttctctgag
     6781 acagatgcag cccgcggggc aatcaacgcc cccatgacaa aagctttgga ctggagcggg
     6841 acaaggtact gggctcctga tgctaggacc acaacataca atgctggccg cttttccacc
     6901 cctcaaccat cgggggcact gccaggaaga gctaatctta gggatgctgt ccttgctcgg
     6961 ggttcctcta gcaaatctta taactcttct actgctactt ctgtgtactc aaatcaaacc
     7021 acttcaacga gacttggttc tacagctggt tctggtacca gtgtctcgag tctcccgtca
     7081 actgcaagga ctaggagctg ggttgaggat caaagtagga atttgtcccc tctcatgagg
     7141 ggggcccaca acatatcgtt tgtcacccca ccatctagca gatcctctag ccaaggcaca
     7201 gtctcaaccg tgcctaaaga ggttttgaac tcctggactg gcgctttcaa cacgcgcagg
     7261 cagcctctct tcgctcatat tcgtaaacga ggggagtcac gggtgtaatg tgaaaagaca
     7321 aaattgatta tctttctttc tctttagtgt cttttaaaaa aaaaaaaaaa aaaaaaaaaa
     7381 aaaaaaaaaa
//