Complete norovirus genomes

MK907798 | GII.3 | GII.P21

Length: 7,480 bp | 3 CDS

ORF1: <1..5099 (partial)
ORF2: 5080..6726
ORF3: 6726..>7480 (partial)
LOCUS       MK907798                7480 bp    RNA     linear   VRL 02-NOV-2019
DEFINITION  Norovirus GII isolate G19_034 nonstructural polyprotein (ORF1)
            gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3)
            gene, partial cds.
ACCESSION   MK907798
VERSION     MK907798.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7480)
  AUTHORS   Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
            Guyader,S.
  TITLE     Optimisation of agnostic metagenomic approaches to characterise
            human enteric viruses in sewage
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7480)
  AUTHORS   Le Guyader,S. and Strubbia,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7480
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_034"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="10-Jan-2008"
                     /note="genotype: GII.3-GII.P21"
     gene            <1..5099
                     /gene="ORF1"
     CDS             <1..5099
                     /gene="ORF1"
                     /codon_start=3
                     /product="nonstructural polyprotein"
                     /protein_id="QCO93092.1"
                     /translation="KMASNDASAAAAANSNNDNAKSSSDGVLSNMAVTFKRALGARSK
                     QPPPRDKPPKPPRPPTPELVKAIPPPPPNGEDEPIISYNVKGGVSGLPELSTVTQLEE
                     SSTAFSVPPLSQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISL
                     AKVELTPLSLYWRPVYTPQYLISPDTLRKLHGETFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKVKKVANVVLCALSSLFTRPIKDIIGKL
                     KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEKNELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDMEEEKARKLSTK
                     SASPDIVGTINALLSRIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELA
                     KKIASSLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP
                     LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPEVEKA
                     KRDFPGQPDMWKDTFKPDFSHIKLTLAPQGGFDKNGNTPHGKGVMKTLTASSLVARAS
                     GLLHERLDEYELQGPTPTTFNFDRNKVLAFRQLAAENKYGLVDTMRVGSQLKNVKTMT
                     ELKQALKNISVKKCQLVYGGGTYTLESDGKGNVHVEKVNNTSVQTNNELSGALHHLRC
                     ARIRYYVKCVQEALYSILQIAGAAFITTRIAKRMNIQNLWSKPQAEDPEETNNEDGCP
                     KPKNDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGR
                     YSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVT
                     GSEVRKRNPDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPS
                     LFITSTHVIPQGAQEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGT
                     VATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDC
                     GCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGNEGEAMLEGGDNKGTYCGAPILGP
                     GNAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPR
                     GKPPKPSVLEAAKKTIINVLEQTIDPPQKWTYAQACASLDKTTSSGHPYHMRKNDCWN
                     GETFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGTIKKRLLWGSDLS
                     TMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWDS
                     TQQRAVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCTS
                     QWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLKE
                     YGLKPTRPDKTEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTKGPN
                     HEDPFETMIPHSQRPIQLMSLLGEAALHGPSFYSKISKLVISELKEGGMDFYVPRQEP
                     MFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..989
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     990..2087
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2088..2624
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2625..3023
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3024..3566
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3567..5096
                     /gene="ORF1"
                     /product="RdRp"
     gene            5080..6726
                     /gene="ORF2"
     CDS             5080..6726
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCO93093.1"
                     /translation="MKMASNDAAPSNDGAAGLVPEINNEAMALEPVAGAAIAAPLTGQ
                     QNIIDPWIMNNFVQAPGGEFTVSPRNSPGEVLLNLELGPEINPYLAHLARMYNGYAGG
                     FEVQVVLAGNAFTAGKIIFAAIPPNFPIDNLSAAQITMCPHVIVDVRQLEPVNLPMPD
                     VRNNFFHYNQGSDSRLRLIAMLYTPLRANNSGDDVFTVSCRVLTRPSPDFSFNFLVPP
                     TVESKTKPFSLPILTISEMSNSRFPVPIDSLHTSPTENIVVQCQNGRVTLDGELMGTT
                     QLLPSQICALRGVLTRSTSRASDQADTATPRLFNYYWHIQLDNLNGTPYDPAEDIPGP
                     LGTPDFRGKVFGVASQRNPDATTRAHEAKIDTTSGRFTPKLGSLEISTESDDFDQNKP
                     TRFTPVGIGVDHEADFQQWTLPDYAGQFTHNMNLAPAVAPNFPGEQLLFFRSQLPSSG
                     GRSNGILDCLVPQEWVQHFYQESAPSQSQVALVRYINPDTGRVLFEAKLHKLGFMTIA
                     KNGDSPITVPPNGYFRFESWVNPFYTLAPMGTGNGRRRIQ"
     gene            6726..>7480
                     /gene="ORF3"
     CDS             6726..>7480
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCO93091.1"
                     /translation="MAGAFIAGLAGDMLTNTVGSLVNAGANAINQTIDFENNKYLQNA
                     SFNHDREMLNAQIEATKRLQADMIAIKQGVLTAGGFSPTDAARGAINAPMTKVLDWNG
                     TRYWAPGATSTTSMSGGFTNQTVHRSTPNFKTNQAPKSTPSSGSSVRSNSTQITSLSS
                     HSSGSSRSSGSTVVSSMPSSNRTRDWVNQQNFNLEPHMPGSLRTAFVTPPSSTASSSG
                     TVSTVPKNVLDSWTSAFNTRRQPLFAHLRRRGES"
ORIGIN      
        1 tgaagatggc gtctaacgac gcttccgctg ccgctgctgc caatagcaac aacgacaacg
       61 caaaatcttc aagtgacgga gtattatcta atatggctgt cacttttaaa cgagccctcg
      121 gggcgcggtc taaacagccg cccccgaggg acaaaccacc aaaaccccca agaccaccca
      181 caccagagtt ggttaaggca atcccccctc ccccacccaa cggggaggac gaaccaatca
      241 tttcttacaa cgtcaaaggg ggtgtttctg gtttgcctga actctcaact gtcacccaac
      301 tggaagagag ctctacagca ttcagcgttc cccctcttag tcagagggag aacagagatg
      361 caaaggaacc attgactgga accatcctgg agatgtggga cggggagatc taccattatg
      421 gcttatacgt ggaacgaggg ttagtgctcg gtgtacacaa accaccagca gccatcagcc
      481 ttgctaaggt tgagctgacc cccttgtctt tgtattggag accagtgtac accccacagt
      541 acctcatttc ccctgacact ctcagaaagt tgcatgggga gacgttccct tatacagcct
      601 ttgacaacaa ctgttatgcc ttctgctgtt gggtacttga cctgaacgat tcatggctga
      661 gcagaaggat gatacagagg acaacaggct tcttccgacc ctaccaggac tggaacagga
      721 aacccctccc cacgatggat gactccaaag tgaaaaaggt ggccaatgtt gtcctctgtg
      781 ctctctcatc attgtttaca agaccaatta aggacattat tgggaagttg aaacccctaa
      841 acattctcaa catcctagcc acatgtgatt ggacttttgc aggcatagta gaatccctga
      901 tccttttggc tgaactgttt ggagttttct ggacaccccc agatgtgtct gcaatgatcg
      961 ctcccttact gggtgattat gaactacagg gacccgagga cctcgctgtg gaactcgtac
     1021 ccgtagtaat gggagggata ggtttggtgc tgggattcac caaagagaaa atcggcaaaa
     1081 tgctttcatc tgctgcttca accctgaggg catgcaaaga tctcggtgca tatgggttgg
     1141 agatcctcaa attggttatg aaatggtttt tcccaaagaa agaagagaaa aacgaattgg
     1201 caatggtgag atccatcgag gatgcagtgc tagatcttga ggccattgag aacaaccata
     1261 tgacagctct actcaaggat aaagacagcc ttgcaaccta catgaggact cttgacatgg
     1321 aggaggagaa ggcgaggaaa ctttccacca agtcggcctc gccggatatt gtgggcacga
     1381 taaacgccct actgtcaagg attgcagctg cccggtctct agtgcacagg gctaaggagg
     1441 agctgtcaag caggccccga ccggtcgttg taatgatttc agggagaccg ggtataggga
     1501 aaacccatct agctagagag ttggcaaaga agatcgcctc ctcactcaca ggtgaccaga
     1561 gggtgggtct tatcccacgt aacggggtcg accactggga tgcatacaaa ggcgaaagag
     1621 tcgttctctg ggacgactac gggatgagca accctatcca cgacgccctc agactccagg
     1681 agctcgctga tacctgtcct cttacactca actgtgacag gattgagaac aaaggcaaag
     1741 tctttgacag tgacgccata ataataacca ccaatttggc caacccagcg ccactggatt
     1801 acgtcaactt tgaagcttgc tcaagacgca tagacttcct cgtctatgct gatgcccctg
     1861 aagttgagaa ggctaaacgg gacttcccag gccaaccaga catgtggaaa gacaccttta
     1921 aacccgactt ctcacatata aaactgacat tagccccgca aggaggtttt gacaagaatg
     1981 gtaacactcc tcatgggaag ggtgtcatga agaccctcac tgccagttcc ctcgttgccc
     2041 gagcatcagg gctcctccac gagagattag acgaatatga gctgcagggc ccaactccca
     2101 caacattcaa cttcgaccgc aacaaggtgc ttgcttttag gcagcttgct gctgaaaaca
     2161 agtacggtct tgttgacaca atgagggtcg ggtcacagct caagaatgtc aaaactatga
     2221 cagaactcaa gcaggccctc aagaacatct cagtcaagaa atgtcagctt gtgtatggtg
     2281 ggggcacgta cacacttgaa tctgatggca aaggcaatgt gcatgtcgaa aaggtgaaca
     2341 acaccagtgt gcaaactaac aacgagctct ccggggcttt gcaccatctc aggtgtgcca
     2401 gaatcaggta ctatgttaag tgtgtccagg aagctctcta ctccatcttg caaattgccg
     2461 gggctgcatt cattaccacg cgcattgcaa agcgcatgaa catacaaaac ctctggtcca
     2521 aaccacaagc agaagatccg gaggagacta acaacgagga tggttgtcca aaacctaaaa
     2581 atgatgagga gtttgtcatc tcctctgatg acatcaaaac cgagggtaag aaaggaaaga
     2641 acaagactgg ccgtggtaag aagcacacag ccttttccag caaaggactc agtgatgagg
     2701 agtacgatga gtataagaga attagggaag agagaaatgg caggtactcc atagaggaat
     2761 acctacagga cagagacaaa tactatgagg aggtggccat agccagggca actgaggaag
     2821 acttctgcga agaagaggag gctaaaatcc ggcaaaggat attcagacca acaaggaaac
     2881 aacgcaaaga ggaaagagcc tctcttggtt tggttacagg ttctgaggtc aggaagagaa
     2941 acccagatga cttcaagccc aaagggaaac tatgggccga tgatgacagg agtgttgact
     3001 acaatgagaa acttagtttc gaggctccac caagcatttg gtcacgaata gtcaactttg
     3061 gctcagggtg gggcttttgg gtctcgccca gcctcttcat aacatcaacc catgtcattc
     3121 cccaaggcgc acaggagttc tttggagtgc ccatcaaaca aatacaaatt cacaagtcag
     3181 gtgagttctg ccggcttagg ttcccaaaac caatcaggac agatgtcaca ggcatgatct
     3241 tggaggaagg tgctccagaa ggcactgttg ccacacttct catcaagaga ccaactgggg
     3301 aactcatgcc cttggcagcc agaatgggca cccacgcgac catgaaaatt cagggtcgca
     3361 ctgttggtgg acagatgggc atgctactca cagggtctaa cgctaagagt atggatttgg
     3421 gcacaactcc tggcgattgt ggttgtccct atatctacaa gagagggaac gactacgtgg
     3481 tcattggagt tcacactgcc gccgctcgtg gaggaaacac cgtcatctgt gcaacccaag
     3541 gaaacgaggg tgaggccatg ctagaaggtg gtgacaacaa gggaacttat tgtggagcac
     3601 caatattagg ccctggaaat gcccccaaac tcagcaccaa aaccaaattc tggaggtctt
     3661 ccaccacccc cctgccaccc ggaacctatg agccagctta tctgggtggc aaggacccta
     3721 gagtgaaagg tggcccctca ctgcaacagg ttatgaggga ccaactaaaa ccattcactg
     3781 agcccagagg caaaccaccc aaaccaagtg tgctagaagc cgccaagaag actataatca
     3841 atgtgcttga acaaacaata gacccacctc aaaaatggac atacgcacaa gcgtgtgcat
     3901 cactagacaa gaccacttcc agtggccacc cttaccacat gcggaagaac gattgctgga
     3961 atggggaaac tttcacagga aaactggcag accaagcatc aaaggctaac ctaatgtttg
     4021 aggaaggaaa gaacatgacc ccagtataca caggagctct gaaagatgag ctagtcaaga
     4081 ctgataagat ctatgggaca atcaagaaga gactcctgtg gggttcagac ctatcaacca
     4141 tgatacggtg tgcacgagcc ttcggtgggc taatggacga gctcaaggcc cattgcgtca
     4201 cactaccagt cagggttggt atgaacatga atgaggatgg acccataata tttgagaaac
     4261 attccagata caaataccat tatgatgcag attactcccg ctgggactca acacaacaaa
     4321 gagcagtgct ggctgcagcc ctggaaataa tggtcaaatt ctcaccagaa ccccatctgg
     4381 cccaagtagt tgcagaagac ctcttgtccc ccagtgtgat ggatgtgggt gatttcaaga
     4441 tatcaatcaa cgagggatta ccctctggtg ttccttgcac ctcacaatgg aactccattg
     4501 cccactggct cctcacacta tgtgcactgt ctgaagtcac agacctgtcc cctgacatca
     4561 tccaggcgaa ttccctgttc tccttttatg gtgatgatga aatagtgagc acagacatca
     4621 aactggaccc agagaaattg acaacaaaat tgaaggaata cgggctcaaa ccaacccgtc
     4681 ctgataaaac agaaggaccc ttaattatct ctgaagattt ggatggcctg accttcttac
     4741 ggagaacagt gacccgtgat ccggccgggt ggtttggcaa actggaccaa agttcaatac
     4801 tcaggcagat gtactggacc aagggaccaa accatgagga cccctttgaa acaatgatac
     4861 cacactccca aagacccata caactgatgt cattactggg tgaagctgca ttgcatggtc
     4921 catcgttcta cagtaaaatc agcaaattgg tcatctcaga attgaaagag ggtggaatgg
     4981 atttttacgt gcccagacaa gaaccaatgt tcaggtggat gagattctca gatttgagca
     5041 cgtgggaggg cgatcgcaat ctggctccca gttttgtgaa tgaagatggc gtcgaatgac
     5101 gccgctccat ctaatgatgg tgccgccggc ctcgtcccag agatcaataa tgaggcaatg
     5161 gcgctagagc cagtggcggg tgcagcgata gcagcacccc tcactggtca gcaaaatata
     5221 attgatccct ggattatgaa taattttgtg caagcacctg gtggtgagtt tacagtatcc
     5281 cctagaaatt cccctggtga agttcttctt aatttggaat tgggcccaga aataaatccc
     5341 tatttggccc atcttgctag aatgtataat ggttatgcag gtggatttga agtgcaggtg
     5401 gtcctagctg gaaatgcgtt tacagcagga aagataatct ttgcagctat tccccccaat
     5461 tttccaattg acaatctaag tgcagcacag attacaatgt gcccacatgt gattgtggat
     5521 gtcagacagt tggaaccagt caacctcccg atgcctgacg ttcgtaacaa cttctttcat
     5581 tacaatcaag ggtccgattc gagattgcgc ctaattgcaa tgctatacac acctcttagg
     5641 gcaaataatt ctggggatga tgtttttact gtgtcttgca gagtgctaac tagacctagt
     5701 cctgacttct catttaattt ccttgtgcca cctactgtgg agtcaaagac aaaacccttt
     5761 tccctcccta ttctgactat ctctgaaatg tccaattcta ggttcccagt accaattgat
     5821 tctctgcaca ccagtcctac tgagaatatt gttgtccagt gccagaatgg gcgcgtcacc
     5881 cttgatggtg agttgatggg caccactcaa ctcttaccta gccaaatctg tgctctcagg
     5941 ggcgttctca ccagatcaac aagcagggcc agtgaccagg ccgatacagc aacccctaga
     6001 ttgtttaatt attattggca catacaattg gataatctaa atggaactcc ttatgatcct
     6061 gcagaagaca taccaggccc cctagggaca ccagatttcc ggggcaaagt ctttggcgtg
     6121 gccagccaga gaaatcctga tgccacgact agggcacatg aagcaaagat agacacaaca
     6181 tctggccgtt tcactccaaa gctaggctca ttagagatat ccactgaatc tgatgatttt
     6241 gatcaaaaca aaccaacaag attcacccca gttggcattg gggttgacca tgaggcagac
     6301 ttccaacaat ggactctacc cgactacgct ggccagttca cccacaacat gaacttagcc
     6361 ccagctgttg ctcccaactt ccctggtgag cagctccttt tcttccgctc acagttgcca
     6421 tcttctggtg ggcgatccaa cgggattcta gactgcctgg tcccccaaga atgggtacag
     6481 cacttctacc aagaatcagc cccctctcaa tctcaagtgg ccctggttag gtatatcaac
     6541 cctgacactg gtagagtgtt atttgaggcc aagctgcaca aattaggttt catgactata
     6601 gccaagaatg gtgactctcc aataacagtc cctccaaatg gatactttag gtttgaatct
     6661 tgggtgaacc ccttttacac acttgccccc atgggaactg ggaatgggcg tagaaggatt
     6721 caataatggc tggagctttt atagcaggat tggctggtga catgctcaca aatactgtag
     6781 gatctttagt taatgcaggg gctaatgcca ttaatcaaac aattgatttt gaaaataata
     6841 aatatttgca aaatgcttct ttcaatcatg atagggagat gttgaatgca caaattgagg
     6901 caacaaagag gttacaggct gacatgattg ctatcaaaca aggggttttg accgctggcg
     6961 gcttctcccc tactgatgca gcccgtgggg caattaatgc ccccatgaca aaagtcctag
     7021 attggaatgg aacgagatac tgggcaccag gtgccacctc cacaacctcg atgtcgggtg
     7081 gctttacaaa tcaaactgtg cacagatcca caccaaattt taaaacgaac caggctccca
     7141 aatccacacc cagcagtggg tcttcagtga ggtcaaattc aacccaaatc actagcctga
     7201 gctcacactc gtccgggtcg tctcgatcca gcgggtctac agttgtcagc tcaatgccat
     7261 cctctaacag gactagggac tgggtcaacc aacaaaattt taatttggaa ccacacatgc
     7321 ctggatctct taggacagct tttgtcactc caccatctag tacagcctct agctcaggca
     7381 cagtctcaac tgtgcccaaa aatgttttgg actcctggac atctgcgttt aacacgcgca
     7441 gacagccgct attcgcacac cttcgcagaa ggggggagtc
//