Typing tool

Complete norovirus genomes

MT238668  GII.4 Sydney | GII.P16

Length: 7,556 | 3 CDS

ORF1: <1..5093 (partial at the 5' end)
ORF2: 5074..6696
ORF3: 6696..7502
LOCUS       MT238668                7556 bp    RNA     linear   VRL 01-MAY-2020
DEFINITION  Norovirus GII isolate 782-3 nonstructural polyprotein (ORF1) gene,
            partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION   MT238668
VERSION     MT238668.1
DBLINK      BioProject: PRJNA604000
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7556)
  AUTHORS   Yang,Z., Williams-Hill,D., Wolfe,J., Silva,A.J., Hirneisen,K.,
            Ruelle,S., Kulka,M. and Hellberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAR-2020) Molecular Virology Team/Division of
            Molecular Biology, OARSA/CFSAN/FDA, 8301 Muirkirk Rd., Laurel, MD
            20708, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics Workbench v. 11
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7556
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="782-3"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="USA"
                     /collection_date="17-Dec-2018"
                     /note="genotype: GII.P16-GII.4"
     gene            <1..5093
                     /gene="ORF1"
     CDS             <1..5093
                     /gene="ORF1"
                     /codon_start=3
                     /product="nonstructural polyprotein"
                     /protein_id="QIQ09391.1"
                     /translation="ASNDATVAVACNNNNDKEKSSGEGLFTNMSFTLKKALGARPKQP
                     APRDEPQKPPRPPTPELVKRIPPPPPNGEGEEEPVIRYEVKSGISGLPELTTVPQPDV
                     ANTAFSVPPLSLRENREAKEPLTGAILEMWDGEIYHYGLYVEKGLVLGVHKPPAAISM
                     ARVELTPLSLYWRVVYTPQYLISPETLRRLNGEAFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMVQRTTGFFRPYQEWNRKPLPTMDDSKIKKVANIFLCSLSTLFTRPIKDLIGKI
                     KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDMEEEKARRLSTK
                     SASPDIVGTINALLARIAAARSLVHRAKEELSSRIRPVVVMISGKPGIGKTHLAREVA
                     RKVASTLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELADTCP
                     LTLNCDRIENKGKVFDSEVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA
                     KRDFPGQPDMWKDAFKPDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS
                     GLLHERMDEFELQGPTITTFNFDRNRITAFRQLAAENKYGLVDTMKVGNQLKGVKTME
                     ELKQAIRNVTIKRCRIIYGGSTYDLESDGKGKVLVEKVKNTSVQTNNELAGALHHLKH
                     ARIRYYVKCVQEAVYSIIQIAGAAFVTTRIARRMNIQELWSKPQLDQNESETKEEAPK
                     SEDDEFIISSKDIKEEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYS
                     IEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLVTGS
                     EIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFEAPPSIWSRIVSFGSGWGFWVSPSLF
                     ITSTHVIPAGITEAFGVPIKQIQIHKSGEFCRFRFPRPIRPDVTGMILEEGAPEGTVA
                     TVLIKRPTGELMPLAARMGTHATMKIQGRMVGGQMGMLLTGSNAKGMDLGTTPGDCGC
                     PYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGYDKGTYCGAPILGPGG
                     APKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEPRGK
                     PPRPSVLEAAKQTVINVLEQTLDPPQKWTYAQACASLDKTTSSGHPHHVRKNEFWNGE
                     TFTGKLADQASKANLMFEEGKHMTPVYTAALKDELVKTEKIYRKIKKRLLWGSDLSTM
                     IRCARSFGGLMDEMKAHCISLPVRVGMNVNEDGPIIFEKHSRYKYHYDADYSRWDSTQ
                     QRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPSVVDVGDFKITINEGLPSGVPCTSQW
                     NSIAHWLLTLCALSEVTKLSPDIIQANSMFSFYGDDEIVSTDIKLDPEQLTAKLKEYG
                     LKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHE
                     DPNETMIPHSQRPIQLMALLGEASLHGPSFYSRISKLVITELKEGGMDFYVPRQEPMF
                     RWMRFSDLSTWEGDRNLAPNFVNEDGVE"
     mat_peptide     <1..989
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     990..2087
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2088..2618
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2619..3017
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3018..3560
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3561..5090
                     /gene="ORF1"
                     /product="RdRp"
     gene            5074..6696
                     /gene="ORF2"
     CDS             5074..6696
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QIQ09392.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV"
     gene            6696..7502
                     /gene="ORF3"
     CDS             6696..7502
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QIQ09393.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGPSNKSS
                     NSSTATSVYSNQTISTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN      
        1 tggcgtctaa cgacgctacc gttgccgttg cttgcaacaa caacaacgac aaggaaaaat
       61 cttcaggtga aggcttattc acaaatatgt ctttcacctt aaagaaagcc ctcggggcta
      121 ggcccaaaca gcctgccccg agagacgaac cacaaaagcc cccaagacca ccaacccccg
      181 agttggtcaa gaggataccc cctcctccac ctaatggcga aggagaagaa gaaccagtca
      241 ttaggtatga ggttaagagt gggatctctg gcctgcccga gctcacaaca gtcccccaac
      301 cggacgtggc caacacagca ttcagtgttc caccactgag cttgagagaa aacagggagg
      361 ccaaggaacc gctaacaggg gcaatattag agatgtggga tggagagata taccactatg
      421 gcctgtacgt ggagaaaggc ttagtgttgg gtgtgcacaa accacctgca gccataagca
      481 tggcaagagt ggagctgacg ccgctgtcat tgtactggcg tgtggtgtac actccccaat
      541 acctcatctc ccctgaaact ctcaggaggc tcaacggaga ggcgttccct tacaccgcct
      601 tcgacaacaa ctgctacgcc ttttgctgct gggtgttaga cctcaatgac tcatggctta
      661 gcaggaggat ggtgcaaaga acaacgggct tcttcagacc ttaccaagag tggaacagaa
      721 agcccctgcc taccatggat gactccaaaa ttaagaaggt agcaaatata ttcctatgtt
      781 cattgtccac attattcacc agacccataa aagacctcat agggaaaatt aaaccattaa
      841 acatattgaa catcctggca acgtgtgact ggacgtttgc cggaatagtg gagtctctga
      901 tattacttgc tgaactcttc ggagttttct ggacgccccc agatgtgtct gctatgatcg
      961 ctcccttact cggggactac gagttgcaag ggccagaaga cctcgccgtt gaactcgtac
     1021 ctgtggtaat gggagggatt ggtttggtgt tgggattcac caaagagaaa attggcaaaa
     1081 tgttgtcctc agcagcatca acactcaggg cttgcaaaga tcttggtgcc tatggcttag
     1141 agatactcaa gttggtcatg aagtggttct tcccaaagaa agaggaggcc aatgagctag
     1201 ccatggtgag ggccatagag gatgccgtgc tagatcttga ggcaatagaa aataaccaca
     1261 tgacaaccct gttgaaagac aaagacagct tagcaacata catgaaaaca ctggacatgg
     1321 aggaggagaa agccagaagg ttgtccacaa aatctgcatc ccctgacata gttgggacaa
     1381 tcaacgccct gctggctcga atagcagcgg ccaggtcatt agtccacagg gccaaggaag
     1441 agctatctag caggataagg ccagtagttg ttatgatatc tggcaaacca ggaataggca
     1501 aaactcatct ggccagggag gtggcaagaa aggtggcatc cactctcaca ggggaccaaa
     1561 gagtcggact cataccaaga aacggtgtgg accattggga tgcatacaaa ggtgagagag
     1621 tcgtgctgtg ggacgactat ggcatgagta accccatcca tgatgctctt cgcatacaag
     1681 aattggctga tacgtgtccc cttaccttaa attgtgacag aattgaaaat aagggaaaag
     1741 tttttgacag tgaagtcata ataattacaa caaaccttgc caatccagcc ccacttgatt
     1801 atgtcaactt tgaggcctgt tccaggagaa ttgatttcct ggtgtacgct gaggcaccag
     1861 aagtagaaaa ggcaaaacgg gactttcctg gtcagccaga tatgtggaag gacgccttca
     1921 agccggactt ttcacacatc aagctacagc ttgcacctca gggcggcttt gacaagaatg
     1981 gcaacacccc acatgggaaa ggagtgatga agaccctcac taccggttct ctgattgccc
     2041 gtgcatcagg cctactacat gagaggatgg atgaatttga actccaaggt cccacaatca
     2101 ccaccttcaa tttcgaccga aacagaatca cagcattcag acaattggct gcagaaaaca
     2161 agtatggatt ggtggatacc atgaaagttg gcaatcaatt aaaaggagtg aaaaccatgg
     2221 aagaactcaa acaagcaatc agaaatgtga ccatcaagag gtgccggatc atctacggtg
     2281 gctccacgta tgaccttgaa tctgatggca agggcaaagt tttggtggaa aaggtcaaga
     2341 acacctctgt acagaccaac aacgagttgg ccggggccct gcaccatctc aaacacgccc
     2401 gaatcaggta ctatgtcaaa tgtgtgcaag aagcagtcta ttccatcata caaattgccg
     2461 gcgctgcgtt tgtcaccacg cgcattgcac gccgcatgaa catacaagaa ctctggtcga
     2521 agccacaatt agatcaaaat gaatcagaga ctaaggaaga ggcccccaaa tcagaagatg
     2581 acgagttcat catatcttct aaggacatca aggaggaagg aaagaagggc aaaaacaaaa
     2641 ctggccgtgg caagaaacac actgcattct ccagcaaggg cttgagcgat gaggagtatg
     2701 acgagtacaa gaggataaga gaagagagaa atgggaagta ctctatagag gagtatcttc
     2761 aagacagaga caggtactat gaggagctcg ccattgccaa ggccacggaa gaagacttct
     2821 gtgaagagga ggagataaaa atccgtcaga gaattttccg tcccaccagg aaacaaagaa
     2881 aggaagagag ggccacatta ggactggtaa caggttcaga aatcagaaaa agaaaccctg
     2941 atgacttcaa acccaaaggg aagctgtggg ccgatgacaa cagaagtgtt gactataatg
     3001 agaaactgga ctttgaggcc cccccaagca tatggtctag gattgtgagc tttggttctg
     3061 gctggggctt ctgggtatca ccaagcctgt tcataacatc aactcatgta atccccgcag
     3121 gcataacaga agcatttgga gtccccatca aacaaattca gatccacaaa tcaggtgaat
     3181 tttgccgatt cagattccca agaccaatta gaccagacgt gacaggaatg atcttggaag
     3241 aaggtgcgcc tgaaggcacc gtggcaactg tgctcatcaa acgccccacc ggagagctca
     3301 tgcctcttgc agccagaatg ggaacacacg caaccatgaa aattcaaggc cgcatggttg
     3361 gcggacagat gggtatgttg ctcactggat caaatgctaa aggaatggat ttgggaacaa
     3421 ctcctggtga ctgtggctgt ccttacatct acaaaagggg caatgactat atagtcattg
     3481 gggtgcacac tgcagcagcc cgtggtggaa acaccgtcat ctgtgccaca cagggaagtg
     3541 agggtgaggc aactcttgag ggtggatatg acaaaggaac atactgtggg gcacccattc
     3601 taggccctgg gggtgcacca aagttgagca ccaaaaccaa attttggagg tcatcgaaca
     3661 cgcccctccc accagggaca tatgagcctg cctacctcgg tggccgtgat ccgcgtgtta
     3721 agggtgggcc ctccttgcag caggtaatga gagaccagtt gaagccattc actgaaccca
     3781 ggggcaaacc tccaagacca agtgtattgg aagcagccaa acaaaccgtt atcaatgtcc
     3841 tcgaacaaac cctggatcct ccacaaaaat ggacatacgc acaggcgtgt gcctcacttg
     3901 acaaaaccac ttccagcggg catcctcatc acgtccgaaa gaatgaattc tggaatggtg
     3961 agaccttcac cggcaaattg gcagaccaag catcaaaagc aaacctaatg tttgaggaag
     4021 ggaaacacat gacaccagtg tatacagcag cactcaagga cgagctagtc aagactgaga
     4081 aaatctatag aaagatcaag aagagactgc tctggggctc tgacttgtcc accatgatcc
     4141 ggtgcgctag gtcatttggt gggctcatgg acgagatgaa ggcacactgc atatcactcc
     4201 cagtacgagt tggcatgaat gtgaatgaag atggcccaat aatatttgag aaacattcca
     4261 gatacaaata ccactatgac gcagactact ctcgttggga ttcaacacaa cagagggcag
     4321 tactagcagc agccttggaa atcatggtca gattctctgc agaaccacaa ttggcacaaa
     4381 tagtcgctga ggatcttctg gcccctagcg tagtagatgt aggagacttt aaaatcacta
     4441 taaatgaagg gctcccatct ggtgtgccat gcacctccca atggaactcc atcgcacact
     4501 ggctgctaac tctctgtgcc ttgtctgaag tcaccaaact gtcccctgac attatacaag
     4561 caaattccat gttctcattt tacggtgatg acgagattgt cagcaccgac ataaaattgg
     4621 accctgaaca gttaaccgcc aagttgaagg agtatggcct gaaaccaacc cgcccagaca
     4681 agaccgaggg acccctgatc atcagtgaag atttgaacgg actcactttc ctccgaagga
     4741 cggtgactcg tgacccagct ggctggtttg gaaaactgga ccaaagttca attttgaggc
     4801 agatgtactg gactagagga ccaaatcacg aagatcccaa tgagacaatg ataccccatt
     4861 ctcaaagacc catacagctc atggcactgc ttggtgaagc ctctcttcac ggaccctctt
     4921 tctacagtag aatcagtaaa ttggtcataa ctgaactcaa agaaggtggg atggactttt
     4981 acgtgccaag gcaggaaccc atgttcaggt ggatgaggtt ttctgacttg agcacgtggg
     5041 agggcgatcg caatctggct cccaattttg tgaatgaaga tggcgtcgag tgacgccaac
     5101 ccatctgatg ggtccgcagc caacctcgta ccagaggtca acaatgaggt tatggctttg
     5161 gagcccgttg tcggtgccgc tattgcggca cctgtagcgg gccaacaaaa tgtaattgac
     5221 ccctggatta gaaataattt tgtacaagcc cctggtgggg agtttacagt atcccccaga
     5281 aacgctccag gtgaaatact atggagcgcg cccctaggcc ctgacctaaa tccctaccta
     5341 tcccatttgg ccagaatgta caatggttat gcaggtggtt ttgaagtgca ggtaattctc
     5401 gcggggaacg cgttcaccgc cgggaagatc atatttgcag cagtcccacc aaattttcca
     5461 actgaaggct taagtcctag ccaggtcact atgttccccc atataatagt agatgttaga
     5521 cagttagaac ctgtgctgat tcctttaccc gatgttagga ataatttcta tcattacaat
     5581 cagtcaaatg actccactat taagttgata gcaatgttgt acacaccact tagggctaat
     5641 aatgctgggg atgatgtttt cacagtttcg tgccgagttc tcacgagacc atcccccgat
     5701 tttgatttca tatttttagt gccacccaca gttgagtcaa gaactaaacc attctctgtc
     5761 ccagttttaa ctgttgagga gatgaccaat tcaagattcc ccatcccttt ggaaaagctg
     5821 ttcacaggcc ccagcagtgc ctttgttgtt caaccacaaa acggcaggtg cacaactgat
     5881 ggcgtgctcc taggcaccac ccaactttct cctgtcaaca tctgcacctt cagaggggat
     5941 gtcacccaca tcacaggtag tcgcaactac acaatgaatt tggcttctca aaattggaac
     6001 aactatgacc caacagaaga aatcccagcc cctctaggaa ctccagattt tgtggggaag
     6061 attcaaggca tgctcaccca aaccacaagg acagatggtt caacacgcgg ccacaaagct
     6121 acagtgtaca ctgggagcgc cgactttgct ccaaaactgg gtagagttca atttgaaact
     6181 gacacagacc atgattttga agctaatcaa aacacaaagt tcaccccagt cggtgtcatc
     6241 caagatggta gcaccaccca tcgaaacgaa ccccaacagt gggtgctccc aagttactca
     6301 ggcagaaata ctcacaatgt acatctggcc cccgctgtag cccccacctt tccgggtgag
     6361 caacttctct tcttcagatc caccatgccc ggatgcagcg ggtaccccaa catggatttg
     6421 gactgtctgc tcccccagga atgggtgcag tacttctatc aagaggcagc cccagcacaa
     6481 tctgatgtgg ctctgctaag atttgtgaat ccagacacag gtagggtttt gtttgagtgc
     6541 aagcttcaca aatcaggcta tgttacagtg gctcacactg gccaacatga tttggttatc
     6601 ccccccaatg gctactttag atttgattcc tgggtcaacc agttctatac gcttgccccc
     6661 atgggaaatg gaacggggcg tagacgtgta gtataatggc tggagctttc tttgctggat
     6721 tggcatctga tgtccttggc tctggacttg gttccctcat caatgctggg gctggggcca
     6781 tcaaccaaaa agttgagttt gaaaataaca gaaaattgca acaagcatcc ttccaattta
     6841 gcagcaatct gcaacaggct tcctttcaac atgataaaga gatgctccaa gcacaaattg
     6901 aggccactaa aaagctacaa caggaaatga tgaaagttaa gcaggcaatg ctcctagagg
     6961 gtgggttctc tgagacagat gcagcccgcg gggcaattaa cgcccccatg acaaaagctt
     7021 tggactggag tgggacaagg tactgggctc ccgatgctag gactacaaca tacaatgcag
     7081 gccgcttttc cacccctcaa ccatcggggg cactgccagg aagagctaat cttagggatg
     7141 ctgtccctgc tcggggaccc tccaacaaat cttctaactc ttctactgcc acctctgtgt
     7201 attcaaatca aactatttca acgagacttg gttctacagc tggttctgga accagtgtct
     7261 cgagcctccc gtcaactgca aggactagga gctgggttga ggatcaaaat aggaatttgt
     7321 cacctttcat gaggggggcc cacaacatat catttgtcac cccaccatct agcagatcct
     7381 ctagccaagg cacagtctca accgtgccta aagagatttt ggactcctgg actggcgctt
     7441 tcaacacgcg caggcagcct ctcttcgctc acattcgtaa gcgaggggag tcacgggtgt
     7501 aatgtgaaaa gacaaaattg attatttttc tttttcttta gtgtctttta aaaaaa
//
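
The CDS coordinates in the feature table above can be checked against the translations with a little arithmetic. The sketch below is illustrative only and is not part of the GenBank record. It copies the three CDS spans and their /codon_start values by hand, and it assumes that each span is 1-based and inclusive and that each ends with a stop codon. The names `CDS`, `protein_length` and `overlap` are hypothetical helpers introduced here.

```python
# Coordinates copied by hand from the feature table (1-based, inclusive).
# codon_start is the 1-based offset of the first complete codon in the span;
# ORF1 is 5'-partial, so its reading frame starts at base 3.
CDS = {
    "ORF1": {"start": 1, "end": 5093, "codon_start": 3},
    "ORF2": {"start": 5074, "end": 6696, "codon_start": 1},
    "ORF3": {"start": 6696, "end": 7502, "codon_start": 1},
}


def protein_length(start, end, codon_start):
    """Amino acids encoded by the span, excluding the terminal stop codon."""
    first_coding_base = start + codon_start - 1
    coding_nt = end - first_coding_base + 1
    if coding_nt % 3 != 0:
        raise ValueError("coding span is not a whole number of codons")
    return coding_nt // 3 - 1  # assumes the last codon is a stop


def overlap(a, b):
    """Number of nucleotides shared by two inclusive intervals."""
    return max(0, min(a["end"], b["end"]) - max(a["start"], b["start"]) + 1)


lengths = {name: protein_length(**feat) for name, feat in CDS.items()}
orf1_orf2_overlap = overlap(CDS["ORF1"], CDS["ORF2"])
orf2_orf3_overlap = overlap(CDS["ORF2"], CDS["ORF3"])

for name, n in lengths.items():
    print(f"{name}: {n} aa")
print(f"ORF1/ORF2 overlap: {orf1_orf2_overlap} nt")
print(f"ORF2/ORF3 overlap: {orf2_orf3_overlap} nt")
```

Under those assumptions the computed lengths (1696, 540 and 268 residues) agree with the three /translation strings, ORF1 and ORF2 share 20 nt (5074..5093), and ORF2 and ORF3 share the single base at 6696.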