Complete norovirus genomes

MT742777  GII.4 Hong Kong | GII.P31

Length: 7,501 bp | 3 CDS

ORF1: <1..5070
ORF2: 5051..6673
ORF3: 6673..7476
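
As a quick sanity check on the ORF coordinates listed above, the following minimal Python sketch computes each ORF's length, its reading frame relative to position 1, and the overlap between neighbouring ORFs. The helper names (`orf_length`, `overlap`) are illustrative assumptions and not part of any typing tool. The coordinates are copied from this page, and ORF1 is treated as starting at position 1 even though the feature table in the record marks it as 5' partial (`<1..5070`).

```python
# ORF coordinates copied from the summary above (1-based, inclusive).
# Assumption: ORF1 is treated as starting at position 1, although the
# record's feature table flags it as 5' partial (<1..5070).
orfs = {
    "ORF1": (1, 5070),
    "ORF2": (5051, 6673),
    "ORF3": (6673, 7476),
}


def orf_length(start, end):
    """Length in nucleotides of a 1-based, inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Number of nucleotides shared by two 1-based, inclusive intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


for name, (start, end) in orfs.items():
    length = orf_length(start, end)
    frame = (start - 1) % 3  # frame relative to genome position 1
    print(f"{name}: {length} nt, {length // 3} codons, frame {frame}")

print("ORF1/ORF2 overlap:", overlap(orfs["ORF1"], orfs["ORF2"]), "nt")
print("ORF2/ORF3 overlap:", overlap(orfs["ORF2"], orfs["ORF3"]), "nt")
```

With these coordinates, ORF1 and ORF2 overlap by 20 nt and ORF2 and ORF3 share a single nucleotide (position 6673). All three lengths are multiples of three.
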
LOCUS       MT742777                7501 bp    RNA     linear   VRL 30-DEC-2020
DEFINITION  Norovirus GII isolate WT-NORO-0887 nonstructural polyprotein (ORF1)
            gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
            cds.
ACCESSION   MT742777
VERSION     MT742777.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7501)
  AUTHORS   Chan,M.C., Roy,S., Bonifacio,J., Zhang,L.-Y., Chhabra,P.,
            Chan,J.C., Celma,C., Igoy,M.A., Lau,S.-L., Mohammad,K.N., Vinje,J.,
            Vennema,H., Breuer,J., Koopmans,M. and deGraaf,M.
  TITLE     Detection of a New Norovirus GII.4 Variant, GII.4 Hong Kong, in
            Asia and Europe, 2017-2019
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7501)
  AUTHORS   Roy,S., Williams,R.J., Tutill,H.J., Sheth,S., Celma,C., Allen,D.
            and Breuer,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JUL-2020) Infection & Immunity, UCL, Cruciform
            Building, Gower Street, London WC1E 6BT, United Kingdom
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics Workbench v. 11.01
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7501
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="WT-NORO-0887"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="United Kingdom"
                     /collection_date="Mar-2019"
                     /note="genotype: GII.P31_GII.4_Hong_Kong_2019"
     gene            <1..5070
                     /gene="ORF1"
     CDS             <1..5070
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QLG43138.1"
                     /translation="AAVAKSNNDIAKSSSDGVLSNMAVTFKRALGARPKQPPPKEIPP
                     RPPRPPTPELIKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVGQPDETNTAFSVPP
                     LNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELTPLS
                     LYWRPVYTPQYLISPDTLRRLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMISRT
                     TGFFRPYQDWNRKPLPTVDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNIL
                     ATCDWTFAGIVESLILMAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVVM
                     GGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEVNELAM
                     VRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGT
                     INALLARIAAARSLVHRAKEELSSRLRPVVVMISGKPGIGKTHLARELAKKIAASLNG
                     DQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIE
                     NKGKVFDSDAIIITTNLTNPAPLDYVNFEACSRRIDFLVYAEAPDVEKAKRDFPGQPD
                     MWKNAFSPDFSHLKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLVARASGLLHERLDE
                     YELQGPTLTTFNFDRNKILAFRQLAAENKYGLVDTMRIGKQLKDVKTMPDLKQALKNV
                     SIKKCQIVYSGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKC
                     VQEALYSIIQIAGAAFVTTRITKRMNIQNLWSRPQVEDTEETANKDGCPKPKDDEEFV
                     VASDDVKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQD
                     RDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTKKQRKEERASLGLVTGSEIRKRNP
                     EDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVI
                     PQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRP
                     TGELMPLAVRMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRG
                     NDYVVIGVHTAAARGGNTVICATQGNEGEATLEGGDNKGTYCGAPILGPGSAPKLSTK
                     TKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPSVL
                     EAAKKTIVNVLEQTIDPPQKWSFAQACASLDKTTSSGYPHHVRKNDCWNGDSFTGKLA
                     DQASKANLMYEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLATMIRCARAF
                     GGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAA
                     ALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWL
                     LTLCALSEVTDLSPDIVQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLRPTRPD
                     KTEGPLVISEDLDGLTFLRRTVTRDIAGWFGKLEQSSILRQMYWTKGPNHEDPSETMI
                     PHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSD
                     LSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..960
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     961..2058
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2059..2595
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2596..2994
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     2995..3537
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3538..5067
                     /gene="ORF1"
                     /product="RdRp"
     gene            5051..6673
                     /gene="ORF2"
     CDS             5051..6673
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QLG43139.1"
                     /translation="MKMASNDANPSDGSTANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNLIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLSPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHVIVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFTVPVLTVEEMTNSRFPIPLEKLYTGPSAAFVVQPQNGRCTTDGVLLGTT
                     QLSAVNICTFRGDLTHIAGTRQFTMHLASPNWNNYDPTEEIPAPLGTPDFVGKIQGML
                     TQTTKTDGSTRGHKAMVSTGAADFAPKLGNIRFGTDTENDLQSGINTKFTPIGVVQDG
                     ENPHFSEPQQWVLPSYTGRTGHNVHLAPAVAPTYPGEQLLFFRSTMPGLSGYPNLTID
                     CLLPQEWVRHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYITVAHTGPHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRVL"
     gene            6673..7476
                     /gene="ORF3"
     CDS             6673..7476
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QLG43140.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATQKLQQGMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARVTTYNAGRFSTSQPSGALPGRTNLRATLPTRGSSSVTS
                     ASSATSVYSNKTVSTRLGSSAGSGTSASSTPSTTRTRSWVEDQNRNLSPFMSGALNTS
                     FVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHVRRRGESRV"
ORIGIN      
        1 gccgctgttg ccaaaagcaa caacgacatt gcaaaatctt caagtgacgg tgtgttatct
       61 aacatggctg tcacttttaa acgggccctc ggggcgcggc ctaaacagcc gcccccgaag
      121 gaaataccac ccagaccccc acgtccaccc acaccagaat tgatcaaaaa gattcctccc
      181 cccccaccca acggggagga tgaactggtg gtttcctaca gtgccaaaga tggcgtgtct
      241 ggcctgcctg agctcactac tgtcggacaa ccagatgaga ccaacacggc gttcagtgtc
      301 cccccgctta accagaggga gaatagggac gctaaggaac cactaactgg aacaatcatt
      361 gaaatgtggg atggagaaat ataccattac ggcctgtacg tggaacgagg tcttatactt
      421 ggtgtgcaca agccaccggc ggccattagc cttgccaagg tcgagctaac accactctct
      481 ttgtactgga gacctgtata caccccccag tatctcatct ctccagacac tctcaggagg
      541 ctgcatggag agacattccc ctacactgca tttgacaaca actgctacgc cttctgttgt
      601 tgggtgttag acctaaacga ctcgtggcta agcaggagaa tgattagcag aacaacaggt
      661 ttcttcaggc cataccagga ttggaacagg aaacccctcc ccactgtgga tgattccaaa
      721 ttgaaaaagg tagccaacat attcttgtgc actttgtctt cactattcac cagacctatt
      781 aaggacataa tagggaagct gaaacccctt aacatcctca atattctggc cacatgcgac
      841 tggaccttcg caggcatagt ggagtcctta atactcatgg cagaactctt tggagttttc
      901 tggacacccc cagatgtgtc tgcgatgatc gcccccttac taggtgatta tgaactgcaa
      961 ggacctgagg accttgcagt agaactagtc ccagtggtaa tgggggggat aggtttggtg
     1021 ctaggattta ccaaagagaa aattggaaag atgttgtcat ctgctgcatc cactttgaga
     1081 gcttgcaaag accttggtgc atacggactg gaaattttaa aattggtcat gaaatggttc
     1141 ttcccaaaga aagaggaagt aaatgaattg gctatggtga gatccatcga ggacgcagta
     1201 ctggacctcg aggcaattga aaataaccac atgaccgcac tgctcaagga caaagacagt
     1261 ctggcaacct atatgaggac cctcgacctt gaggaggaga aagccagaaa actctcaacc
     1321 aaatctgctt cacccgacat tgtgggcaca atcaacgctc ttctggcaag gatcgccgct
     1381 gcacgttccc tagtgcatcg ggcaaaagaa gagctctcta gcaggctgag acccgttgtt
     1441 gtgatgatat cggggaagcc agggataggg aagactcacc tcgccagaga gctagccaag
     1501 aagatcgcgg cctccctcaa tggggaccag cgtgtgggtc tcatcccgcg caatggcgtc
     1561 gatcactggg atgcgtataa gggagaaaga gtagttcttt gggacgatta tggaatgagt
     1621 aaccccatcc atgatgccct cagattgcag gaacttgctg acacctgtcc cctcacatta
     1681 aattgtgata ggattgagaa caaagggaaa gtctttgaca gtgatgctat aattattacc
     1741 accaacctga ccaacccagc accactggac tatgtcaatt ttgaagcgtg ttcgaggcgc
     1801 attgacttcc tcgtgtacgc agaagctcct gatgtggaaa aggcaaagcg cgacttccca
     1861 ggtcaacctg acatgtggaa gaacgctttc agccctgatt tctcgcacct gaaactgtca
     1921 ttggctccgc agggtggttt tgataagaac ggcaacaccc cgcatggaaa aggcgtcatg
     1981 aagaccctca ccactggctc cctcgtcgcc cgagcttcag gcttgctcca tgagaggcta
     2041 gacgagtatg aactacaagg cccaactctc accactttca actttgaccg taacaaaatc
     2101 cttgctttca ggcagcttgc cgccgaaaac aaatatgggc tggtggacac aatgaggatt
     2161 ggaaaacagc tcaaggatgt caagaccatg ccagacctca aacaggcact caagaatgtc
     2221 tctatcaaga agtgccagat agtgtacagt ggtggcacct acacacttga ggctgacggc
     2281 aagggcagtg tgaaagttga caaagtgcaa agcgccactg tgcaaaccaa caatgaacta
     2341 gctggtgccc tacaccacct aaggtgcgcc agaattaggt actatgtcaa gtgcgtccag
     2401 gaggcactgt attccatcat ccaaattgct ggggctgcat ttgtcaccac gcgcatcact
     2461 aagcgcatga acatacagaa tctctggtcc aggccacagg tggaagacac agaggagacg
     2521 gctaacaaag atggttgccc aaaacctaaa gatgatgaag agtttgtcgt cgcatccgac
     2581 gacgtcaaga ctgaaggcaa gaaagggaaa aacaagactg gccgtggtaa gaagcacaca
     2641 gccttctcaa gcaaagggct cagtgatgag gagtacgatg agtacaagag aatcagagaa
     2701 gaaaggaatg gcaagtattc catagaggag taccttcagg acagggacaa gtactatgag
     2761 gaggtggcta tcgcaagggc aaccgaagag gacttctgtg aagaagaaga ggccaaaatc
     2821 cggcagagaa tcttcagacc aacaaagaaa caacgcaaag aggagagggc ttctctcggt
     2881 ttggtcacag gctctgaaat taggaagaga aacccagaag atttcaaacc caaggggaaa
     2941 ctatgggctg acgatgacag aagtgttgac tacaatgaga aactcagctt tgaggcccca
     3001 ccaagcatct ggtcacggat agtcaacttt ggctcaggtt ggggcttctg ggtctccccc
     3061 agtctgttta taacctcaac ccatgtcata ccccaaggtg caaaagagtt ctttggagtc
     3121 cccatcaaac aaatccaaat acacaaatca ggtgaattct gccgattgag gttcccaaaa
     3181 ccaatcagaa ctgatgtgac gggcatgatt ctagaagaag gtgcgcccga aggaaccgtg
     3241 gccacacttc tcatcaagag accaactgga gaactcatgc ctttggcagt gagaatggga
     3301 acccatgcaa ccatgaagat tcaagggcgc acagttggag gacaaatggg catgctcttg
     3361 acaggatcca acgctaagag tatggacctg ggcacaacac caggtgactg cggctgtccc
     3421 tacatttaca agagggggaa tgattacgta gtcataggag tccacacggc cgctgcccgt
     3481 ggaggaaaca ctgtcatatg tgctacccag gggaatgaag gagaggccac actcgaaggt
     3541 ggcgacaata aaggaacata ctgtggtgca ccaatcttgg gcccagggag tgccccgaaa
     3601 ctcagcacca agaccaagtt ttggagatca tccacaacac ctctcccacc tggcacctat
     3661 gaaccagcct acctcggtgg caaggacccc agagtaaaag gtggtccctc gttgcaacaa
     3721 gttatgaggg accagctaaa gccatttaca gaacccagag gtaaaccacc aaggccaagt
     3781 gtgttggaag ctgccaagaa aaccattgtt aatgttcttg agcaaacaat tgacccaccc
     3841 caaaaatggt cattcgcgca ggcttgcgcg tccctcgaca aaaccacttc cagcggttat
     3901 ccgcaccatg tgcgaaagaa tgactgttgg aacggggatt ccttcacagg aaaattggca
     3961 gaccaggcct ctaaggccaa tctaatgtat gaagagggga agaacatgac tccagtctac
     4021 acaggtgcac ttaaggatga attggtaaag actgacaaaa tttatggtaa gatcaaaaag
     4081 aggcttctgt ggggttcaga cctggcgacc atgatacggt gcgcccgagc ttttggaggc
     4141 cttatggatg aactcaaggc acactgtgtc acccttcctg tcagagttgg catgaacatg
     4201 aatgaggatg gccccatcat ctttgagaaa cactctagat acagatacca ctatgatgct
     4261 gattattccc ggtgggactc aacacaacag agggatgtac tggcagcagc ccttgaaatc
     4321 atggttaaat tctctccaga accacacctg gcccaggtag ttgcagaaga cctcctttct
     4381 cccagtgtaa tggatgtggg tgactttcaa atatcaataa gcgagggact cccctccggg
     4441 gtgccttgca cctcccaatg gaattccatt gcccattggc ttctcactct ttgtgcacta
     4501 tctgaagtca cggacttatc ccctgatatt gttcaggcca attccctctt ctccttctat
     4561 ggtgatgatg agattgtgag cacagacata aagttggacc cagagaagct gacggcaaag
     4621 ctcaaggagt acgggctaag accaacccgc cccgacaaaa ctgaagggcc ccttgtcatc
     4681 tctgaagatc tggatggcct gacattcctg cggagaactg tgacccgtga catagcaggc
     4741 tggtttggca aattggaaca gagctcaatt ctcaggcaaa tgtactggac caagggcccc
     4801 aaccatgaag atccatctga aacaatgata ccacactccc aaagacccat acaattgatg
     4861 tctctactgg gcgaggctgc actccatggc ccggcattct atagcaaaat tagcaagttg
     4921 gtcattgcag agttgaagga aggtggcatg gatttttacg tgcccagaca agagccaatg
     4981 ttcagatgga tgagattctc agatctgagc acgtgggagg gcgatcgcaa tctggctccc
     5041 agttttgtga atgaagatgg cgtcgaatga cgccaaccca tctgatgggt ccacagccaa
     5101 cctcgtccca gaggtcaaca atgaggttat ggctttggag cctgttgtag gcgccgctat
     5161 tgcggcacct gtggcgggcc aacaaaactt aattgacccc tggattagaa ataattttgt
     5221 acaagcccct ggtggagagt tcacagtgtc ccccagaaac gctccaggtg aaatactatg
     5281 gagcgcgcct ttgagccctg atttgaaccc ttatctttcc catttggcca gaatgtacaa
     5341 tggttacgca ggtggttttg aggtgcaggt aatccttgcg gggaacgcgt tcaccgccgg
     5401 aaaaatcata tttgcagcag tcccaccaaa tttcccaact gaaggcttga gccccagcca
     5461 ggttactatg tttccccatg taatcgtaga tgttaggcaa ttagaacctg tgttgatccc
     5521 cttacctgat gttaggaata atttctatca ttataatcaa tcaaatgatt ctactattaa
     5581 attgatagca atgctgtata caccacttag ggctaataat gctggggatg atgtcttcac
     5641 agtttcttgt cgggtcctta cgaggccctc ccctgatttt gatttcatat ttctggtacc
     5701 accaacagtt gagtcaagaa ctaaaccttt tactgtccca gtcttaaccg ttgaggaaat
     5761 gaccaattca agattcccca ttcccttgga gaagctgtat acgggtccca gtgctgcttt
     5821 tgttgttcaa ccacaaaatg gcagatgcac gactgatggc gtgctcttgg gcactaccca
     5881 gttgtctgct gtcaacatct gcacctttag aggagatctc acacacattg caggcactcg
     5941 ccaattcaca atgcatttgg cctctccgaa ctggaacaac tatgacccaa cagaagaaat
     6001 cccagccccc ctgggaaccc cagacttcgt ggggaagatt caaggcatgc tcacccaaac
     6061 cacaaaaaca gatggctcga cccgcggcca caaagccatg gtgtccactg gggctgccga
     6121 ctttgcccca aaattaggca acattcgatt cggcactgac acagaaaatg atctccaatc
     6181 tggcataaac accaaattca ccccaatcgg cgttgtccaa gatggtgaaa acccccattt
     6241 tagtgaaccc caacaatggg tgcttccaag ttacacaggt agaactggac ataatgtgca
     6301 tttggcccct gctgttgccc ccacttaccc gggtgagcaa ctccttttct ttaggtccac
     6361 catgcccgga ctcagcgggt accccaactt gaccatagac tgcttgctcc cccaggaatg
     6421 ggtgcggcac ttctaccaag aagcagctcc agcacaatct gatgtggctt tgctaagatt
     6481 tgtgaatcca gacacaggta gggttttgtt tgagtgcaag ctccataaat caggatatat
     6541 tacagtagct cacactggtc ctcatgattt agttatcccc cccaatggtt attttagatt
     6601 tgattcctgg gtcaaccagt tttacacact tgcccccatg ggaaatggag cggggcgcag
     6661 acgtgtgtta taatggctgg agctttcttt gctggattgg catctgatgt ccttggctcc
     6721 ggacttggtt ccttaatcaa tgctggggct ggagctatca atcaaaaagt tgaatttgaa
     6781 aataatagaa aattgcaaca agcttctttc caatttagta gtaatctgca acaagcttcc
     6841 tttcagcatg acaaggagat gctccaagca caaattgagg ccactcaaaa attgcaacaa
     6901 ggtatgatga aggtcaagca ggcgatgctc ctggaaggtg gattctctga aacagatgca
     6961 gcccgtgggg caatcaacgc ccccatgaca aaggccttgg attggagtgg aacaagatac
     7021 tgggcacctg atgctagggt tacaacatat aatgcaggcc gcttttccac ctctcaacct
     7081 tcgggggcat tgccaggaag gactaacctc agggctactc tccccactcg gggatcttct
     7141 agtgtaactt ctgcttcttc tgctacttct gtgtattcaa ataaaacagt ttcaacgagg
     7201 cttggttctt cagcaggttc tggcaccagc gcctcaagca ccccgtcaac cacaaggact
     7261 aggagctggg ttgaggatca aaacaggaac ttgtcaccct tcatgagtgg ggctctcaac
     7321 acatcattcg tcaccccacc atctagcaga tcctccagtc aaggcacagt ctcaaccgtg
     7381 cctaaagaaa ttttggactc ctggactggc gctttcaaca cgcgcaggca gcctctcttc
     7441 gctcatgttc gtaggcgagg ggagtcacgg gtgtaatgtg aaaagacaaa attgattatc
     7501 t
//