Typing tool

Complete norovirus genomes

MW559991  GII.4 | GII.P31

Length: 7,443 | 3 CDS

ORF1: 1..5100
ORF2: 5081..6703
ORF3: 6703..>7443 (partial)
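
The ORF coordinates above can be sanity-checked with a little arithmetic. The following is a minimal sketch, not part of the typing tool: the names `ORFS`, `orf_length`, `reading_frame`, `overlap` and `summarize` are hypothetical and introduced only for illustration. It assumes the coordinates are 1-based and inclusive, as in GenBank feature locations, and treats the ORF3 end (7443) as the last sequenced base of a partial CDS.

```python
# Illustrative only: coordinates copied from the summary, 1-based inclusive.
ORFS = {
    "ORF1": (1, 5100),
    "ORF2": (5081, 6703),
    "ORF3": (6703, 7443),  # partial at the 3' end in the GenBank record
}


def orf_length(start, end):
    """Length in nucleotides of a 1-based inclusive interval."""
    return end - start + 1


def reading_frame(start):
    """0-based reading frame of an ORF relative to genome position 1."""
    return (start - 1) % 3


def overlap(a, b):
    """Number of nucleotides shared by two 1-based inclusive intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


def summarize(orfs):
    """Per-ORF length, codon count, frame, and whether length is a multiple of 3."""
    rows = {}
    for name, (start, end) in orfs.items():
        nt = orf_length(start, end)
        rows[name] = {
            "nt": nt,
            "codons": nt // 3,
            "multiple_of_3": nt % 3 == 0,
            "frame": reading_frame(start),
        }
    return rows


if __name__ == "__main__":
    for name, row in summarize(ORFS).items():
        print(name, row)
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

Under these assumptions the sketch gives 5100, 1623 and 741 nt (1700, 541 and 247 codons) for ORF1, ORF2 and ORF3, a 20 nt ORF1/ORF2 overlap and a 1 nt ORF2/ORF3 overlap. The ORF1 and ORF2 codon counts include the stop codon, which is consistent with translations of 1699 and 540 residues; ORF3 has no stop codon within the sequenced region, so its 247 codons correspond to 247 residues.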
LOCUS       MW559991                7443 bp    RNA     linear   VRL 22-FEB-2021
DEFINITION  Norovirus GII isolate Hu/GII.4/P31/003/2019/KEN nonstructural
            polyprotein (ORF1) and VP1 (ORF2) genes, complete cds; and VP2
            (ORF3) gene, partial cds.
ACCESSION   MW559991
VERSION     MW559991.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7443)
  AUTHORS   Lambisia,A.W., de Laurent,Z., Nokes,J.D. and Agoti,C.N.
  TITLE     Near complete genomes of five human norovirus GII.4 recovered from
            diarrheal stool samples of hospitalized children in coastal Kenya
            in 2019
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7443)
  AUTHORS   Lambisia,A.W., de Laurent,Z., Nokes,J.D. and Agoti,C.N.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-FEB-2021) Bioscience, KEMRI WELLCOME TRUST, KILIFI,
            KILIFI, KILIFI 230-80108, Kenya
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. v3.13.2
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7443
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /strain="KLF_NOV_003"
                     /isolate="Hu/GII.4/P31/003/2019/KEN"
                     /isolation_source="human stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Kenya"
                     /collection_date="09-Mar-2019"
                     /note="genotype: GII.4"
     gene            1..5100
                     /gene="ORF1"
     CDS             1..5100
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QRF49930.1"
                     /translation="MKMASNDASAAVAANSNNDIAKSSSDGVFSNMAVTFKRALGARP
                     KQPPPKEIPPKPPRPPTPELVRKIPPPPPNGEDELVVSYSAKDGISGLPELTTVSQPE
                     EINTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
                     LAKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
                     WLNRRMIQRTTGFFRPYQEWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
                     LKPLNILNILATCDWTFPGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
                     LAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
                     KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
                     KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLAREL
                     AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
                     PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
                     AKRDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
                     SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVRTM
                     PDLKQALKNVAIKKCQIVYNGSTYTLEADGKGSVKVDKVQNATVQTNNELAGALHHLR
                     CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETASKDGC
                     PKPKDDEEFVVSSDDIKAEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
                     KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
                     TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
                     SLFITSTHVIPQGTKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
                     TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
                     CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILG
                     PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
                     RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHVRKNDCW
                     NGDSFTGRLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL
                     ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD
                     STQQRDVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
                     SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
                     EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
                     NHEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
                     PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     1..990
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     991..2088
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2089..2625
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2626..3024
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3025..3567
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3568..5097
                     /gene="ORF1"
                     /product="RdRp"
     gene            5081..6703
                     /gene="ORF2"
     CDS             5081..6703
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QRF49931.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSIPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRVV"
     gene            6703..>7443
                     /gene="ORF3"
     CDS             6703..>7443
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QRF49932.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKDMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNTGRFSTPQLSGASSGRANIRDAVPARGSSNKTS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SYVTPPSSRSSSQGTVSTVPKEVLDSWTG"
ORIGIN      
        1 atgaagatgg cgtctaacga cgcttccgct gccgttgctg ccaacagcaa caacgacatc
       61 gcaaaatctt caagtgacgg tgtgttttcc aacatggctg tcacttttaa acgggccctc
      121 ggggcgcggc ctaaacagcc gcccccgaag gagataccac ccaaaccccc acgaccaccc
      181 acgccagaat tggtcaggaa gatccctcct cccccaccca acggggagga tgaactagtg
      241 gtttcttaca gtgccaaaga tggcatttcc ggattgcctg agctcaccac tgtcagccaa
      301 ccggaagaaa tcaacacggc gttcagtgtc cccccgctca atcaaaggga gaacagggac
      361 gccaaggaac cactgactgg aacaattatc gaaatgtggg atggagaaat ctaccattac
      421 ggcctgtatg tggaacgggg tcttatactt ggtgtgcata agccaccggc agccatcagc
      481 cttgccaagg tcgagctaac accgctctct ttgttctgga gacctgtgta tacccctcag
      541 tacctcatct ctccagacac tcttaggaga ctacatggag aatcattccc ctacaccgca
      601 tttgacaaca attgttacgc cttctgctgt tgggtattag acctaaacga ctcatggtta
      661 aatagaagaa tgattcagag aacaacgggt ttcttcagac cataccagga atggaacagg
      721 aagcccctcc ccactatgga tgattccaag ttaaagaagg tagccaacat attcttgtgc
      781 accttgtctt cactattcac cagacccatt aaggacataa tagggaaatt gaaacctctt
      841 aacatcctca acattctggc tacatgtgac tggaccttcc caggcatagt ggaatcccta
      901 atactcttgg cagaactctt tggagttttc tggacacccc cagatgtgtc tgcgatgatc
      961 gcccccttac taggtgatta tgaactgcaa ggacctgagg accttgcagt agaactggtc
     1021 ccagtggtga tgggggggat aggcttggtg ctaggattta ccaaagagaa aattggaaag
     1081 atgttgtcgt ccgctgcatc cactttgaga gcttgcaaag acctgggtgc atacggactg
     1141 gaaatcttaa agttagttat gaaatggttc ttcccgaaga aagaggaagc aaatgaactg
     1201 gccatggtga ggtccattga agatgcagta ctagacctcg aggcaattga aaacaaccac
     1261 atgaccaccc tgctcaagga caaggatagc ttggcaacct acatgagaac cctcgacctt
     1321 gaggaggaga aagccagaaa actctcaacc aaatctgctt cgcctgatat cgtgggcaca
     1381 atcaactctc ttctggcacg aatcgccgct gcgcgctccc tagtgcatcg agcgaaagag
     1441 gagctctcca gcaggccgag acctgtcgtt gtgatgatat cgggaaaacc aggaataggg
     1501 aaaactcatc ttgccaggga gttggccaag aagatcgcag cctctctcac aggtgaccag
     1561 cgtgtaggtc taatcccacg caatggcgtc gaccactggg acgcatacaa gggtgaaaga
     1621 gttgtcctat gggacgacta tgggatgagc aaccccatac atgatgccct caggctacag
     1681 gaacttgctg acacttgccc cctcacgcta aactgtgaca ggattgagaa caaaggaaaa
     1741 gtctttgaca gtgacgccat aatcatcacc accaatctgg ccaacccagc accactggat
     1801 tatgtcaatt ttgaggcgtg ctcgaggcgc attgacttcc tcgtgtacgc ggaagctcct
     1861 gaggtggaaa aggcaaaacg tgacttccca ggccaacctg acatgtggaa gaacgctttc
     1921 agtcctgact tctcacacat aaaactggca ttggctccac agggtggttt tgacaagaac
     1981 ggcaacaccc cgcatggaaa aggtgtcatg aagaccctca caactggctc cctcatcgcc
     2041 cgagcatcag gactactcca tgagaggcta gatgaatatg aattacaagg cccagccctc
     2101 accactttca attttgaccg caacaagata cttgctttta gacagcttgc tgctgaaaac
     2161 aagtatgggc tgatggacac aatgagagtt ggaaaacagc tcaaggatgt caggaccatg
     2221 ccggacctca aacaagcact caagaatgtc gcgattaaga agtgccagat agtgtacaat
     2281 ggtagcacct atacgcttga agccgatggc aagggtagtg tgaaagttga caaagtgcaa
     2341 aatgccaccg tgcagaccaa caatgaactg gccggcgccc tgcaccacct aaggtgcgcc
     2401 agaatcaggt actatgttaa gtgtgtccag gaggcactgt attccatcat tcaaatcgct
     2461 ggggctgcgt tcgtcaccac gcgcatcgcc aagcgcatga acatacaaaa tctctggtcc
     2521 aagccacagg tggaagacac agaagagacg gccagcaaag atggttgccc taagcccaaa
     2581 gatgatgaag agttcgtcgt ttcatccgac gacatcaaag ctgagggcaa gaaagggaag
     2641 aacaagtccg gccgtggcaa aaagcacaca gccttctcaa gcaaaggact cagtgatgag
     2701 gagtacgatg agtacaagag aatcagggag gaaaggaatg gcaagtactc catagaagag
     2761 tacctccagg acagagacaa gtactatgag gaggtggcca ttgccagggc aactgaagag
     2821 gacttctgtg aagaagaaga ggccaaaatc cggcagagaa tttttagacc aacaagaaaa
     2881 caacgtaagg aagaaagggc ctctctaggc ttggtcacag gttcagaaat caggaagaga
     2941 aacccagagg acttcaaacc caagggaaag ctgtgggccg atgatgacag aagtgttgac
     3001 tacaacgaga aactcaactt tgaggcccca ccaagcatct ggtcgcggat agtcaacttt
     3061 ggttcaggtt ggggcttctg ggtctccccc agtttgttta taacatcaac ccatgtcata
     3121 ccccaaggta caaaggagtt cttcggggtc cccatcaaac aaatccagat acacaaatca
     3181 ggtgaattct gtcggctgag attcccaaaa ccaatcagaa ctgatgtgac gggcatgatt
     3241 ctagaagaag gtgcgccaga gggaaccgtg gccacactgc tcatcaagag gccaactgga
     3301 gagcttatgc ctttggcagc cagaatggga actcacgcaa ccatgaaaat tcaggggcgc
     3361 acagttggag gacaaatggg tatgctcttg acagggtcca acgccaaaag tatggacttg
     3421 ggcacaacac caggcgactg cggctgtccc tacatctata aaagagggaa cgactatgtg
     3481 gtcataggag tccatactgc tgctgcccgt ggaggaaaca ctgtcatctg tgccacccag
     3541 ggtagtgagg gggaagccac ccttgaagga ggtgacaaca aaggaacgta ctgtggtgca
     3601 ccaatcttgg gtccagggag tgctccgaaa cttagcacca agactaagtt ttggaggtca
     3661 tccacaacgc cactcccacc aggcacctac gaaccagcct acctcggcgg caaggatccc
     3721 agagtcaaag gtggcccttc attgcaacaa gtcatgaggg accagctaaa gccattcaca
     3781 gaacccagag gcaaaccacc aagaccaaat gtgttggaag ccgccaagaa aaccatcatt
     3841 aatgtccttg agcaaacaat tgacccaccc caaaaatggt cattcgcgca agcttgcgca
     3901 tcccttgaca aaaccacctc cagcggccat ccgcaccacg tgcggaaaaa cgattgctgg
     3961 aatggggact cctttacagg aagattggca gatcaggcct ccaaggccaa cctaatgttt
     4021 gaggagggaa agaacatgac tccagtctac acaggtgcac ttaaagatga attggtaaag
     4081 actgacaaga tttatggtaa gatcaagaag aggcttttgt ggggctcgga cttggcgacc
     4141 atgatacggt gcgcccgggc ttttggaggc cttatggatg agctcaaggc acactgtgtc
     4201 acccttccag tcagagttgg catgaacatg aatgaagatg gccccataat ctttgagaag
     4261 cactccagat acaagtatca ctatgatgct gattactcca ggtgggactc aacacaacaa
     4321 agggatgtgc tagcagcagc actagaaatc atggttaagt tctctccaga accacacttg
     4381 gcccaggtgg ttgcagaaga cctcctttcc cctagtgtaa tggacgtagg tgactttcag
     4441 atatcaataa gtgaggggct cccctccggg gtgccttgca cttcccagtg gaattccatc
     4501 gcccactggc tcctcaccct ttgtgcgctc tctgaagtca cggacctgtc ccctgatatc
     4561 attcaggcca attccctttt ctccttctac ggtgatgatg agattgtgag tacagacata
     4621 aaattggacc cagagaagct gacagcaaaa ctcaaggagt acggattgaa gccgacccgt
     4681 cccgacaaaa ctgaaggacc ccttgttatc tctgaagatt tggatggcct gacattcctc
     4741 cggaggactg tgacccgtga cccagctggc tggtttggaa aattggaaca aagctcaatc
     4801 ctcaggcaaa tgtactggac caggggtccc aaccatgaag atccatctga aacaatgata
     4861 ccacactccc aaagacccat acaattgatg tccttgctag gcgaggctgc actccacggc
     4921 ccagcatttt atagcaaaat tagcaaattg gttattgcag agttgaagga aggtggcatg
     4981 gatttctacg tgcccagaca agagccaatg ttcagatgga tgagattctc agatctgagc
     5041 acgtgggagg gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgagtga
     5101 cgccaaccca tctgatgggt ccgcagccaa cctcgtccca gaggtcaaca atgaggttat
     5161 ggctctggag cccgttgttg gtgccgccat agcggcacct gtagcgggcc agcaaaatgt
     5221 aattgacccc tggattagaa acaattttgt acaagcccct ggtggagagt ttacagtatc
     5281 tcctagaaac gctccaggtg aaatactatg gagcgcgccc ttaggccctg atttaaatcc
     5341 ctacctatcc catttggcca gaatgtacaa tggttatgca ggtggctttg aagtgcaggt
     5401 aattctcgcg gggaacgcgt tcaccgccgg gaaggtcata tttgcagcag tcccaccaaa
     5461 tttcccaact gaaggcttaa gtcccagcca ggtcaccatg ttcccccaca tagtagtaga
     5521 tgttaggcaa ctagaacctg tgttgattcc cttacccgat gttaggaaca atttttacca
     5581 ctacaatcaa tcaaatgacc ccactatcaa gttgatagca atgttgtata caccacttag
     5641 ggctaataat gctggggatg atgtcttcac agtctcttgc cgggtcctca cgagaccatc
     5701 ccccgatttt gatttcatat ttttggtgcc acccacagtt gagtcaagaa ctaagccatt
     5761 ctctatccca gttttaactg ttgaggagat gaccaattca agattcccca tccctttgga
     5821 gaagttattc acgggtccca gcagtgcctt tgttgttcaa ccacaaaacg gcaggtgcac
     5881 gactgatggc gtgctcctag gcaccaccca actgtctcct gtcaacatct gtaccttcag
     5941 aggggatgtc acccatatca caggcagtca taactacaca atgaatttgg cttctcaaaa
     6001 ttggaacaat tatgacccaa cagaagaaat cccagcccct ctaggaactc cagactttgt
     6061 ggggaagatc caaggcatgc tcacccaaac cacaaggaca gatggctcaa cacgcggcca
     6121 caaagccaca gtgtacaccg ggagcgccga cttcgctcca aaactgggta gagttcaatt
     6181 tgaaactgac acaaaccatg actttgaagc taaccaaaac acaaagttca ccccagtcgg
     6241 tgtcatccaa gatggtagca ccacccaccg aaatgaaccc caacagtggg tgctcccaag
     6301 ttattcaggc agaaacactc ataatgtaca tctggccccc gctgtggccc ccacttttcc
     6361 aggtgagcag cttctctttt tccgatccac catgcccgga tgcagcgggt accccaacat
     6421 ggatctggat tgtctcctcc cccaggaatg ggtgcagcac ttctaccaag aggcagcccc
     6481 agcacaatct gatgtggccc tgctgagatt tgtgaaccca gacacaggta gggttttgtt
     6541 cgagtgtaaa ctccacaaat caggttatgt tactgtggct cacactggcc aacatgattt
     6601 ggttatcccc cccaatggtt attttaggtt tgattcctgg gtcaaccagt tctacacgct
     6661 tgcccccatg ggaaatggag cggggcgtag acgtgtggta taatggctgg agctttcttt
     6721 gctggattgg catctgatgt ccttggctct ggacttggtt ccctcatcaa tgctggggct
     6781 ggggccatca accaaaaagt tgagtttgaa aataacagaa aattacaaca agcatccttc
     6841 caatttagca gcaatctgca acaggcctcc tttcaacatg acaaagatat gctccaagca
     6901 caaattgaag ccaccaaaaa gttacaacag gaaatgatga aggtcaagca ggcaatgctc
     6961 ctagagggtg ggttctctga gacagatgca gcccgcgggg caatcaacgc ccccatgaca
     7021 aaagctctgg actggagcgg gacaaggtac tgggcccccg atgctaggac tacaacatac
     7081 aatacaggcc gcttttccac ccctcaacta tcgggggcat cgtcaggaag agctaatatt
     7141 agggatgctg tccctgctcg gggttcctcc aataaaactt ctaattcttc tactgctact
     7201 tctgtgtatt cgaatcaaac tacttcaacg agacttggtt ctacagctgg ttctggcacc
     7261 agtgtctcga gcctcccgtc aactgcaagg actaggagct gggttgagga tcaaagtagg
     7321 aatttgtcac ctttcatgag gggggcccac aacatttcgt atgtcacccc accatctagt
     7381 agatcctcca gccaaggcac agtctcaacc gtgcctaaag aggttttgga ctcctggact
     7441 ggc
//