Typing tool

Complete norovirus genomes

MW559994  GII.4
 GII.P31

Length: 7,445 | 3 CDS

ORF1: 1..5100
ORF2: 5081..6703
ORF3: 6703..7445
LOCUS       MW559994                7445 bp    RNA     linear   VRL 22-FEB-2021
DEFINITION  Norovirus GII isolate Hu/GII.4/P31/001/2019/KEN nonstructural
            polyprotein (ORF1) and VP1 (ORF2) genes, complete cds; and VP2
            (ORF3) gene, partial cds.
ACCESSION   MW559994
VERSION     MW559994.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7445)
  AUTHORS   Lambisia,A.W., de Laurent,Z., Nokes,J.D. and Agoti,C.N.
  TITLE     Near complete genomes of five human norovirus GII.4 recovered from
            diarrheal stool samples of hospitalized children in coastal Kenya
            in 2019
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7445)
  AUTHORS   Lambisia,A.W., de Laurent,Z., Nokes,J.D. and Agoti,C.N.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-FEB-2021) Bioscience, KEMRI WELLCOME TRUST, KILIFI,
            KILIFI, KILIFI 230-80108, Kenya
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. v3.13.2
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7445
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /strain="KLF_NOV_001"
                     /isolate="Hu/GII.4/P31/001/2019/KEN"
                     /isolation_source="human stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Kenya"
                     /collection_date="22-Feb-2019"
                     /note="genotype: GII.4"
     gene            1..5100
                     /gene="ORF1"
     CDS             1..5100
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QRF49939.1"
                     /translation="MKMASNDASAAVAANSNNDIAKSSSDGVFSNMAVTFKRALGARP
                     KQPPPKEIPPKPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGISGLPELTTVSQPK
                     EINTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYEERGLILGVHKPPAAIS
                     LAKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
                     WLSRRMIQRTTGFFRPYQEWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
                     LKPLNILNILATCDWTFPGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
                     LAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
                     KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDHEEEKARKLST
                     KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLAREL
                     AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
                     PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
                     AKRDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
                     SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVRTM
                     PDLKQALKNVAIKKCQIVYNGGTYTLEADGKGSVKVDKVQNATVQTNNELAGALHHLR
                     CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETASKDGC
                     PKPKDDEEFVVSSDDIKAEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
                     KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
                     TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
                     SLFITSTHVIPQGTKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
                     TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
                     CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILG
                     PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
                     RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
                     NGDSFTGRLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL
                     ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD
                     STQQRDVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
                     SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
                     EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
                     NHEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
                     PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     1..990
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     991..2088
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2089..2625
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2626..3024
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3025..3567
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3568..5097
                     /gene="ORF1"
                     /product="RdRp"
     gene            5081..6703
                     /gene="ORF2"
     CDS             5081..6703
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QRF49940.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNSYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSIPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRVV"
     gene            6703..>7445
                     /gene="ORF3"
     CDS             6703..>7445
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QRF49941.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKDMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNTGRFSTPQLSGAPPGRANLRDAVPARGSSNKTS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SYVTPPSSRPSSQGTVSTVPKEVLDSWTGA"
ORIGIN      
        1 atgaagatgg cgtctaacga cgcttccgct gccgttgctg ccaacagcaa caacgacatc
       61 gcaaaatctt caagtgacgg tgtgttttcc aacatggctg tcacttttaa acgggccctc
      121 ggggcgcggc ctaaacagcc gcccccgaag gagataccac ccaaaccccc acgaccaccc
      181 acaccagaat tggtcaaaaa gatccctcct cccccaccca acggggagga tgaactagtg
      241 gtttcttaca gtgccaaaga tggcatttcc ggattgcctg agctcaccac tgtcagccaa
      301 ccgaaagaaa ttaacacggc gttcagtgtc cccccgctca atcaaaggga gaatagggac
      361 gccaaggaac cactgactgg aacaattatt gaaatgtggg atggagaaat ctaccattac
      421 ggcctgtatg aggaacgggg tcttatactt ggtgtgcata agccgccggc agccatcagc
      481 cttgccaagg tcgagctaac accgctctcc ttgttctgga gacctgtgta cacccctcag
      541 tacctcatct ctccagacac tcttaggaga ctacatggag aatcattccc ctacaccgca
      601 tttgacaaca attgttacgc cttctgctgt tgggtattag acctaaacga ctcatggcta
      661 agtagaagaa tgattcaaag aacaacgggt ttcttcagac cataccagga atggaacagg
      721 aaacccctcc ccactatgga tgattccaag ttaaagaagg tagccaacat attcttgtgc
      781 accttgtctt cactattcac cagacccatt aaggacataa tagggaaatt gaaacctctt
      841 aacatcctca atattctggc tacatgtgac tggaccttcc caggcatagt ggaatcccta
      901 atactcttgg cagaactctt tggagttttc tggacacccc cagatgtgtc tgcgatgatc
      961 gcccccttac taggtgatta tgaactgcaa ggacctgagg accttgcagt agaactggtc
     1021 ccagtggtga tgggggggat aggcttggtg ctaggattca ccaaagagaa aattggaaag
     1081 atgttgtcgt ccgctgcatc cactttgaga gcttgcaaag acctgggtgc atacggactg
     1141 gaaatcttga agttagttat gaaatggttc ttcccgaaga aagaggaagc aaacgaactg
     1201 gctatggtga ggtccatcga agatgcagta ctagacctcg aggcaattga aaacaaccac
     1261 atgaccaccc tgctcaagga caaggatagc ttggcaacct acatgagaac cctcgaccat
     1321 gaggaggaga aagccagaaa actctcaact aaatctgctt cgcctgatat cgtgggcaca
     1381 atcaactctc ttctggcacg aatcgccgct gcgcgctccc tagtgcatcg agcgaaagag
     1441 gagctctcca gcaggccgag acctgtcgtt gtgatgatat cgggaaaacc aggaataggg
     1501 aaaactcatc ttgccaggga gttggccaag aagatcgcag cctctctcac aggggaccag
     1561 cgtgtaggtc taatcccacg caatggcgtc gaccactggg acgcatacaa gggtgaaaga
     1621 gttgtcctat gggatgacta tgggatgagc aaccccatac acgatgccct caggctacag
     1681 gagcttgctg acacttgccc cctcacgcta aactgtgaca ggattgagaa caaaggaaaa
     1741 gtctttgaca gtgacgccat aattatcacc actaatctgg ccaacccagc accactggat
     1801 tatgtcaatt ttgaggcgtg ctcaaggcgc attgacttcc tcgtgtacgc ggaagctcct
     1861 gaggtggaaa aggcaaaacg tgacttccca ggccaacctg acatgtggaa gaacgctttc
     1921 agtcctgact tctcacacat aaaactggca ttggctccac agggtggttt tgacaagaac
     1981 ggcaacaccc cgcatggaaa aggtgtcatg aagaccctca caactggctc cctcatcgcc
     2041 cgagcatcag gactactcca tgagaggcta gatgaatatg agttacaagg cccagccctc
     2101 accactttca attttgaccg caacaagata cttgctttta gacagcttgc tgctgaaaac
     2161 aagtatgggc taatggacac aatgagagtt ggaaaacagc tcaaggatgt caggaccatg
     2221 ccagacctca aacaagcact caagaatgtc gcaattaaga agtgccagat agtgtacaat
     2281 ggtggcacct atacgcttga agccgatggc aagggtagtg tgaaagttga caaagtgcaa
     2341 aatgccaccg tgcagaccaa caatgaactg gccggcgccc tgcaccacct aaggtgtgcc
     2401 agaatcaggt actatgttaa gtgtgtccag gaggcactgt attccatcat tcaaatcgct
     2461 ggggctgcgt tcgtcaccac gcgcatcgcc aagcgcatga acatacaaaa tctctggtcc
     2521 aagccacagg tggaagacac agaagagacg gccagcaaag atggttgccc taagcccaaa
     2581 gatgatgaag agttcgtcgt ttcatccgac gacatcaaag ctgagggcaa gaaagggaag
     2641 aacaagtccg gccgtggcaa gaagcacaca gccttctcaa gcaaaggact cagtgatgag
     2701 gagtacgatg agtacaagag aatcagggag gaaaggaatg gcaagtactc catagaagag
     2761 tacctccagg acagagacaa gtactatgag gaggtggcca ttgccagggc aactgaagag
     2821 gacttctgtg aagaagaaga ggccaaaatc cggcagagaa ttttcagacc aacaagaaaa
     2881 caacgtaagg aagaaagggc ctctctaggc ttggtcacag gttcagaaat caggaagaga
     2941 aacccagagg acttcaaacc caagggaaag ctgtgggccg atgatgacag aagtgttgac
     3001 tacaacgaga aactcaactt tgaggcccca ccaagcatct ggtcgcggat agtcaacttt
     3061 ggttcaggtt ggggcttctg ggtctccccc agtttgttta taacatcaac ccatgtcata
     3121 ccccaaggca caaaagagtt cttcggagtc cccatcaaac aaatccagat acacaaatca
     3181 ggtgaattct gtcggctgag attcccaaaa ccaatcagaa ctgatgtgac gggcatgatt
     3241 ctagaagaag gtgcgccaga gggaaccgtg gccacactgc tcatcaagag gccaactgga
     3301 gagcttatgc ctttggcagc cagaatggga actcatgcaa ccatgaaaat tcaggggcgc
     3361 acagttggag gacaaatggg tatgctcttg acagggtcca acgccaagag tatggacttg
     3421 ggcacaacac caggcgactg cggctgtccc tacatctata aaagagggaa cgactatgtg
     3481 gtcataggag tccatactgc tgctgcccgt ggaggaaaca ctgtcatctg tgccacccag
     3541 ggtagtgagg gggaagccac ccttgaagga ggtgacaaca aaggaacgta ctgtggtgca
     3601 ccaatcttgg gtccagggag cgctccgaaa cttagcacca aaactaagtt ttggaggtca
     3661 tccacaacgc cactcccacc aggcacctac gaaccagcct acctcggcgg caaggatccc
     3721 agagtcaaag gtggcccttc attgcaacaa gtcatgaggg accagctaaa gccattcaca
     3781 gagcccagag gcaaaccacc aagaccaaat gtattggaag ccgccaagaa aaccatcatt
     3841 aatgtccttg agcaaacaat tgacccaccc caaaaatggt cattcgcgca agcttgcgca
     3901 tcccttgaca aaaccacctc cagcggccat ccgcaccaca tgcggaaaaa cgattgctgg
     3961 aatggggact cctttacagg aagattggca gatcaggcct ccaaggccaa cctaatgttt
     4021 gaggagggaa agaacatgac tccagtctac acaggtgcac ttaaagatga attggtaaag
     4081 actgacaaga tttatggtaa gatcaagaag aggcttttgt ggggctcgga cttggcgacc
     4141 atgatacggt gcgcccgggc ttttggaggc cttatggatg agctcaaggc acactgtgtc
     4201 acccttccag tcagagttgg tatgaacatg aatgaagatg gccccataat ctttgagaaa
     4261 cactccagat acaagtacca ctatgatgct gattactcca ggtgggactc aacacaacaa
     4321 agggatgtgc tagcagcagc actagaaatc atggttaagt tctctccaga gccacacttg
     4381 gcccaggtgg ttgcagaaga cctcctttcc cctagtgtaa tggatgtagg tgactttcag
     4441 atatcaataa gtgagggact cccctccggg gtgccttgca cttcccagtg gaattccatc
     4501 gcccactggc tcctcaccct ttgtgcgctc tctgaagtca cggacctgtc ccctgatatc
     4561 attcaagcca attccctttt ctccttctac ggtgatgatg agattgtgag tacagacata
     4621 aaattggacc cagagaagct aacagcaaaa ctcaaggagt acgggttgaa gccgacccgt
     4681 cccgacaaaa ctgaaggacc ccttgttatc tctgaagatt tggatggcct gacattcctc
     4741 cggaggactg tgacccgtga cccagctggc tggtttggaa aattggaaca aagctcaatc
     4801 ctcaggcaaa tgtattggac caggggtccc aaccatgaag atccatctga aacaatgata
     4861 ccacactccc aaagacccat acaattgatg tccttgctag gcgaggctgc actccacggc
     4921 cccgcatttt atagcaaaat tagcaaattg gttattgcag agttgaagga aggtggcatg
     4981 gatttttacg tgcccagaca agagccaatg ttcagatgga tgagattctc agatctgagc
     5041 acgtgggagg gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgagtga
     5101 cgccaaccca tctgatgggt ccgcagccaa cctcgtccca gaggtcaaca atgaggttat
     5161 ggctctggag cccgttgttg gtgccgccat agcggcacct gtagcgggcc agcaaaatgt
     5221 aattgacccc tggattagaa ataattttgt acaagcccct ggtggagagt ttacagtatc
     5281 tcctagaaac gctccaggtg aaatactatg gagcgcgccc ttaggccctg atttaaatcc
     5341 ctacctatcc catttggcca gaatgtacaa tagttatgca ggtggctttg aagtgcaggt
     5401 aattctcgcg gggaacgcgt tcaccgccgg gaaggtcata tttgcagcag tcccaccaaa
     5461 ttttccaact gaaggcttaa gtcccagcca ggtcaccatg ttcccccata tagtagtaga
     5521 tgttaggcaa ctagaacctg tgttgattcc cttacccgat gttaggaaca atttttacca
     5581 ctacaatcaa tcaaatgacc ccactatcaa gttgatagca atgttgtata caccacttag
     5641 agctaataat gctggggatg atgtcttcac agtttcttgc cgggtcctca cgagaccatc
     5701 ccccgatttt gatttcatat ttttggtgcc acccacagtt gagtcaagaa ctaagccatt
     5761 ctctatccca gttttaactg ttgaggagat gaccaattca agattcccca ttcctttgga
     5821 gaagttattc acgggcccca gcagtgcctt tgttgttcaa ccacaaaacg gcaggtgcac
     5881 gactgatggc gtgctcctag gcaccaccca actgtctcct gtcaacatct gtaccttcag
     5941 aggggatgtc acccatatca caggtagtca taactacaca atgaatttgg cttctcaaaa
     6001 ttggaacaat tatgacccaa cagaagaaat cccagcccct ctaggaactc cagattttgt
     6061 ggggaagatc caaggcatgc tcacccaaac cacaaggaca gatggctcaa cacgcggcca
     6121 caaagctaca gtgtacaccg ggagcgccga cttcgctcca aaactgggta gagttcaatt
     6181 tgaaactgac acaaaccatg actttgaagc taaccaaaac acaaagttca ccccagtcgg
     6241 tgtcatccaa gatggtagca ccacccaccg aaatgaaccc caacagtggg tgctcccaag
     6301 ttattcaggc agaaacactc ataatgtaca tctggccccc gctgtggccc ccacttttcc
     6361 aggtgagcag cttctcttct tccgatccac catgcccgga tgcagcgggt accccaacat
     6421 ggatctggat tgtctcctcc cccaggaatg ggtgcagcac ttctaccaag aggcagcccc
     6481 agcacaatct gatgtggccc tgctgagatt tgtgaaccca gacacaggta gggttttgtt
     6541 tgagtgtaaa ctccacaaat caggttatgt tactgtggct cacactggcc aacatgattt
     6601 ggttatcccc cccaatggct attttaggtt tgattcctgg gtcaaccagt tctacacgct
     6661 tgcccccatg ggaaatggag cggggcgtag acgtgtagta taatggctgg agctttcttt
     6721 gctggattgg catctgatgt ccttggctct ggacttggtt ccctcatcaa tgctggggct
     6781 ggggccatca accaaaaagt tgagtttgaa aataacagaa aattacaaca agcatccttc
     6841 caatttagca gcaatctgca acaggcctcc tttcaacatg acaaagatat gctccaagca
     6901 caaattgaag ccaccaaaaa gttacaacag gaaatgatga aggtcaagca ggcaatgctc
     6961 ctagagggtg ggttctctga gacagatgca gcccgcgggg caatcaacgc ccccatgaca
     7021 aaagctctgg actggagcgg gacaaggtac tgggcccccg atgctaggac tacaacatac
     7081 aatacaggcc gcttttccac ccctcaacta tcgggggcac cgccaggaag agccaatctt
     7141 agggatgctg tccctgctcg gggttcctcc aataaaactt ctaattcttc tactgctact
     7201 tctgtgtatt cgaatcaaac tacttcaacg agacttggtt ctacagctgg ttctggcacc
     7261 agtgtctcga gcctcccgtc aactgcaagg actaggagct gggttgagga tcaaagtagg
     7321 aatttgtcac ctttcatgag gggggcccac aacatttcgt atgtcacccc accatctagt
     7381 agaccctcca gccaaggcac agtctcaacc gtgcctaaag aggttttgga ctcctggact
     7441 ggcgc
//