Typing tool

Complete norovirus genomes

MW019959  GII.4 Sydney[GII.P31]

Length: 7,020 | 3 CDS

ORF1: 1..5097
ORF2: 5078..6700
ORF3: 6700..7020
LOCUS       MW019959                7020 bp    RNA     linear   VRL 29-DEC-2020
DEFINITION  Norovirus GII isolate 2049 nonstructural polyprotein (ORF1) gene,
            partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene,
            partial cds.
ACCESSION   MW019959
VERSION     MW019959.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7020)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Next Generation Sequencing of Near-Full Length Genome of Norovirus
            GII.4 from Botswana
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7020)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-SEP-2020) Biosciences and Biotechnology, Botswana
            International University of Science and Technology, Khurumela,
            Palapye 00, Botswana
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: Genome Detective v. 1.126
            Sequencing Technology :: Oxford Nanopore(MinION)
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7020
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="2049"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Botswana: Molepolole"
                     /collection_date="20-Oct-2017"
                     /note="genotype: GII.4 Sydney[GII.P31]"
     gene            <1..5097
                     /gene="ORF1"
     CDS             <1..5097
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QNT54371.1"
                     /translation="KMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPK
                     QPPPKEKPPKPPRPPTPELIKEIPPPPPNGEDEPVVSYSAKDGVSGLPELTTVRQPEE
                     NNTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISL
                     AKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKL
                     KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEANELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDLEEEKARKLSTK
                     SASPDIVGTINALLARIAAARSLVHRAKEELSSRSRPVVVMISGKPGIGKTHLARELA
                     KKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP
                     LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA
                     KRDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS
                     GLLHERLDEYELQGPVLTTFNFDRNKVLAFRQLAAENKYGLMDTMRIGKQLKDVKTMP
                     DLKQALKNVAIKKCQIVYGGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRC
                     ARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETASKDGCP
                     KPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIRDERNGK
                     YSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVT
                     GTEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSSN
                     LLITTTHVLPKGVKELFGVEIKQIQVHKSGEFCRFRFPRSIRPDVTGLVLEEGAPEGT
                     VCSILIKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDC
                     GCPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGGEDRGTYCGAPILGP
                     GKAPKLSTKTKFWRSSPDALPPGTYEPAYLGGKDPRVKKGPSLQQVMRDQLKPFTEPR
                     GKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWN
                     GESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLA
                     TMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWDS
                     TQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTS
                     QWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKE
                     YGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPN
                     HEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEP
                     MFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..987
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     988..2085
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2086..2622
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2623..3021
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3022..3564
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3565..5094
                     /gene="ORF1"
                     /product="RdRp"
     gene            5078..6700
                     /gene="ORF2"
     CDS             5078..6700
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QNT54372.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDSTLKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPILTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEINQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPTYSGRKTLNVHLAPAVAPTFPGEQLLFFRSTMPGCSGVPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV"
     gene            6700..>7020
                     /gene="ORF3"
     CDS             6700..>7020
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QNT54373.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIDATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTK"
ORIGIN      
        1 aagatggcgt ctaacgacgc ttccgctgcc gctgttgcca acagcaacaa cgacatcgca
       61 aaatcttcaa gtgacggtgt gttttctaac atggctgtca cttttaaacg ggccctcggg
      121 gcgcggccta aacagccgcc cccgaaggaa aaaccaccca aacccccgcg accacccaca
      181 ccagagttga tcaaagagat ccctcccccc ccacccaatg gggaggatga accagtggtc
      241 tcctacagcg ccaaagacgg cgtttccggg ctgcctgagc tcaccactgt cagacaaccg
      301 gaagaaaaca acacggcgtt cagtgtcccc ccactcaacc aaagggagaa cagggacgcc
      361 aaggagccac taactggaac aattattgaa atgtgggatg gagaaatcta ccattacggc
      421 ctgtacgtgg aacgaggtct tatacttggt gtgcacaagc caccggcggc catcagcctt
      481 gccaaggtcg agctaacacc gctctctttg ttctggagac ctgtgtacac cccccagtat
      541 ctcatctctc cagacactct taggagacta catggagagt cgttccccta cactgcattt
      601 gacaacaatt gctacgcctt ttgttgttgg gtactagacc taaacgactc atggctaagc
      661 aggagaatga ttcagagaac aacaggtttc tttaggccat accaggactg gaacaggaaa
      721 cccctcccca ctatggatga ttccaaatta aagaaggtag ccaacatatt cttgtgcact
      781 ttgtcttcgc tatttaccag gcccattaag gacataatag ggaagctgaa acctctcaac
      841 atccttaaca ttctggctac atgtgattgg actttcgcag gcatagtgga gtccttaata
      901 ctcttggcag aactctttgg agttttctgg acacccccag atgtgtctgc gatgatcgcc
      961 cctttactag gtgattatga actgcaagga cctgaggacc ttgcagtaga actggtccca
     1021 gtggtgatgg gagggatagg cttggtgcta ggatttacca aagagaaaat tggaaagatg
     1081 ctgtcgtccg ccgcatccac cttaagggct tgcaaagacc ttggtgcata cggactagaa
     1141 attttgaaat tggtcatgaa atggttcttc ccaaagaaag aggaagcaaa tgagctggcc
     1201 atggtgagat ccatcgagga cgcagtactg gacctcgagg caattgaaaa caaccacatg
     1261 accgccctgc tcaaggataa agacagcttg gcaacctaca tgagaaccct tgaccttgag
     1321 gaggagaaag ccagaaaact ctcaaccaaa tctgcttcac ctgacattgt gggtacaatc
     1381 aacgctcttc tggcacgaat cgccgctgca cgctccctag tgcatcgggc gaaagaagag
     1441 ctctccagca ggtcgaggcc tgtcgttgtg atgatatcgg gaaaaccagg gatagggaaa
     1501 actcaccttg ccagggagtt ggccaagaag atcgcagcct ccctcacagg ggaccagcgt
     1561 gtgggcctga tcccacgcaa tggcgttgac cactgggacg catacaaggg tgaaagagtt
     1621 gtcctatggg acgactatgg gatgagcaac cccatacacg atgccctcag gttgcaggaa
     1681 cttgctgaca cttgccccct cacgctaaat tgtgacagga ttgagaacaa aggaaaagtc
     1741 tttgatagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat
     1801 gtcaattttg aagcgtgctc gaggcgcatt gatttcctcg tgtacgcgga ggctcctgag
     1861 gtggagaagg caaaacgcga cttcccaggc caacctgaca tgtggaagaa cgctttcagt
     1921 cctgacttct cacacataaa actggcattg gctccacagg gtggttttga caagaacggc
     1981 aacaccccgc atggaaaagg tgttatgaag actctcacca ctggctccct cattgcccga
     2041 gcatcaggat tactccatga gagactagat gaatatgaat tacaaggccc agtcctcact
     2101 accttcaact ttgatcgcaa caaagtgctt gcctttagac agcttgctgc tgaaaacaag
     2161 tatgggctga tggacacaat gagaattgga aaacagctta aggatgtcaa gaccatgcca
     2221 gacctcaaac aggcactcaa gaatgttgcg atcaagaagt gccagatagt gtatggtggt
     2281 agcacctaca cgcttgaggc cgatggcaag ggtagtgtga aggttgacaa agtgcagagt
     2341 gccaccgtgc aaactaacaa tgaactagcc ggcgccctgc accacctgag gtgcgccaga
     2401 atcaggtact atgtcaagtg tgtccaggag gcattgtatt ccatcatcca aatcgctggg
     2461 gccgcgtttg tcaccacgcg catcgccaag cgcatgaaca tacaaaacct ctggtccaag
     2521 ccacaggtgg aagacacaga agagacggcc agcaaagatg gttgcccaaa acccaaagat
     2581 gatgaagagt tcgtcgtttc atccgacgac atcaagactg agggcaagaa agggaagaac
     2641 aagtccggcc gtggcaagaa gcacacagcc ttctcaagca aagggctcag tgatgaggag
     2701 tacgatgagt acaagagaat cagagatgaa aggaatggta agtactccat agaagagtac
     2761 cttcaggaca gagacaagta ctatgaggag gtggccattg ccagggcaac tgaagaggac
     2821 ttctgtgaag aagaagaggc caaaatccgg cagagaattt ttagaccaac aaggaaacaa
     2881 cgtaaagagg agagggcctc tttaggcttg gtcacaggca cagagatcag gaagagaaac
     2941 ccagaagact tcaaacccaa aggaaagctg tgggctgatg atgacagaag tgttgactac
     3001 aacgagaaac tcaactttga ggccccacca agcatctggt cgcggatagt caactttggt
     3061 tccggttggg ggttctgggt ttcatccaac cttctgatca caacaacaca cgttctgcct
     3121 aaaggggtta aggaactctt tggagttgaa attaaacaaa tccaagtcca caagtctgga
     3181 gagttctgca gattcagatt cccgagatcc attaggccag atgtcacagg acttgtgctg
     3241 gaggaaggag ccccagaagg cactgtctgt tccatactca taaaaaggcc tacaggtgag
     3301 atgatcccct tggcagtgag gatgggcaca catgcatcca tgaaaataca gggccggacc
     3361 gttggtggcc agatgggaat gctcctaaca ggggcgaatg caaagaacat ggatctcggc
     3421 actggtcctg gtgactgcgg ttgtccctac atctacaaac gcggcaacga cattgttgtc
     3481 gcgggtgttc acaccgcagc agcccgggga ggcaacactg tcatatgtgc cacccaaggg
     3541 caggatgggg aggcagtcct tgagggaggt gaggaccgtg gcacctactg tggcgcccca
     3601 attctgggcc ctggcaaggc gcccaaactc agcacgaaga ctaagttttg gcgctcgtca
     3661 ccagatgcct tgccgcctgg cacgtatgaa cctgcttacc tgggaggcaa ggaccccaga
     3721 gtgaaaaaag ggccttcctt gcagcaagtc atgagggacc aattgaaacc atttacagag
     3781 cccagaggta aaccaccaag accaaatgtg ttggaagctg ccaagaaaac catcattaat
     3841 gttcttgagc aaacaattga cccaccccaa aaatggtcat tcgcgcaagc atgcgcatcc
     3901 cttgacaaaa ctacctccag cggccaccca caccacatgc ggaaaaacga ttgctggaat
     3961 ggggagtcct ttacaggaaa attggcagat caggcctcca aggccaacct aatgtttgaa
     4021 gagggaaaga acatgactcc agtctacaca ggtgcactta aagatgaact ggtgaagact
     4081 gacaaaattt atggtaagat caagaagagg ctcctgtggg gctcggacct ggcgaccatg
     4141 atacggtgcg cccgggcttt tgggggcctc atggatgaac tcaaggctca ctgtgtcacc
     4201 cttcctgtca gagttggtat gaacatgaat gaggatggcc ccataatctt tgagaagcac
     4261 tccagatata aatatcacta tgatgctgat tactccaggt gggactcaac acaacaaagg
     4321 gatgtgctag cagcagcact tgaaatcatg gttaagttct ccccagaacc acatctggcc
     4381 cagatagttg cagaagacct cctttccccc agcgtgatgg atgtgggtga ttttcaaata
     4441 tcaataagtg agggtctccc ctctggggtg ccttgtacct cccagtggaa ttccatcgcc
     4501 cactggcttc tcactctttg tgcactctct gaagtcacgg acctgtctcc tgacattatt
     4561 caggccaact cccttttctc cttctacggt gatgatgaga ttgtaagcac agacataaag
     4621 ttggatccag agaagctgac agcaaaactc aaggagtacg ggctgaaacc aacccgcccc
     4681 gacaaaactg aaggacccct tgttatatct gaagacctgg atggcctgac tttcctccgg
     4741 agaactgtga cccgtgatcc agctggttgg tttggaaaat tggaacaaag ttcaattctc
     4801 aggcaaatgt actggactag gggtcccaac catgaagatc catttgaaac aatgatacca
     4861 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcact tcatggcccg
     4921 gcattttata gcaaaatcag caaactagtc attgcagagt tgaaggaagg tggcatggac
     4981 ttttacgtgc ccagacaaga gccaatgttt agatggatga gattctcaga tctgagcacg
     5041 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc
     5101 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc
     5161 tctggagcct gttgttggtg ccgccattgc ggcacctgta gcgggccaac aaaatgtaat
     5221 tgacccctgg attagaaaca attttgtaca ggcccctggt ggagaattta cagtatcccc
     5281 tagaaacgct ccaggtgaaa tactatggag cgcgccctta ggccctgatt taaaccccta
     5341 cctatcccat ttggccagga tgtacaatgg ttacgcaggt ggttttgaag tgcaggtaat
     5401 cctcgcgggg aacgcgttca ccgccgggaa ggtcatattt gcagcagtcc caccaaattt
     5461 tccaactgaa ggcttaagcc ccagccaggt cactatgttc ccccatataa tagtagatgt
     5521 taggcaacta gaacctgtgt tgattcccct acccgatgtt aggaataatt tctatcatta
     5581 caatcaatca aatgattcca cccttaaatt gatagcaatg ttatatacac cacttagggc
     5641 taataatgct ggggacgatg tcttcacagt ctcttgccga gtcctcacaa gaccatcccc
     5701 cgattttgat ttcatattct tggtgccgcc cacagttgag tcaagaacta aaccattctc
     5761 tgtcccaatt ctaaccgttg aggagatgac caattcaaga ttccccattc ccttggaaaa
     5821 gttgttcacg ggtcccagca gtgcctttgt tgttcaacca caaaacggca ggtgcacgac
     5881 tgatggcgtg ctcctaggca ccacccaact gtctcctgtc aacatctgca ccttcagagg
     5941 ggatgtcacc cacattacag gtagtcgaaa ttatacaatg aatttggctt ctcaaaattg
     6001 gaacaattat gacccaacag aagaaatccc agcccctcta ggaactccag attttgtagg
     6061 gaagattcaa ggtatgctca cccaaaccac aaggacagat ggctcaacac gcggccacaa
     6121 agctacagtg tacactggga gcgccgactt tgctccaaaa ttgggtagag ttcaatttga
     6181 aactgacaca gaccatgact ttgaaattaa ccaaaacaca aagttcaccc cagtcggtgt
     6241 catccaagac ggtagcacca cccaccgaaa tgagccccaa cagtgggtgc tcccaactta
     6301 ctcaggcaga aaaacactta atgtgcatct ggcccccgct gtagccccca cttttccggg
     6361 tgagcaactt cttttcttca gatctaccat gcccggatgc agcggtgtac ccaacatgga
     6421 tttggactgt ctgctccccc aggaatgggt gcagtacttc taccaagagg cagccccagc
     6481 acaatctgat gtggctctgc taagatttgt gaatccagat acaggtaggg ttttgtttga
     6541 gtgtaagctt cataaatcag gctatgttac agtggctcac actggccaac atgatctggt
     6601 tatccccccc aatggttatt ttaggtttga ttcttgggtc aaccagttct acacactcgc
     6661 ccccatggga aatggaacgg ggcgtagacg tgtagtataa tggctggagc tttctttgct
     6721 ggattggcat ctgatgtcct tggctccgga cttggttccc tcatcaatgc tggggctggg
     6781 gccatcaacc aaaaagttga gtttgaaaat aacagaaaat tacaacaagc atccttccag
     6841 tttagcagta atttgcaaca ggcttccttt caacatgaca aagagatgct ccaagcacaa
     6901 attgatgcca ccaaaaagct acaacaggaa atgatgaaag ttaagcaggc aatgctccta
     6961 gagggtgggt tttctgagac agatgcagcc cgcggggcaa tcaacgcccc catgacaaaa
//