Typing tool

Complete norovirus genomes

MW019621  GII.4
 GII.P31

Length: 7,128 | 3 CDS

ORF1: 1..5097
ORF2: 5078..6700
ORF3: 6700..7128
LOCUS       MW019621                7128 bp    RNA     linear   VRL 29-OCT-2020
DEFINITION  Norovirus GII isolate 4020 nonstructural polyprotein (ORF1) gene,
            partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene,
            partial cds.
ACCESSION   MW019621
VERSION     MW019621.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7128)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Near complete genome next generation sequence analysis of norovirus
            GII.4 from Botswana
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7128)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-SEP-2020) Biosciences and Biotechnology, Botswana
            International University of Science and Technology, Khurumela,
            Palapye 00, Botswana
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: Genome Detective v. 1.126
            Sequencing Technology :: Oxford Nanopore(MinION)
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7128
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="4020"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Botswana: Mochudi"
                     /collection_date="25-Oct-2017"
                     /note="genotype: GII.4 Sydney[GII.P31]"
     gene            <1..5097
                     /gene="ORF1"
     CDS             <1..5097
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QNT38524.1"
                     /translation="KMASNDASAAAAANSNNDIAKSSSDGVFSNMAVTFKRALGARPK
                     QPPPKEIPTRPPRPPTPELVKKIPPPPPNGEEELVVSYSAKDGVSGLPELTTVSQPEE
                     TNTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISL
                     AKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMIQRTTGFFRPYQEWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKL
                     KPLNILNILATCDWTFPGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTK
                     SASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLARELA
                     RKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP
                     LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA
                     KRDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS
                     GLLHERLDEYELQGPVLTTFNFDRNKVLAFRQLAAENKYGLMDTMRIGKQLKDVKTMP
                     DLKQALKNVAIKKCQIVYGGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRC
                     ARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETASKDGCP
                     KPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIRDERNGK
                     YSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVT
                     GTEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPS
                     LFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGT
                     VATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDC
                     GCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNRGTYCGAPILGP
                     GSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPR
                     GKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWN
                     GESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLA
                     TMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWDS
                     TQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTS
                     QWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKE
                     YGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPN
                     HEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEP
                     MFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..987
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     988..2085
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2086..2622
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2623..3021
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3022..3564
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3565..5094
                     /gene="ORF1"
                     /product="RdRp"
     gene            5078..6700
                     /gene="ORF2"
     CDS             5078..6700
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QNT38525.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPGLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASLNWNSYDPTEEIPAPLGTPDFVGKIQGML
                     TQTTRADGSTRGHKATVYTGSADFSPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
                     STTHQNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
     gene            6700..>7128
                     /gene="ORF3"
     CDS             6700..>7128
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QNT38526.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAVLLEGGFSETDAARGAIN
                     APMTKTLDWSGTRYWAPDARITTYNAGRFSTPQTSVALPGR"
ORIGIN      
        1 aagatggcgt ctaacgacgc ttccgctgcc gctgctgcca acagcaacaa cgacatcgca
       61 aaatcttcaa gtgacggtgt gttttctaac atggctgtca cttttaaacg ggccctcggg
      121 gcgcggccta aacaaccgcc cccgaaggaa ataccaacca gacccccacg accacccaca
      181 ccagaattgg tcaaaaagat cccccctccc ccacccaacg gggaggaaga attagtggtt
      241 tcttacagcg ccaaagacgg cgtttccgga ttgcctgagc ttaccactgt cagccaaccg
      301 gaagaaacca atacggcgtt cagtgttccc ccgctcaatc aaagggagaa tagggacgcc
      361 aaggaaccac taactgggac aattattgaa atgtgggatg gagaaatcta ccattacggc
      421 ctgtacgtgg aacgaggtct tatacttggt gtgcacaagc caccagcagc catcagcctt
      481 gccaaggtcg agttaacacc actctctttg ttctggagac ctgtgtacac cccccagtat
      541 ctcatctctc cagacactct caggagacta catggagagt cattccccta caccgcattt
      601 gacaacaatt gctacgcctt ctgctgttgg gtattagacc taaacgactc atggctaagt
      661 aggagaatga ttcagagaac aacaggtttc ttcagaccat accaggagtg gaacaggaaa
      721 cccctcccca ctatggatga ctccaaattg aagaaggtag ccaacatatt cttgtgcacc
      781 ctgtcctcac tattcaccag acccattaag gacataatag ggaaattgaa acctcttaac
      841 attctcaata ttctggctac atgtgattgg accttcccag gtatagtgga atccctaata
      901 ctcttggcag agctctttgg agttttctgg acacccccag atgtgtctgc gatgatcgcc
      961 cccttactag gtgattatga actgcaagga cctgaggacc ttgcagtaga actagtccca
     1021 gtggtgatgg gggggatagg tttggtgcta ggatttacca aagagaaaat tggaaagatg
     1081 ctgtcgtccg ccgcatccac tttgagagcc tgtaaagatc ttggagcgta cggactggaa
     1141 attttaaaat tagtcatgaa atggttcttc ccaaagaaag aggaagcaaa tgaactggct
     1201 atggtgagat ccatcgagga cgcagtgctg gacctcgagg caattgaaaa caaccacatg
     1261 accaccctgc tcaaagataa agacagcttg gcaacttaca tgagaaccct tgaccttgag
     1321 gaggagaaag ccagaaaact ctcaaccaaa tctgcttcac ctgatattgt gggcacaatc
     1381 aactctcttc ttgcacgaat cgctgctgca cgttctctag tgcatcgggc gaaagaagag
     1441 ctctccagca ggccgagacc tgttgttgtg atgatatcgg gaaaaccagg gatagggaaa
     1501 acccaccttg ccagggagtt ggccaggaag atcgcagcct ccctcacagg ggaccagcgt
     1561 gtgggcctga tcccacgcaa tggcgttgac cactgggacg catacaaggg tgaaagagtt
     1621 gtcctatggg acgactatgg gatgagcaac cccatacacg atgccctcag gttgcaggaa
     1681 cttgctgaca cttgccccct cacgctaaat tgtgacagga ttgagaacaa aggaaaagtc
     1741 tttgatagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat
     1801 gtcaattttg aagcgtgctc gaggcgcatt gatttcctcg tgtacgcgga ggctcctgag
     1861 gtggagaagg caaaacgcga cttcccaggc caacctgaca tgtggaagaa cgctttcagt
     1921 cctgacttct cacacataaa actggcattg gctccacagg gtggttttga caagaacggc
     1981 aataccccgc atggaaaagg tgttatgaag actctcacca ctggctccct cattgcccga
     2041 gcatcaggat tactccatga gagactagat gaatatgaat tacaaggccc agtcctcact
     2101 accttcaact ttgaccgcaa caaggtgctt gcatttagac agcttgctgc tgaaaacaag
     2161 tatgggttga tggacacaat gagaattgga aaacagctta aggatgtcaa gaccatgcca
     2221 gacctcaaac aagcactcaa gaatgttgcg atcaagaagt gccagatagt gtatggtggt
     2281 agcacctaca cgcttgaggc cgatggcaag ggtagtgtga aggttgacaa agtgcagagt
     2341 gccaccgtgc aaactaacaa tgaactagcc ggcgccctgc accacctgag gtgcgccaga
     2401 atcaggtact atgtcaagtg tgtccaggag gcattgtatt ccatcatcca aatcgctggg
     2461 gccgcgtttg tcaccacgcg catcgccaag cgcatgaaca tacaaaacct ctggtccaag
     2521 ccacaggtgg aagacacaga agagacggcc agcaaagatg gttgcccaaa acccaaagat
     2581 gatgaagagt tcgtcgtttc atccgacgac atcaagactg agggcaagaa agggaagaac
     2641 aagtccggcc gtggcaagaa gcacacagcc ttctcaagca aagggctcag tgatgaggag
     2701 tacgatgagt acaagagaat cagagatgaa aggaatggta agtactccat agaagagtac
     2761 cttcaggaca gagacaagta ctatgaggag gtggccattg ccagggcaac tgaagaggac
     2821 ttctgtgaag aagaagaggc caaaatccgg cagagaattt ttagaccaac aaggaaacaa
     2881 cgtaaagaag agagggcctc tttaggcttg gtcacaggca cagagatcag gaagagaaac
     2941 ccagaagact tcaaacccaa gggaaagctg tgggctgatg atgacagaag tgttgactac
     3001 aacgagaaac tcaattttga ggccccacca agcatctggt cgcggatagt caactttggt
     3061 tcaggttggg gcttctgggt ctcccccagt ttgtttataa catcaaccca tgtcatacct
     3121 caaggtgcaa aagagttctt cggagtcccc atcaaacaaa tccagataca caaatcaggt
     3181 gaattctgcc gactgagatt cccaaaacca atcagaacgg atgtgacggg catgattctg
     3241 gaagaaggtg cgccagaagg aaccgtggcc acactgctca tcaagagacc aactggagag
     3301 ctcatgcctt tggcagctag aatgggaacc catgcaacca tgaagatcca ggggcgcaca
     3361 gttgggggac aaatgggtat gctcttgaca ggatccaacg ccaagagtat ggacttgggc
     3421 acaacaccag gcgactgcgg ttgtccctac atctacaaaa gagggaatga ctatgtggtc
     3481 ataggagtcc atacagccgc tgcccgtgga ggaaacactg tcatctgtgc cacccagggt
     3541 agtgagggag aagccacact tgaaggaggt gacaacagag gaacgtactg tggtgcacca
     3601 atcttgggcc cagggagtgc tccaaaactc agcaccaaga ctaagttttg gagatcatcc
     3661 acaacgccac tcccaccagg cacctacgag ccagcctacc tcggtggcaa ggatcccaga
     3721 gtcaaaggtg gtccttcatt gcaacaagtt atgagggacc agctaaaacc atttacagag
     3781 cccagaggca aaccaccaag accaaatgtg ttggaagctg ccaagaaaac catcattaat
     3841 gttcttgagc aaacaattga cccaccccaa aaatggtcat tcgcgcaagc atgcgcatcc
     3901 cttgacaaaa ccacctccag cggccaccca caccacatgc ggaaaaacga ttgctggaat
     3961 ggggagtcct ttacaggaaa attggcagat caggcctcca aggccaacct aatgtttgaa
     4021 gagggaaaga acatgactcc agtctacaca ggtgcactta aagatgaact ggtgaagact
     4081 gacaaaattt atggtaagat caagaagagg ctcctgtggg gctcggacct ggcgaccatg
     4141 atacggtgcg cccgggcttt tgggggcctc atggatgaac tcaaggctca ctgtgtcacc
     4201 cttcctgtca gagttggtat gaacatgaat gaggatggcc ccataatctt tgagaagcac
     4261 tccagatata aatatcacta tgatgctgat tactccaggt gggactcaac acaacaaagg
     4321 gatgtgctag cagcagcact agaaatcatg gttaagtttt ctccagaacc acacttggcc
     4381 cagatagttg cagaagacct cctttcccct agtgtaatgg atgtgggtga ctttcaaata
     4441 tcaataagtg aggggcttcc ctccggggtg ccttgcacct cccagtggaa ctccatcgcc
     4501 cactggctcc tcaccctttg tgcactctct gaagtcacgg acctgtctcc tgacatcatc
     4561 caggccaact cccttttctc cttctatggt gatgacgaga ttgtgagtac agacataaag
     4621 ttggacccag agaagctgac agcaaaactc aaggagtacg ggctgaagcc aacccgcccc
     4681 gacaaaactg agggacccct tgttatctct gaagacctgg atggcctgac attcctccgg
     4741 aggactgtga cccgtgatcc agctggctgg tttggaaaat tggaacaaag ctcaattctc
     4801 aggcaaatgt actggaccag aggtcccaac catgaagatc catctgagac aatgatacca
     4861 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcact ccacggccca
     4921 gcattttaca gcaaaattag caaattggtc attgcagaat tgaaggaagg tggcatggat
     4981 ttttacgtgc ccaggcaaga gccaatgttc agatggatga gattctcaga tctgagcacg
     5041 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc
     5101 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc
     5161 tctggagccc gttgttggtg ccgctattgc ggcacctgta gcgggccagc aaaatgtaat
     5221 tgacccctgg attagaaata attttgtaca agcccctggt ggagagttta cagtatcccc
     5281 tagaaacgct ccaggtgaaa tactatggag cgcgccctta ggccccggtc taaatcccta
     5341 cctatcccat ttggccagaa tgtacaatgg ttatgcaggt ggttttgaag tgcaggtaat
     5401 tctcgcgggg aacgcgttca ccgctgggaa ggtcatattt gccgcagtcc caccaaattt
     5461 cccaactgaa ggcttgagcc ccagccaggt cactatgttc ccccatatag tagtggatgt
     5521 taggcaacta gaacctgtgt tgattcccct acccgacgtt aggaacaatt tctatcacta
     5581 caatcagtca aatgacccca ctattaagtt gatagcaatg ctgtatacac cacttagggc
     5641 taataatgct ggggatgatg tcttcacagt ctcttgccgg gtcctcacga gaccatcccc
     5701 cgattttgat tttatatttc tagtgccacc cacagttgag tcaagaacta agccattctc
     5761 tgtcccggtc ttaactgttg aggagatgac caattcaaga ttccccatcc ctttggaaaa
     5821 gttgttcacg ggtcccagca gtgcctttgt tgttcaacca caaaacggca ggtgtacgac
     5881 tgatggcgtg ctcctaggta ccacccaact gtctcctgtc aacatctgca ccttcagagg
     5941 ggatgtcacc cacatcacag gtagtcgtaa ctacacaatg aatttggctt ctctaaattg
     6001 gaacagttac gacccaacag aagaaatccc agcccctctg ggaactccag attttgtagg
     6061 gaaaatccaa ggtatgctca cccaaaccac aagggcagat ggctcaacac gcggccacaa
     6121 agctacagtg tacactggga gcgccgattt ttctccaaaa ttgggtagag ttcaatttga
     6181 aactgacaca gaccatgatt ttgaagccaa ccaaaacaca aagttcaccc cagtcggtgt
     6241 catccaagat ggcagcacca cccatcaaaa tgaaccccaa cagtgggtgc tcccaagtta
     6301 ctcaggcaga aacactccca acgtgcatct ggcccccgct gtagccccca cttttccagg
     6361 tgagcaactc cttttcttca gatccaccat gcccggatgc agcgggtatc ccaatatgga
     6421 tttggactgt ctactccccc aggaatgggt gcagtacttc taccaagagg cagccccagc
     6481 acaatctgat gtggctctgc taagatttgt gaatccagac acaggtaggg ttttgtttga
     6541 gtgtaagctt cataagtcag gctatgttac agtggctcac actggccaac atgatttggt
     6601 tatccccccc aatggttatt ttaggtttga ttcctgggtc aaccagttct acacgcttgc
     6661 ccccatggga aatggaacgg ggcgtagacg tgcagtataa tggctggagc tttctttgct
     6721 ggattggcat ctgatgtcct tggctctgga cttggttccc ttatcaatgc tggggctggg
     6781 gccatcaacc aaaaagttga gtttgaaaat aacagaaaat tacaacaagc atccttccaa
     6841 tttagcagta atctacaaca ggcttccttc caacatgaca aagagatgct ccaggcacaa
     6901 attgaggcca ccaaaaagct acaacaggaa atgatgaaag ttaaacaggc agtgctccta
     6961 gagggtgggt tctctgagac agatgcagcc cgcggggcaa tcaacgcccc tatgacaaag
     7021 actttggact ggagcgggac aaggtactgg gctcccgatg ctaggattac aacatacaat
     7081 gcaggccgct tttccacccc tcaaacatcg gttgcactac caggaagg
//