Typing tool
|
Complete norovirus genomes
MW019958 | GII.4 | ||
---|---|---|---|
GII.P31 |
ORF1: 1..5097 ORF2: 5078..6700 ORF3: 6700..7017LOCUS MW019958 7017 bp RNA linear VRL 29-DEC-2020 DEFINITION Norovirus GII isolate 1137 nonstructural polyprotein (ORF1) gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene, partial cds. ACCESSION MW019958 VERSION MW019958.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7017) AUTHORS Makhaola,K., Moyo,S. and Kebaabetswe,L.P. TITLE Next Generation Sequencing of Near-Full Length Genome of Norovirus GII.4 from Botswana JOURNAL Unpublished REFERENCE 2 (bases 1 to 7017) AUTHORS Makhaola,K., Moyo,S. and Kebaabetswe,L.P. TITLE Direct Submission JOURNAL Submitted (20-SEP-2020) Biosciences and Biotechnology, Botswana International University of Science and Technology, Khurumela, Palapye 00, Botswana COMMENT ##Assembly-Data-START## Assembly Method :: Genome Detective v. 1.126 Sequencing Technology :: Oxford Nanopore(MinION) ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7017 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="1137" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="Botswana: Gaborone" /collection_date="25-Sep-2018" /note="genotype: GII.4 Sydney[GII.P31]" gene <1..5097 /gene="ORF1" CDS <1..5097 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QNT54368.1" /translation="KMASNDASAAAVANGNNDIAKSSSDNVLSSMAITFKRALGARPK QPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGISGLPDLTTVSQPEE NNTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISL AKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSW LSRRMIQRTTGFFRPYQEWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKV KPLNILNILASCDWTFPGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK KEEANELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDLEEEKARKLSTK SASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLARELA KKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA KRDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS GLLHERLDEYELQGPVLTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGKQLKDVKTMP DLKQALKNVAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRC ARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETASKDGCP KPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIRDERNGK YSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVT GTEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSSN LLITTTHVLPKGVKELFGVEIKQIQVHKSGEFCRFRFPRSIRPDVTGLVLEEGAPEGT VCSILIKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDC GCPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGGEDRGTYCGAPILGP GKAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPR GKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWN GESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKVKKRLLWGSDLA TMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDS TQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTS QWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKE YGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPN HEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEP MFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..987 /gene="ORF1" /product="p48" mat_peptide 988..2085 /gene="ORF1" /product="NTPase" mat_peptide 2086..2622 /gene="ORF1" /product="p22" mat_peptide 2623..3021 /gene="ORF1" /product="VPg" mat_peptide 3022..3564 /gene="ORF1" /product="Pro" mat_peptide 3565..5094 /gene="ORF1" /product="RdRp" gene 5078..6700 /gene="ORF2" CDS 5078..6700 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QNT54369.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPGLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLDPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKQFSVPILTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDLTHIANSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRADGSTRGHKATVLTGSADFAPKLGRIQFQTDTDRDFEAHQNTKFTPVGVIQDG GTTHQNEPQQWVLPSYSGRDTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV" gene 6700..>7017 /gene="ORF3" CDS 6700..>7017 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QNT54370.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAVLLEGGFSETDAARGAIN APMT" ORIGIN 1 aagatggcgt ctaacgacgc ttccgctgcc gctgttgcca atggcaacaa cgacatcgca 61 aaatcttcaa gtgacaatgt gctttctagc atggctatca cttttaaacg agctctcggg 121 gcgcggccta aacagccgcc cccgaaggaa ataccaccca gacctccacg accacccaca 181 ccagaattgg tcaaaaagat cccccctccc ccacccaacg gagaggatga actagtggtt 241 tcttacagcg ccaaagatgg catttccgga ttgcctgatc tgaccactgt cagccaaccg 301 gaagaaaaca acacggcgtt cagcgttccc ccgctcaatc aaagggagaa tagggacgcc 361 aaggaaccac taactggaac aattatagag atgtgggatg gagaaatcta ccattacggt 421 ctgtacgtgg aacgaggtct tatacttggt gtgcacaagc caccggcagc catcagcctt 481 gccaaggtcg agttaacacc actctctttg ttctggagac ctgtgtatac ccctcagtac 541 ctcatctctc cagacactct taggagacta catggagagt cattccccta tactgcattt 601 gacaacaatt gctacgcctt ctgctgttgg gtgttagacc taaacgactc atggctgagt 661 aggagaatga ttcagaggac tacaggattc ttcagaccat accaggaatg gaacaggaaa 721 cccctcccca ctatggatga ttccaaactg aaaaaggtgg ccaacatatt cttgtgcacc 781 ctatcttcac tgttcaccag acccattaag gacataatag gaaaagtgaa acctcttaac 841 atcctcaata tcctggcctc atgtgattgg actttcccag gcatagtgga atccctaata 901 ctcttggcag agctctttgg agttttctgg acacccccag atgtgtctgc gatgatcgcc 961 cctttactag gtgattatga actgcaagga cctgaggacc ttgcagtaga actggtccca 1021 gtggtgatgg gagggatagg cttggtgcta ggatttacca aagagaaaat tggaaagatg 1081 ctgtcgtccg ccgcatccac cttaagggct tgcaaagacc ttggtgcata cggactagaa 1141 attttgaaat tggtcatgaa atggttcttc ccaaagaaag aggaagcaaa tgagctggcc 1201 atggtgagat ccatcgagga cgcagtactg gacctcgagg caattgaaaa caaccacatg 1261 accgccctgc tcaaggataa agacagcttg gcaacctaca tgagaaccct tgaccttgag 1321 gaggagaaag ccagaaaact ctcaaccaaa tctgcttcac ctgacattgt gggtacaatc 1381 aacgctcttc tggcacgaat cgctgctgca cgttccctag tgcatcgggc gaaagaagag 1441 ctctccagca ggccgagacc tgttgttgtg atgatatcgg gaaaaccagg gatagggaaa 1501 acccaccttg ccagggagtt ggccaagaag atcgcagcct ccctcacagg ggaccagcgt 1561 gtgggcctga tcccacgcaa tggcgttgac cactgggacg catacaaggg tgaaagagtt 1621 gtcctatggg acgactatgg gatgagcaac cccatacacg atgccctcag gttgcaggaa 1681 cttgctgaca cttgccccct cacgctaaat tgtgacagga ttgagaacaa aggaaaagtc 1741 tttgatagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat 1801 gtcaattttg aagcgtgctc gaggcgcatt gatttcctcg tgtacgcgga ggctcctgag 1861 gtggagaagg caaaacgcga cttcccaggc caacctgaca tgtggaagaa cgctttcagt 1921 cctgacttct cacacataaa actggcattg gctccacagg gtggttttga caagaacggc 1981 aacaccccgc atggaaaagg tgttatgaag actctcacca ctggctccct cattgcccga 2041 gcatcaggat tactccatga gagactagat gaatatgaat tacaaggccc agtcctcact 2101 accttcaact ttgaccgcaa caaggtgctt gcattcagac agcttgctgc tgaaaacaag 2161 tatgggttga tggacacaat gagagttgga aaacagctta aggatgtcaa gaccatgcca 2221 gaccttaaac aagcactcaa gaatgtcgcg atcaagaagt gccagatagt gtataatggt 2281 agcacctaca cgcttgaggc cgatggcaag ggtagtgtga aggttgacaa agtgcagagt 2341 gccaccgtgc aaactaacaa tgaactagcc ggcgccctgc accacctgag gtgcgccaga 2401 atcaggtact atgtcaagtg tgtccaggag gcattgtatt ccatcatcca aatcgctggg 2461 gccgcgtttg tcaccacgcg catcgccaag cgcatgaaca tacaaaacct ctggtccaag 2521 ccacaggtgg aagacacaga agagacggcc agcaaagatg gttgcccaaa acccaaagat 2581 gatgaagagt tcgtcgtttc atccgacgac atcaagactg agggcaagaa agggaagaac 2641 aagtccggcc gtggcaagaa gcacacagcc ttctcaagca aagggctcag tgatgaggag 2701 tacgatgagt acaagagaat cagagatgaa aggaatggta agtactccat agaagagtac 2761 cttcaggaca gagacaagta ctatgaggag gtggccattg ccagggcaac tgaagaggac 2821 ttctgtgaag aagaagaggc caaaatccgg cagagaattt ttagaccaac aaggaaacaa 2881 cgtaaagaag agagggcctc tttaggcttg gtcacaggca cagagatcag gaagagaaac 2941 ccagaagact tcaaacccaa gggaaagctg tgggctgatg atgacagaag tgttgactac 3001 aacgagaaac tcaactttga ggccccacca agcatctggt cgcggatagt caactttggt 3061 tcaggttggg gcttctgggt ttcatccaac cttctgatca caacaacaca tgttctgcct 3121 aaaggggtta aggaactctt tggagttgaa attaaacaaa tccaagtcca caagtctgga 3181 gagttctgca gattcagatt cccgagatcc attaggccag atgtcacagg acttgtgctg 3241 gaggaaggag ccccagaagg cactgtctgt tccatactca taaaaaggcc tacaggtgag 3301 atgatcccct tggcagtgag gatgggcaca catgcatcca tgaaaataca gggccggacc 3361 gttggtggcc agatgggaat gctcttaaca ggggcgaatg caaagaacat ggatctcggc 3421 actggtcctg gtgactgcgg ttgtccctac atctacaaac gcggcaacga cattgttgtc 3481 gcgggtgttc acaccgcagc agcccgggga ggcaacactg tcatatgtgc cacccaaggg 3541 caggatgggg aggcagtcct tgagggaggt gaggaccgtg gcacctactg tggcgcccca 3601 attctgggcc ctggcaaggc gcccaaactc agcaccaaga cgaaattctg gagatcatcc 3661 acaacgccac tcccaccagg cacctacgaa ccagcctacc tcggtggcaa ggaccccaga 3721 gtcaaaggtg gcccttcact gcaacaagtt atgagggacc agctaaaacc attcacagaa 3781 cctagaggca aaccaccaag accaaatgtg ttggaagctg ccaagaaaac catcattaat 3841 gttcttgagc aaacaattga cccaccccaa aaatggtcat tcgcgcaagc ttgcgcatcc 3901 cttgacaaaa ccacctccag cggccaccca caccacatgc ggaaaaacga ttgttggaat 3961 ggggagtcct ttacaggtaa attggcagac caggcctcca aggccaacct aatgtttgaa 4021 gagggaaaga acatgactcc agtctacaca ggtgcactta aggatgaact ggtgaagact 4081 gacaaaattt atggcaaggt caagaagagg ctcctgtggg gctcggacct ggcgaccatg 4141 atacggtgcg cccgggcttt tggaggtctc atggatgaac tcaaggcaca ctgtgttacc 4201 cttcctgtca gagttggtat gaacatgaat gaggatggcc ccataatctt tgagaagcac 4261 tccaggtata ggtatcacta tgacgctgac tactccaggt gggactcaac acaacaaagg 4321 gatgtgctag cagcagcact agaaatcatg gttaagtttt ctccagagcc acacttggcc 4381 cagatagttg cagaagacct cctctcccct agtgtaatgg atgtgggtga cttccaaata 4441 tcaataagtg agggactccc ctccggggtg ccttgcacct cccaatggaa ctccatcgcc 4501 cactggctcc tcaccctttg tgcactctca gaagtcacag acctgtcccc tgacatcatc 4561 caggccaact cccttttctc cttctatggt gatgatgaga ttgtgagtac agacataaaa 4621 ttggacccag agaaactgac agcaaaactc aaagagtacg ggctgaagcc aacccgcccc 4681 gacaaaactg aaggacccct tgttatctct gaagatctgg atggcctgac cttcctccgg 4741 aggactgtga cccgtgaccc agctggttgg tttggaaaat tggaacagag ctcaattctc 4801 aggcaaatgt attggaccag gggccccaac catgaagatc catctgaaac aatgatacca 4861 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcact ccacggccca 4921 gcattttaca gcaaaattag caaattggtc attgcagaat tgaaagaagg tggcatggat 4981 ttttacgtgc ccaggcaaga gccaatgttc agatggatga gattctcaga tctgagcacg 5041 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc 5101 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc 5161 tctggagccc gttgttggtg ccgctatcgc ggcacctgta gcgggccaac aaaatgtaat 5221 tgacccctgg attagaaata attttgtgca agcccctggt ggagagttta cagtatcccc 5281 tagaaacgct ccaggtgaaa tactatggag cgcgccctta ggccccggtc taaatcccta 5341 cctatcccat ttggccagaa tgtacaatgg ttatgcaggt ggttttgaag tgcaggtaat 5401 cctcgcgggg aacgcgttca ccgctgggaa ggtcatattt gcagcagtcc caccaaattt 5461 tccaactgaa ggcctgagcc ccagccaggt caccatgttc ccccatatag tagtggatgt 5521 taggcaatta gaccctgtgt tgattccctt acccgacgtt aggaacaatt tctaccacta 5581 caaccaatca aatgacccca ccattaagtt gatagcaatg ctttatacac cacttagggc 5641 taataatgct ggggacgatg tcttcacagt ctcttgccgg gtcctcacga gaccatcccc 5701 cgactttgat tttatattcc tagtgccacc cacagttgag tcaagaacta agcaattctc 5761 tgttccaatc ttaactgttg aggagatgac caattcaaga ttccccattc ctttggaaaa 5821 gttgttcacg ggtcccagca gtgcttttgt tgttcaacca caaaacggca ggtgcacaac 5881 tgatggcgtg ctcctaggca ccacccaact gtctcctgtc aacatctgca ccttcagagg 5941 agatctcact cacatcgcaa atagtcataa ttacacaatg aatttggctt ctcaaaattg 6001 gaacaattat gacccaacag aagaaatccc agcccctcta ggaactccag attttgtagg 6061 gaagatacaa ggagtgctca cccaaaccac aagggcggat ggctcaacac gcggccacaa 6121 agccacagtg ctcactggga gcgccgattt tgctccaaaa ctgggtagaa ttcaatttca 6181 aactgacaca gatcgtgatt ttgaagctca ccaaaacaca aagttcaccc cagtcggtgt 6241 catccaagat ggtggcacca cccatcaaaa tgaaccccaa cagtgggtgc tcccaagtta 6301 ctcaggcagg gacaccccca atgtgcattt ggcccccgct gtagccccca cttttccggg 6361 tgagcaactt cttttcttca gatccaccat gcccggatgc agcgggtacc ccaatatgga 6421 tttggactgt ctactccccc aggaatgggt gcagtacttc taccaagagg cagccccagc 6481 acaatctgat gtggctctgc taagatttgt gaatccagac acaggtaggg ttttgtttga 6541 gtgtaagctt cataagtcag gctatgttac agtggctcac actggccaac atgatttggt 6601 tatccccccc aatggttatt ttaggtttga ttcctgggtc aaccagttct acacgcttgc 6661 ccccatggga aatggaacgg ggcgtagacg tgcagtataa tggctggagc tttctttgct 6721 ggattggcat ctgatgtcct tggctctgga cttggttccc ttatcaatgc tggggctggg 6781 gccatcaacc aaaaagttga gtttgaaaat aacagaaaat tacaacaagc atccttccaa 6841 tttagcagta atctacaaca ggcttccttc caacatgaca aagagatgct ccaggcacaa 6901 attgaggcca ccaaaaagct acaacaggaa atgatgaaag ttaaacaggc agtgctccta 6961 gagggtgggt tctctgagac agatgcagcc cgcggggcaa tcaacgcccc catgaca //