Complete norovirus genomes
Accession | Genotype | P-type
---|---|---
MW019959 | GII.4 Sydney | GII.P31
ORF1: 1..5097
ORF2: 5078..6700
ORF3: 6700..7020

LOCUS       MW019959                7020 bp    RNA     linear   VRL 29-DEC-2020
DEFINITION  Norovirus GII isolate 2049 nonstructural polyprotein (ORF1) gene,
            partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene,
            partial cds.
ACCESSION   MW019959
VERSION     MW019959.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7020)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Next Generation Sequencing of Near-Full Length Genome of Norovirus
            GII.4 from Botswana
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7020)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-SEP-2020) Biosciences and Biotechnology, Botswana
            International University of Science and Technology, Khurumela,
            Palapye 00, Botswana
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: Genome Detective v. 1.126
            Sequencing Technology :: Oxford Nanopore(MinION)
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7020
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="2049"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Botswana: Molepolole"
                     /collection_date="20-Oct-2017"
                     /note="genotype: GII.4 Sydney[GII.P31]"
     gene            <1..5097
                     /gene="ORF1"
     CDS             <1..5097
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QNT54371.1"
                     /translation="KMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPK
                     QPPPKEKPPKPPRPPTPELIKEIPPPPPNGEDEPVVSYSAKDGVSGLPELTTVRQPEE
                     NNTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISL
                     AKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKL
                     KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEANELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDLEEEKARKLSTK
SASPDIVGTINALLARIAAARSLVHRAKEELSSRSRPVVVMISGKPGIGKTHLARELA KKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA KRDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS GLLHERLDEYELQGPVLTTFNFDRNKVLAFRQLAAENKYGLMDTMRIGKQLKDVKTMP DLKQALKNVAIKKCQIVYGGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRC ARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETASKDGCP KPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIRDERNGK YSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVT GTEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSSN LLITTTHVLPKGVKELFGVEIKQIQVHKSGEFCRFRFPRSIRPDVTGLVLEEGAPEGT VCSILIKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDC GCPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGGEDRGTYCGAPILGP GKAPKLSTKTKFWRSSPDALPPGTYEPAYLGGKDPRVKKGPSLQQVMRDQLKPFTEPR GKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWN GESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLA TMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWDS TQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTS QWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKE YGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPN HEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEP MFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..987 /gene="ORF1" /product="p48" mat_peptide 988..2085 /gene="ORF1" /product="NTPase" mat_peptide 2086..2622 /gene="ORF1" /product="p22" mat_peptide 2623..3021 /gene="ORF1" /product="VPg" mat_peptide 3022..3564 /gene="ORF1" /product="Pro" mat_peptide 3565..5094 /gene="ORF1" /product="RdRp" gene 5078..6700 /gene="ORF2" CDS 5078..6700 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QNT54372.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD VRNNFYHYNQSNDSTLKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP 
TVESRTKPFSVPILTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEINQNTKFTPVGVIQDG STTHRNEPQQWVLPTYSGRKTLNVHLAPAVAPTFPGEQLLFFRSTMPGCSGVPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV" gene 6700..>7020 /gene="ORF3" CDS 6700..>7020 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QNT54373.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIDATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTK" ORIGIN 1 aagatggcgt ctaacgacgc ttccgctgcc gctgttgcca acagcaacaa cgacatcgca 61 aaatcttcaa gtgacggtgt gttttctaac atggctgtca cttttaaacg ggccctcggg 121 gcgcggccta aacagccgcc cccgaaggaa aaaccaccca aacccccgcg accacccaca 181 ccagagttga tcaaagagat ccctcccccc ccacccaatg gggaggatga accagtggtc 241 tcctacagcg ccaaagacgg cgtttccggg ctgcctgagc tcaccactgt cagacaaccg 301 gaagaaaaca acacggcgtt cagtgtcccc ccactcaacc aaagggagaa cagggacgcc 361 aaggagccac taactggaac aattattgaa atgtgggatg gagaaatcta ccattacggc 421 ctgtacgtgg aacgaggtct tatacttggt gtgcacaagc caccggcggc catcagcctt 481 gccaaggtcg agctaacacc gctctctttg ttctggagac ctgtgtacac cccccagtat 541 ctcatctctc cagacactct taggagacta catggagagt cgttccccta cactgcattt 601 gacaacaatt gctacgcctt ttgttgttgg gtactagacc taaacgactc atggctaagc 661 aggagaatga ttcagagaac aacaggtttc tttaggccat accaggactg gaacaggaaa 721 cccctcccca ctatggatga ttccaaatta aagaaggtag ccaacatatt cttgtgcact 781 ttgtcttcgc tatttaccag gcccattaag gacataatag ggaagctgaa acctctcaac 841 atccttaaca ttctggctac atgtgattgg actttcgcag gcatagtgga gtccttaata 901 ctcttggcag aactctttgg agttttctgg acacccccag atgtgtctgc gatgatcgcc 961 cctttactag gtgattatga actgcaagga cctgaggacc ttgcagtaga actggtccca 1021 gtggtgatgg gagggatagg cttggtgcta ggatttacca aagagaaaat tggaaagatg 1081 ctgtcgtccg ccgcatccac cttaagggct tgcaaagacc ttggtgcata cggactagaa 1141 attttgaaat tggtcatgaa atggttcttc 
ccaaagaaag aggaagcaaa tgagctggcc 1201 atggtgagat ccatcgagga cgcagtactg gacctcgagg caattgaaaa caaccacatg 1261 accgccctgc tcaaggataa agacagcttg gcaacctaca tgagaaccct tgaccttgag 1321 gaggagaaag ccagaaaact ctcaaccaaa tctgcttcac ctgacattgt gggtacaatc 1381 aacgctcttc tggcacgaat cgccgctgca cgctccctag tgcatcgggc gaaagaagag 1441 ctctccagca ggtcgaggcc tgtcgttgtg atgatatcgg gaaaaccagg gatagggaaa 1501 actcaccttg ccagggagtt ggccaagaag atcgcagcct ccctcacagg ggaccagcgt 1561 gtgggcctga tcccacgcaa tggcgttgac cactgggacg catacaaggg tgaaagagtt 1621 gtcctatggg acgactatgg gatgagcaac cccatacacg atgccctcag gttgcaggaa 1681 cttgctgaca cttgccccct cacgctaaat tgtgacagga ttgagaacaa aggaaaagtc 1741 tttgatagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat 1801 gtcaattttg aagcgtgctc gaggcgcatt gatttcctcg tgtacgcgga ggctcctgag 1861 gtggagaagg caaaacgcga cttcccaggc caacctgaca tgtggaagaa cgctttcagt 1921 cctgacttct cacacataaa actggcattg gctccacagg gtggttttga caagaacggc 1981 aacaccccgc atggaaaagg tgttatgaag actctcacca ctggctccct cattgcccga 2041 gcatcaggat tactccatga gagactagat gaatatgaat tacaaggccc agtcctcact 2101 accttcaact ttgatcgcaa caaagtgctt gcctttagac agcttgctgc tgaaaacaag 2161 tatgggctga tggacacaat gagaattgga aaacagctta aggatgtcaa gaccatgcca 2221 gacctcaaac aggcactcaa gaatgttgcg atcaagaagt gccagatagt gtatggtggt 2281 agcacctaca cgcttgaggc cgatggcaag ggtagtgtga aggttgacaa agtgcagagt 2341 gccaccgtgc aaactaacaa tgaactagcc ggcgccctgc accacctgag gtgcgccaga 2401 atcaggtact atgtcaagtg tgtccaggag gcattgtatt ccatcatcca aatcgctggg 2461 gccgcgtttg tcaccacgcg catcgccaag cgcatgaaca tacaaaacct ctggtccaag 2521 ccacaggtgg aagacacaga agagacggcc agcaaagatg gttgcccaaa acccaaagat 2581 gatgaagagt tcgtcgtttc atccgacgac atcaagactg agggcaagaa agggaagaac 2641 aagtccggcc gtggcaagaa gcacacagcc ttctcaagca aagggctcag tgatgaggag 2701 tacgatgagt acaagagaat cagagatgaa aggaatggta agtactccat agaagagtac 2761 cttcaggaca gagacaagta ctatgaggag gtggccattg ccagggcaac tgaagaggac 2821 ttctgtgaag aagaagaggc caaaatccgg cagagaattt 
ttagaccaac aaggaaacaa 2881 cgtaaagagg agagggcctc tttaggcttg gtcacaggca cagagatcag gaagagaaac 2941 ccagaagact tcaaacccaa aggaaagctg tgggctgatg atgacagaag tgttgactac 3001 aacgagaaac tcaactttga ggccccacca agcatctggt cgcggatagt caactttggt 3061 tccggttggg ggttctgggt ttcatccaac cttctgatca caacaacaca cgttctgcct 3121 aaaggggtta aggaactctt tggagttgaa attaaacaaa tccaagtcca caagtctgga 3181 gagttctgca gattcagatt cccgagatcc attaggccag atgtcacagg acttgtgctg 3241 gaggaaggag ccccagaagg cactgtctgt tccatactca taaaaaggcc tacaggtgag 3301 atgatcccct tggcagtgag gatgggcaca catgcatcca tgaaaataca gggccggacc 3361 gttggtggcc agatgggaat gctcctaaca ggggcgaatg caaagaacat ggatctcggc 3421 actggtcctg gtgactgcgg ttgtccctac atctacaaac gcggcaacga cattgttgtc 3481 gcgggtgttc acaccgcagc agcccgggga ggcaacactg tcatatgtgc cacccaaggg 3541 caggatgggg aggcagtcct tgagggaggt gaggaccgtg gcacctactg tggcgcccca 3601 attctgggcc ctggcaaggc gcccaaactc agcacgaaga ctaagttttg gcgctcgtca 3661 ccagatgcct tgccgcctgg cacgtatgaa cctgcttacc tgggaggcaa ggaccccaga 3721 gtgaaaaaag ggccttcctt gcagcaagtc atgagggacc aattgaaacc atttacagag 3781 cccagaggta aaccaccaag accaaatgtg ttggaagctg ccaagaaaac catcattaat 3841 gttcttgagc aaacaattga cccaccccaa aaatggtcat tcgcgcaagc atgcgcatcc 3901 cttgacaaaa ctacctccag cggccaccca caccacatgc ggaaaaacga ttgctggaat 3961 ggggagtcct ttacaggaaa attggcagat caggcctcca aggccaacct aatgtttgaa 4021 gagggaaaga acatgactcc agtctacaca ggtgcactta aagatgaact ggtgaagact 4081 gacaaaattt atggtaagat caagaagagg ctcctgtggg gctcggacct ggcgaccatg 4141 atacggtgcg cccgggcttt tgggggcctc atggatgaac tcaaggctca ctgtgtcacc 4201 cttcctgtca gagttggtat gaacatgaat gaggatggcc ccataatctt tgagaagcac 4261 tccagatata aatatcacta tgatgctgat tactccaggt gggactcaac acaacaaagg 4321 gatgtgctag cagcagcact tgaaatcatg gttaagttct ccccagaacc acatctggcc 4381 cagatagttg cagaagacct cctttccccc agcgtgatgg atgtgggtga ttttcaaata 4441 tcaataagtg agggtctccc ctctggggtg ccttgtacct cccagtggaa ttccatcgcc 4501 cactggcttc tcactctttg tgcactctct gaagtcacgg acctgtctcc 
tgacattatt 4561 caggccaact cccttttctc cttctacggt gatgatgaga ttgtaagcac agacataaag 4621 ttggatccag agaagctgac agcaaaactc aaggagtacg ggctgaaacc aacccgcccc 4681 gacaaaactg aaggacccct tgttatatct gaagacctgg atggcctgac tttcctccgg 4741 agaactgtga cccgtgatcc agctggttgg tttggaaaat tggaacaaag ttcaattctc 4801 aggcaaatgt actggactag gggtcccaac catgaagatc catttgaaac aatgatacca 4861 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcact tcatggcccg 4921 gcattttata gcaaaatcag caaactagtc attgcagagt tgaaggaagg tggcatggac 4981 ttttacgtgc ccagacaaga gccaatgttt agatggatga gattctcaga tctgagcacg 5041 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc 5101 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc 5161 tctggagcct gttgttggtg ccgccattgc ggcacctgta gcgggccaac aaaatgtaat 5221 tgacccctgg attagaaaca attttgtaca ggcccctggt ggagaattta cagtatcccc 5281 tagaaacgct ccaggtgaaa tactatggag cgcgccctta ggccctgatt taaaccccta 5341 cctatcccat ttggccagga tgtacaatgg ttacgcaggt ggttttgaag tgcaggtaat 5401 cctcgcgggg aacgcgttca ccgccgggaa ggtcatattt gcagcagtcc caccaaattt 5461 tccaactgaa ggcttaagcc ccagccaggt cactatgttc ccccatataa tagtagatgt 5521 taggcaacta gaacctgtgt tgattcccct acccgatgtt aggaataatt tctatcatta 5581 caatcaatca aatgattcca cccttaaatt gatagcaatg ttatatacac cacttagggc 5641 taataatgct ggggacgatg tcttcacagt ctcttgccga gtcctcacaa gaccatcccc 5701 cgattttgat ttcatattct tggtgccgcc cacagttgag tcaagaacta aaccattctc 5761 tgtcccaatt ctaaccgttg aggagatgac caattcaaga ttccccattc ccttggaaaa 5821 gttgttcacg ggtcccagca gtgcctttgt tgttcaacca caaaacggca ggtgcacgac 5881 tgatggcgtg ctcctaggca ccacccaact gtctcctgtc aacatctgca ccttcagagg 5941 ggatgtcacc cacattacag gtagtcgaaa ttatacaatg aatttggctt ctcaaaattg 6001 gaacaattat gacccaacag aagaaatccc agcccctcta ggaactccag attttgtagg 6061 gaagattcaa ggtatgctca cccaaaccac aaggacagat ggctcaacac gcggccacaa 6121 agctacagtg tacactggga gcgccgactt tgctccaaaa ttgggtagag ttcaatttga 6181 aactgacaca gaccatgact ttgaaattaa ccaaaacaca aagttcaccc cagtcggtgt 
6241 catccaagac ggtagcacca cccaccgaaa tgagccccaa cagtgggtgc tcccaactta 6301 ctcaggcaga aaaacactta atgtgcatct ggcccccgct gtagccccca cttttccggg 6361 tgagcaactt cttttcttca gatctaccat gcccggatgc agcggtgtac ccaacatgga 6421 tttggactgt ctgctccccc aggaatgggt gcagtacttc taccaagagg cagccccagc 6481 acaatctgat gtggctctgc taagatttgt gaatccagat acaggtaggg ttttgtttga 6541 gtgtaagctt cataaatcag gctatgttac agtggctcac actggccaac atgatctggt 6601 tatccccccc aatggttatt ttaggtttga ttcttgggtc aaccagttct acacactcgc 6661 ccccatggga aatggaacgg ggcgtagacg tgtagtataa tggctggagc tttctttgct 6721 ggattggcat ctgatgtcct tggctccgga cttggttccc tcatcaatgc tggggctggg 6781 gccatcaacc aaaaagttga gtttgaaaat aacagaaaat tacaacaagc atccttccag 6841 tttagcagta atttgcaaca ggcttccttt caacatgaca aagagatgct ccaagcacaa 6901 attgatgcca ccaaaaagct acaacaggaa atgatgaaag ttaagcaggc aatgctccta 6961 gagggtgggt tttctgagac agatgcagcc cgcggggcaa tcaacgcccc catgacaaaa //
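
The ORF coordinates listed at the top of this record (1-based, inclusive) imply a few simple quantities: the length of each ORF, its reading frame relative to position 1, and how many nucleotides adjacent ORFs share. The sketch below works these out in Python. It is an illustration only and not part of the typing tool; the names `ORFS`, `orf_length`, `reading_frame`, and `overlap` are hypothetical, and the coordinates are copied by hand from the summary above.

```python
# Illustrative sketch: ORF coordinates copied from the record summary
# (1-based, inclusive). All names here are hypothetical helpers.
ORFS = {
    "ORF1": (1, 5097),
    "ORF2": (5078, 6700),
    "ORF3": (6700, 7020),
}


def orf_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive interval."""
    return end - start + 1


def reading_frame(start: int) -> int:
    """Frame (0, 1 or 2) of an ORF relative to position 1 of the record."""
    return (start - 1) % 3


def overlap(a: tuple, b: tuple) -> int:
    """Number of positions shared by two 1-based, inclusive intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        length = orf_length(start, end)
        print(f"{name}: {length} nt, {length // 3} codons, "
              f"frame {reading_frame(start)}")
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

With these coordinates, ORF1 and ORF2 share 20 nt (5078..5097) and ORF2 and ORF3 share the single position 6700. The codon counts include a terminal stop codon where the CDS is complete at its 3' end, so a translated protein can be one residue shorter than the count.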