Complete norovirus genomes

MW019956 | GII.4 | GII.P31 |
---|---|---|
ORF1: 1..5097
ORF2: 5078..6700
ORF3: 6700..7017

LOCUS       MW019956                7017 bp    RNA     linear   VRL 29-DEC-2020
DEFINITION  Norovirus GII isolate 1048 nonstructural polyprotein (ORF1) gene,
            partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene,
            partial cds.
ACCESSION   MW019956
VERSION     MW019956.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7017)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Next Generation Sequencing of Near-Full Length Genome of Norovirus
            GII.4 from Botswana
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7017)
  AUTHORS   Makhaola,K., Moyo,S. and Kebaabetswe,L.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-SEP-2020) Biosciences and Biotechnology, Botswana
            International University of Science and Technology, Khurumela,
            Palapye 00, Botswana
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: Genome Detective v. 1.126
            Sequencing Technology :: Oxford Nanopore(MinION)
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7017
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="1048"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Botswana: Gaborone"
                     /collection_date="20-Mar-2017"
                     /note="genotype: GII.4 Sydney[GII.P31]"
     gene            <1..5097
                     /gene="ORF1"
     CDS             <1..5097
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QNT54362.1"
                     /translation="KMASNDASAAAAANSNNDIAKSSSDGVFSNMAVTFKRALGARPK
                     QPPPKEIPTRPPRPPTPELVKKIPPPPPNGEEELVVSYSAKDGVSGLPELTTVSQPEE
                     TNTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISL
                     AKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMIQRTTGFFRPYQEWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKL
                     KPLNILNILATCDWTFPGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTK
SASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLARELA RKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA KRDFPGQPDMWKNAFSPDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS GLLHERLDEYELQGPVLTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMP DLKQALKNVAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRC ARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETASKDGCP KPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIRDERNGK YSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTKKQRKEERASLGLVT GSDIRKRNPDDFKPKGKLWADDTRSVDYNERLDFEAPPSVWSRIVPLGTGWGFWVSSN LLITTTHVLPKGVKELFGVEIKQIQVHKSGEFCRFRFPRSIRPDVTGLVLEEGAPEGT VCSILIKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDC GCPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGGEDRGTYCGAPILGP GKAPKLSAKTKFWRSSPDALPPGTYEPAYLGGKDPRVEKGPSLQQVMRDQLKPFTEPR GKPPRPAVLEEAKKTVMNVLEQTIDPPQPWSYSQACASLDKTTSSGHPHHVKKNDHWN GESFTGPLADQASKANLMYEQAKNMKPVYTGALKDELVKTDKIYKKIKKRLLWGSDLA TMIRCARAFGGLMDSMKASCIALPCRVGMNMNEDGPIIFDKHSKYRYHYDADYSRWDS TQQRSILSAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTS QWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKE YGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPN HEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEP MFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..987 /gene="ORF1" /product="p48" mat_peptide 988..2085 /gene="ORF1" /product="NTPase" mat_peptide 2086..2622 /gene="ORF1" /product="p22" mat_peptide 2623..3021 /gene="ORF1" /product="VPg" mat_peptide 3022..3564 /gene="ORF1" /product="Pro" mat_peptide 3565..5094 /gene="ORF1" /product="RdRp" gene 5078..6700 /gene="ORF2" CDS 5078..6700 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QNT54363.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP 
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASLNWNSYDPTEEIPAPLGTPDFVGKIQGML TQTTRTDGSTRGHKATVYTGSADFSPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG STTHRNEPPKWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGVPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV" gene 6700..>7017 /gene="ORF3" CDS 6700..>7017 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QNT54364.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIDATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN APMT" ORIGIN 1 aagatggcgt ctaacgacgc ttccgctgcc gctgctgcca acagcaacaa cgacatcgca 61 aaatcttcaa gtgacggtgt gttttctaac atggctgtca cttttaaacg ggccctcggg 121 gcgcggccta aacaaccgcc cccgaaggaa ataccaacca gacccccacg accacccaca 181 ccggaattgg tcaaaaagat cccccctccc ccacccaacg gggaggaaga attagtggtt 241 tcttacagcg ccaaagacgg cgtttccgga ttgcctgagc ttaccactgt cagccaaccg 301 gaagaaacca atacggcgtt cagtgttccc ccgctcaatc aaagggagaa tagggacgcc 361 aaggaaccac taactgggac aattattgaa atgtgggatg gagaaatcta ccattacggc 421 ctgtacgtgg aacgaggtct tatacttggt gtgcacaagc caccagcagc catcagcctt 481 gccaaggtcg agttaacacc actctctttg ttctggagac ctgtgtacac cccccagtat 541 ctcatctctc cagacactct caggagacta catggagagt cattccccta caccgcattt 601 gacaacaatt gctacgcctt ctgctgttgg gtattagacc taaacgactc atggctaagt 661 aggagaatga ttcagagaac aacaggtttc ttcagaccat accaggagtg gaacaggaaa 721 cccctcccca ctatggatga ctccaaattg aagaaggtag ccaacatatt cttgtgcacc 781 ctgtcctcac tattcaccag acccattaag gacataatag ggaaattgaa acctcttaac 841 attctcaata ttctggctac atgtgattgg accttcccag gtatagtgga atccctaata 901 ctcttggcag agctctttgg agttttctgg acacccccag atgtgtctgc gatgatcgcc 961 cctttgctag gtgattatga actgcaagga cctgaggacc ttgcagtgga actggtccca 1021 gtagtgatgg gggggatagg tttggtgcta ggattcacca aagagaaaat tggaaaaatg 1081 ctgtcatccg ctgcatccac tttaagagct tgtaaagacc ttggtgcata cggactggaa 1141 attttaaaac tggtcatgaa gtggttcttc 
ccaaagaaag aggaagcaaa tgagctggct 1201 atggtgagat ccattgagga cgcagtgcta gacctcgaag caattgaaaa caaccacatg 1261 accaccctac tcaaggacaa agatagcttg gcaacctaca tgagaaccct tgaccttgag 1321 gaggagaaag ctagaaaact ctcaaccaaa tctgcttcac ccgatattgt gggcacaatc 1381 aacgctcttc tggcaagaat cgctgctgca cgctccctag tgcatcgggc gaaagaagag 1441 ctctccagca ggccgagacc tgttgttgtg atgatatcgg gaaaaccagg gatagggaaa 1501 acccaccttg ccagggagtt ggccaggaag atcgcagcct ccctcacagg ggaccagcgt 1561 gtgggcctga tcccacgcaa tggcgttgac cactgggacg catacaaggg tgaaagagtt 1621 gtcctatggg acgactatgg gatgagcaac cccatacacg atgccctcag gttgcaggaa 1681 cttgctgaca cttgccccct cacgctaaat tgtgacagga ttgagaacaa aggaaaagtc 1741 tttgatagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat 1801 gtcaattttg aagcgtgctc gaggcgcatt gatttcctcg tgtacgcgga ggctcctgag 1861 gtggagaagg caaaacgcga cttcccaggc caacctgaca tgtggaagaa cgctttcagt 1921 cctgacttct cacacataaa actggcattg gctccacagg gtggttttga caagaacggc 1981 aacaccccgc atggaaaagg tgttatgaag actctcacca ctggctccct cattgcccga 2041 gcatcaggat tactccatga gagactagat gaatatgaat tacaaggccc agtcctcact 2101 accttcaact ttgaccgcaa caagatactt gcatttagac agcttgctgc tgaaaacaag 2161 tatgggttga tggacacaat gagagttgga aaacagctta aggatgtcaa gaccatgcca 2221 gaccttaaac aagcactcaa aaatgtcgcg atcaagaagt gccagatagt gtataatggt 2281 agcacctaca cgcttgaggc cgatggcaag ggtagtgtga aggttgacaa agtgcagagt 2341 gccaccgtgc aaactaacaa tgaactagcc ggcgccctgc accacctgag gtgcgccaga 2401 atcaggtact atgtcaagtg tgtccaggag gcattgtatt ccatcatcca aatcgctggg 2461 gccgcgtttg tcaccacgcg catcgccaag cgcatgaaca tacaaaacct ctggtccaag 2521 ccacaggtgg aagacacaga agagacggcc agcaaagatg gttgcccaaa acccaaagat 2581 gatgaagagt tcgtcgtttc atccgacgac atcaagactg agggcaagaa agggaagaac 2641 aagtccggcc gtggcaagaa gcacacagcc ttctcaagca aagggctcag tgatgaggag 2701 tacgatgagt acaagagaat cagagatgaa aggaatggta agtactccat agaagagtac 2761 cttcaggaca gagacaagta ctatgaggag gtggccattg ccagggcaac cgaggaagac 2821 ttctgtgagg aggaggaggc caaaataagg caaagaattt 
tccgcccgac aaagaaacag 2881 aggaaggagg aaagggcctc tcttggcttg gtcactggct cagacatcag gaagagaaac 2941 ccagacgact tcaaaccaaa aggcaaactg tgggcagatg acactagaag tgtggattac 3001 aatgaaaggc ttgacttcga ggcgccccca agtgtttggt caagaatagt cccattgggc 3061 accggttggg ggttctgggt ttcatccaac cttctgatca caacaacaca cgttctgcct 3121 aaaggggtta aggaactctt tggagttgaa attaaacaaa tccaagtcca caagtctgga 3181 gagttctgca gattcagatt cccgagatcc attaggccag atgtcacagg acttgtgctg 3241 gaggaaggag ccccagaagg cactgtctgt tccatactca taaaaaggcc tacaggtgag 3301 atgatcccct tggcagtgag gatgggcaca catgcatcca tgaaaataca gggccggacc 3361 gttggtggcc agatgggaat gctcctaaca ggggcgaatg caaagaacat ggatctcggc 3421 actggtcctg gtgactgcgg ttgtccctac atctacaaac gcggcaacga cattgttgtc 3481 gcgggtgttc acaccgcagc agcccgggga ggcaacactg tcatatgtgc cacccaaggg 3541 caggatgggg aggcagtcct tgagggaggt gaggaccgtg gcacctactg tggcgcccca 3601 attctgggcc ctggcaaggc gcccaaactc agcgcgaaga ctaagttttg gcgctcgtca 3661 ccagatgcct tgccgcctgg cacgtatgaa cctgcttacc tgggaggcaa ggaccccaga 3721 gtggaaaaag ggccttcctt gcagcaagtc atgagggacc aattgaaacc attcacagaa 3781 cccagaggca aaccacctag gcctgcagtt ttagaagagg ccaaaaagac agtgatgaat 3841 gttctggaac aaaccatcga cccaccccaa ccatggtcct actcgcaagc atgtgcctca 3901 ctggacaaaa ccacctctag tggtcacccc catcacgtca agaaaaacga ccactggaat 3961 ggagagtcct tcaccggtcc cttggcagac caggcatcca aagccaacct catgtatgag 4021 caggctaaga acatgaagcc agtgtataca ggtgcgctta aagatgagct tgttaagact 4081 gataagatct acaaaaagat aaagaagagg ctcctatggg gttcggacct ggcgacaatg 4141 atcaggtgcg ccagggcttt tggtggtctc atggacagca tgaaggcaag ttgtatagct 4201 ctcccatgca gggtagggat gaacatgaat gaagatggcc ccatcatatt tgacaagcat 4261 tctaagtaca ggtaccacta tgatgctgac tattctaggt gggattcaac ccagcaaagg 4321 agcattctat ctgcagcact ggaaatcatg gttaagtttt ctccagaacc acacttggcc 4381 cagatagttg cagaagacct cctttcccct agtgtaatgg atgtgggtga ctttcaaata 4441 tcaataagtg aggggcttcc ctccggggtg ccttgcacct cccagtggaa ctccatcgcc 4501 cactggctcc tcaccctttg tgcactctct gaagtcacgg acctgtctcc 
tgacatcatc 4561 caggccaact cccttttctc cttctatggt gatgacgaga ttgtgagtac agacataaag 4621 ttggacccag agaagctgac agcaaaactc aaggagtacg ggctgaagcc aacccgcccc 4681 gacaaaactg agggacccct tgttatctct gaagacctgg atggcctgac attcctccgg 4741 aggactgtga cccgtgatcc agctggctgg tttggaaaat tggaacaaag ctcaattctc 4801 aggcaaatgt actggaccag aggtcccaac catgaagatc catctgagac aatgatacca 4861 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcact ccacggccca 4921 gcattttaca gcaaaattag caaattggtc attgcagaat tgaaggaagg tggcatggat 4981 ttttacgtgc ccaggcaaga gccaatgttc agatggatga gattctcaga tctgagcacg 5041 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc 5101 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc 5161 tctggagccc gttgttggtg ccgctattgc ggcacctgta gcgggccagc aaaatgtaat 5221 tgacccctgg attagaaaca attttgtaca agcccctggt ggagagttta cagtatcccc 5281 tagaaacgct ccaggtgaaa tactatggag cgcgccctta ggccctgatc taaatcccta 5341 cctatcccat ttggccagaa tgtacaatgg ttatgcaggt ggttttgaag tgcaggtaat 5401 tctcgcgggg aacgcgttca ccgctgggaa ggtcatattt gccgcagtcc caccaaattt 5461 cccaactgaa ggcttgagcc ccagccaggt cactatgttc ccccatatag tagtagatgt 5521 taggcaacta gaacctgtgt tgattcccct acccgacgtc aggaataatt tctatcacta 5581 caatcagtca aatgacccca ctattaagtt gatagcaatg ctgtatacac cacttagggc 5641 taataatgct ggggatgatg tcttcacagt ctcttgccgg gtcctcacga gaccatcccc 5701 cgattttgat tttatatttc tagtgccacc cacagttgag tcaagaacta agccattctc 5761 tgtcccggtc ttaactgttg aggagatgac caattcaaga ttccccatcc ccttggaaaa 5821 gttgttcacg ggtcccagca gtgcctttgt tgttcaaccg caaaacggca ggtgtacgac 5881 tgatggcgtg ctcctaggta ccacccaact gtctcctgtc aacatctgca ccttcagagg 5941 ggatgtcacc cacatcacag gtagtcgtaa ctacacaatg aatttggctt ctctaaattg 6001 gaacagttat gacccaacag aagaaatccc agcccctctg ggaactccag attttgtagg 6061 gaaaatccaa ggcatgctca cccaaaccac aaggacagat ggctcaacac gcggccacaa 6121 agctacagtg tacactggga gcgccgattt ttctccaaaa ttgggtagag ttcaatttga 6181 aactgacaca gaccatgatt ttgaagctaa ccaaaacaca aagttcaccc cagtcggtgt 
6241 catccaagat ggcagcacca cccaccgaaa tgaaccccca aagtgggtgc tcccaagtta 6301 ctcaggcaga aacaccccta acgtgcatct ggcccccgct gtagccccca cttttccagg 6361 tgagcaactt cttttcttca gatctaccat gcccggatgc agcggggtac ccaacatgga 6421 tttggactgt ctgctccccc aggaatgggt gcagtacttc taccaagagg cagccccagc 6481 acaatctgat gtggctctgc taagatttgt gaatccagat acaggtaggg ttttgtttga 6541 gtgtaagctt cacaaatcag gctatgttac agtggctcac actggccaac atgatttggt 6601 tatccccccc aatggttatt ttaggtttga ttcttgggtc aaccagttct acacactcgc 6661 ccccatggga aatggaacgg ggcgtagacg tgtagtataa tggctggagc tttctttgct 6721 ggattggcat ctgatgtcct tggctccgga cttggttccc tcatcaatgc tggggctggg 6781 gccatcaacc aaaaagttga gtttgaaaat aacagaaaat tacaacaagc atccttccag 6841 tttagcagta atttgcaaca ggcttccttt caacatgaca aagagatgct ccaagcacaa 6901 attgatgcca ccaaaaagct acaacaggaa atgatgaaag ttaagcaggc aatgctccta 6961 gagggtgggt tttctgagac agatgcagcc cgcggggcaa tcaacgcccc catgaca //
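
The ORF coordinates listed for this record (ORF1: 1..5097, ORF2: 5078..6700, ORF3: 6700..7017) imply that adjacent reading frames share a few nucleotides. The following is a minimal sketch, not part of the original record, that derives the ORF lengths and pairwise overlaps from those coordinates alone. The names `ORFS`, `orf_length`, and `overlap` are illustrative assumptions, and the coordinates are treated as 1-based and inclusive, as in the GenBank feature table.

```python
# ORF coordinates copied from the record (1-based, inclusive).
# Note: ORF1 is partial at its 5' end (<1) and ORF3 at its 3' end (>7017),
# so their lengths here describe only the sequenced portion.
ORFS = {
    "ORF1": (1, 5097),
    "ORF2": (5078, 6700),
    "ORF3": (6700, 7017),
}


def orf_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive 1-based interval."""
    return end - start + 1


def overlap(a: tuple, b: tuple) -> int:
    """Number of nucleotides shared by two inclusive intervals (0 if disjoint)."""
    shared_start = max(a[0], b[0])
    shared_end = min(a[1], b[1])
    return max(0, shared_end - shared_start + 1)


lengths = {name: orf_length(*coords) for name, coords in ORFS.items()}
overlaps = {
    ("ORF1", "ORF2"): overlap(ORFS["ORF1"], ORFS["ORF2"]),
    ("ORF2", "ORF3"): overlap(ORFS["ORF2"], ORFS["ORF3"]),
    ("ORF1", "ORF3"): overlap(ORFS["ORF1"], ORFS["ORF3"]),
}

for name, length in lengths.items():
    print(f"{name}: {length} nt")
for (left, right), shared in overlaps.items():
    print(f"{left}/{right} overlap: {shared} nt")
```

Under these assumptions, ORF1 and ORF2 share the 20 nucleotides at positions 5078..5097, and ORF2 and ORF3 share the single nucleotide at position 6700.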