Typing tool
|
Complete norovirus genomes
OK376252 | GII.2 | ||
---|---|---|---|
GII.P16 |
ORF1: 4..5103 ORF2: 5084..6694 ORF3: 6694..7473LOCUS OK376252 7517 bp RNA linear VRL 11-OCT-2021 DEFINITION Norovirus GII isolate CHN-GII4 nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds. ACCESSION OK376252 VERSION OK376252.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7517) AUTHORS Li,W., Tong,Y. and Mao,P. TITLE Direct Submission JOURNAL Submitted (05-OCT-2021) Fifth Medical Center, Chinese PLA General Hospital, West Fourth Ring Middle Road, Beijing 100039, China COMMENT ##Assembly-Data-START## Assembly Method :: Newbler v. 2.3; CLC Genomic Workbench v. 9.0 Sequencing Technology :: Sanger dideoxy sequencing; Illumina; IonTorrent ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7517 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="CHN-GII4" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="China" /collection_date="Oct-2018" /note="genotype: GII.2" gene 4..5103 /gene="ORF1" CDS 4..5103 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="UBX26718.1" /translation="MKMASNDATVAVACNNNNDKEKSSGEGLFTNMSSTLKKALGARP KQPAPRDKPQKPPRPPTPELVKRIPPPPPNGEEEKEPVIRYEVKSGISGLPELTTVPQ PDVANTAFSVPPLSLRENREAKEPLTGAILEMWDGEIYHYGLYVEKGLVLGVHKPPAA ISMARVELTPLSLYWRVVYTPQYLISPETLRRLNGEAFPYTAFDNNCYAFCCWVLDLN DSWLSRRMIQRTTGFFRPYQEWNRKPLPTMDDSKIKKVANIFLCSLSTLFTRPIKDLI GKIKPLNILNILATCDWTFAGIVESLILFAELFGVFWTPPDVSAMIAPLLGDYELQGP EDLAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWF FPKKEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDMEEEKARRL STKSASPDIVGTINALLARIAAARSLVHRAKEELSSRIRPVVVMISGKPGIGKTHLAR EVARKVASTLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELAD TCPLTLNCDRIENKGKVFDSEVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEV EKAKRDFPGQPDMWKDAFKPDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTGSLIA RASGLLHERMDEFELQGPTVTTFNFDRNRITAFRQLAAENKYGLVDTMKVGNQLKGVK TMEELKQAIRNVTIKRCQIIYGGSTYDLESDGKGKVLVEKVKNTSVQTNNELAGALHH LKHARIRYYVKCVQEAVYSIIQIAGAAFVTTRIARRMNIQELWSKPQLDQNESETKEE APKSEDDEFIISSKDIKEEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLV TGSEIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFEAPPSIWSRIVSFGSGWGFWVSP SLFITSTHVIPAGITEAFGVSIKQIQIHKSGEFCRFRFPKPIRPDVTGMILEEGAPEG TVATVLIKRPTGELMPLAARMGTHATMKIQGRMVGGQMGMLLTGSNAKGMDLGTTPGD CGCPYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGDDKGTYCGAPILG PGGAPKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEP RGKPPRPSVLEAAKQTIINVLEQTLDPPQKWTYAQACASLDKTTSSGHPHHIRKNEFW NGETFTGKLADQASKANLMFEEGKHMTPVYTAALKDELVKTEKIYGRIKKRLLWGSDL STMIRCARSFGGLMDEMKAHCISLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD STQQRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPSVVDVGDFKITINEGLPSGVPCT SQWNSIAHWLLTLCALSEVTKLSPDIIQANSMFSFYGDDEIVSTDIKLDPEQLTAKLK EYGLKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGP NHEDPKETMIPHSQRPIQLMALLGEASLHGPSFYSKISKLVITELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPNFVNEDGVE" mat_peptide 4..999 /gene="ORF1" /product="p48" mat_peptide 1000..2097 /gene="ORF1" /product="NTPase" mat_peptide 2098..2628 /gene="ORF1" /product="p22" mat_peptide 2629..3027 /gene="ORF1" /product="VPg" mat_peptide 3028..3570 /gene="ORF1" /product="Pro" mat_peptide 3571..5100 /gene="ORF1" /product="RdRp" gene 5084..6694 /gene="ORF2" CDS 5084..6694 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="UBX26719.1" /translation="MKMASNDAAPSTDGAAGLVPESNNEVMALEPVAGAALAAPVTGQ TNIIDPWIRANFVQAPNGEFTVSPRNAPGEVLLNLELGPELNPYLAHLARMYNGYAGG MEVQVMLAGNAFTAGKLVFAAVPPHFPVENLSPQQITMFPHVIIDVRTLEPVLLPLPD VRNNFFHYNQKDDPKMRIVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFTYLVPP TVESKTKPFTLPILTLGELSNSRFPVSIDQMYTSPNEIISVQCQNGRCTLDGELQGTT QLQVSGICAFKGEVTAHLHDNDHLYNVTITNLNGSPFDPSEDIPAPLGVPDFQGRVFG IISQRDKHNSPGHNEPANRGHDAVVPTYTAQYTPKLGQIQIGTWQTDDLTVNQPVKFT PVGLNDTEHFNQWVVPRYAGALAPSVAPVFPGERLLFFRSYIPLKGGYGNPAIDCLLP QEWVQHFYQEAAPSMSEVALVRYINPDTGRALFEAKLHRAGFMTVSSNTSAPVVVPAN GYFRFDSWVNQFYSLAPMGTGNGRRRVQ" gene 6694..7473 /gene="ORF3" CDS 6694..7473 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="UBX26720.1" /translation="MAGAFVAGLAGDVLSNGLSSLINAGANAINQRAEFDFNQKLQQN SFNHDKEMLQAQIQATKQLQADMMAIKQGVLTAGGFSPADAARGAVNAPMTQALDWNG TRYWAPNSMRTTSYSGKFTSTAPVRQAGFQHTQSRPSSGSSVSSFATQSSRPTLTTTT GSSHGTTSSNSTRSTSLSQSTVSRATSRTSEWVRDQNRNLEPYMHGALQTAFVTPPSS RASDGTVSTVPKGVLDSWTPAFNTRRQPLFAHLRKRGESQA" ORIGIN 1 ttaatgaaga tggcgtctaa cgacgctacc gttgccgttg cttgcaacaa caacaacgac 61 aaagaaaaat cttcaggtga aggcttattc acaaatatgt cttccacctt aaagaaagcc 121 ctcggggcta ggcccaaaca acctgccccg agagacaaac cacaaaagcc cccaagacca 181 ccaactcccg agttggtcaa gaggataccc cctcctccac ctaacggcga agaagaaaaa 241 gaaccagtca ttaggtatga ggttaagagt gggatctctg gtctgcccga gctcacaaca 301 gtcccccaac cggacgtggc caacacagca ttcagtgttc caccactgag cttgagagaa 361 aacagggagg ccaaggagcc gctaacaggg gcaatattag agatgtggga tggagagata 421 taccactatg gcctgtacgt ggagaaaggc ttagtgttgg gtgtgcacaa accacctgca 481 gccataagca tggcaagagt ggagctgacg ccgttgtcat tgtactggcg tgtggtgtac 541 actccccaat acctcatctc ccctgaaacc ctcaggaggc tcaacggaga ggcattccct 601 tacaccgcct ttgacaacaa ctgctatgcc ttttgctgct gggtgttaga cctcaatgac 661 tcatggctta gcaggaggat gatacaaaga acaacgggct tcttcagacc ttaccaagag 721 tggaacagaa aacccctgcc taccatggat gactccaaaa ttaagaaggt agcaaatata 781 ttcctatgtt cattgtccac attattcacc agacccataa aagacctcat aggaaaaatt 841 aaaccgttga acatattgaa tatcctggca acgtgtgact ggacttttgc cggaatagtg 901 gagtctctga tattatttgc tgaactcttc ggagttttct ggacaccccc agatgtgtct 961 gctatgatcg ctcccttact cggggactac gagttgcaag ggccagaaga cctcgccgtt 1021 gagctcgtac ctgtggtaat gggggggatt ggcttggtgt tgggattcac caaagagaaa 1081 attggcaaaa tgttgtcctc agcagcatca acactcaggg cttgcaaaga tcttggtgcc 1141 tatggcttag agatactcaa attggtcatg aagtggttct tcccaaagaa agaggaggcc 1201 aatgagctag ctatggtgag ggccatagag gatgctgtat tagaccttga ggcaatagaa 1261 aataaccaca tgacaaccct gttgaaagac aaagacagct tagcaacata catgaaaaca 1321 ctggacatgg aggaggagaa agccaggagg ttgtccacaa aatctgcatc ccctgacata 1381 gttgggacaa tcaacgccct gctggctcga atagcagcgg ccaggtcatt agtccacagg 1441 gccaaggaag agctatctag taggataagg ccagtagttg ttatgatatc tggcaagcca 1501 ggaataggca aaactcatct ggccagggag gtggcaagaa aggtggcatc cactctcaca 1561 ggggatcaaa gagtcggact cataccacga aatggtgtgg accattggga tgcatacaaa 1621 ggtgagagag tcgtgctgtg ggacgactat ggcatgagta accctattca tgatgctctt 1681 cgcatacaag aattggctga tacgtgtccc cttaccttaa attgtgacag aattgaaaat 1741 aagggaaaag tttttgacag tgaagtcata ataattacaa caaaccttgc caatccagcc 1801 ccacttgatt atgtcaactt tgaggcctgt tccaggagaa ttgatttcct ggtgtacgct 1861 gaggcaccag aagtagaaaa ggcaaaacgg gactttcctg gtcagccaga tatgtggaag 1921 gacgccttca agccggactt ttcacacatc aagctacagc ttgcacccca gggcggcttt 1981 gacaagaatg gcaacacccc acatgggaaa ggggtgatga agaccctcac taccggttct 2041 ctgattgccc gtgcatcagg cctactgcat gagaggatgg atgaatttga actccaaggc 2101 cccacagtca ccaccttcaa tttcgaccga aacagaatca cggcattcag acaattggct 2161 gcagaaaaca agtatggatt ggtggatacc atgaaagttg gcaatcaatt aaaaggagtg 2221 aaaaccatgg aagaactcaa acaagcaatc aggaatgtga ccatcaagag gtgccagatc 2281 atctacggag gttccacgta tgaccttgaa tctgatggca agggcaaagt tttggtggaa 2341 aaggtcaaga acacctctgt acagaccaat aacgagttgg ctggggccct gcaccatctc 2401 aaacatgccc gaatcaggta ctatgtcaaa tgtgtgcaag aagcagtcta ttccatcata 2461 caaattgccg gcgctgcgtt tgtcaccacg cgcatcgcac gccgcatgaa catacaagaa 2521 ctctggtcaa agccacaact agatcaaaat gaatcagaga ctaaggaaga ggcccccaag 2581 tcagaagatg acgaattcat catatcttct aaggacatca aggaggaagg aaagaagggc 2641 aaaaacaaaa ctggtcgtgg caagaaacac actgcattct ccagcaaggg cttgagcgat 2701 gaggagtatg acgagtacaa gaggataaga gaagagagaa atgggaagta ctctatagag 2761 gagtatcttc aagacagaga caggtactat gaggagctcg ccattgccaa ggccacggag 2821 gaagacttct gtgaagagga ggagatcaaa atccgtcaga gaattttccg tcccaccaga 2881 aaacaaagaa aggaagagag ggccacatta gggctagtaa caggttcaga aatcagaaaa 2941 agaaaccctg atgacttcaa acccaaaggg aagctgtggg ccgatgacaa caggagtgtt 3001 gactacaatg agaaactgga ctttgaggcc cccccaagca tatggtctag gattgtgagc 3061 tttggctctg gctggggctt ctgggtgtca ccaagccttt tcataacatc aactcatgta 3121 atccccgcgg gcataacaga agcatttgga gtctccatca aacaaatcca gatccacaaa 3181 tcaggcgaat tttgccgatt cagattccca aaaccaatta gaccagatgt gacaggaatg 3241 atcttggaag aaggtgcgcc tgagggcacc gtggcaactg tgctcatcaa acgtcccacc 3301 ggagagctca tgcctcttgc agccagaatg ggaacacacg caaccatgaa aattcaaggc 3361 cgcatggttg gcggacagat gggtatgttg ctcactggat caaatgctaa aggaatggac 3421 ttgggaacaa ctcctggtga ctgtggctgt ccttacatct ataaaagggg caatgactat 3481 atagtcattg gggtgcacac tgcagcagcc cgtggtggaa acactgtcat ctgtgccaca 3541 cagggaagcg agggtgaggc aactcttgag ggtggagatg acaaagggac atactgtggg 3601 gcacccattc taggccctgg gggtgcacca aaattgagca ccaaaaccaa attttggagg 3661 tcatcgaaca cgccccttcc accagggaca tatgagcctg cctacctcgg tggccgtgat 3721 ccgcgtgtta agggtgggcc ttccctgcag caggtgatga gagaccagtt gaagccattc 3781 actgaaccca ggggcaaacc tccaagacca agtgtattgg aagcagccaa acaaaccatc 3841 atcaatgtcc tcgaacaaac cctggaccct ccacaaaaat ggacatacgc acaggcgtgc 3901 gcctcacttg acaaaaccac ttccagcggg cacccccatc acatccgaaa gaatgaattc 3961 tggaatggtg agaccttcac tggtaaactg gcagaccaag catcaaaagc aaacctaatg 4021 tttgaggaag ggaaacacat gacaccagtg tacacagcag cactcaagga cgagctagtc 4081 aagactgaga aaatttatgg aaggatcaag aagagactgc tctggggctc tgacttgtcc 4141 accatgatcc ggtgcgctag gtcatttggt gggcttatgg acgagatgaa ggcacactgc 4201 atatcactcc cagttcgagt tggcatgaat atgaatgaag atggcccaat aatatttgag 4261 aaacattcca gatacaaata tcactatgat gcagactact ctcgttggga ttcaacacaa 4321 cagagggcag ttctagcagc agccttggaa atcatggtta gattctctgc agaaccacaa 4381 ttggcacaaa tagtcgctga ggatctgctg gcccctagtg tagtagatgt gggagacttt 4441 aaaatcacaa taaatgaagg gctcccttct ggtgtgccat gcacttctca atggaactcc 4501 atcgcacact ggctgctaac tctctgtgcc ttgtctgaag tcaccaaact atcccctgac 4561 attatacagg caaattccat gttctcattt tacggtgatg acgagattgt cagtaccgac 4621 ataaaattgg accctgaaca gttaaccgcc aaattgaagg agtacggcct gaaaccaacc 4681 cgcccagaca aaaccgaggg acccctgatt atcagtgaag atttgaacgg tctcactttc 4741 ctccgaagga cggtgactcg tgacccagct ggctggttcg gaaaactgga ccaaagctca 4801 attttgaggc agatgtattg gactagagga ccaaatcatg aagaccccaa agagacaatg 4861 ataccccatt cccaaagacc catacagctc atggcactgc ttggtgaagc ctctcttcac 4921 ggaccctctt tctacagtaa aatcagtaaa ttggtcataa ctgaactcaa agaaggtggg 4981 atggactttt acgtgccaag gcaggaaccc atgttcaggt ggatgaggtt ttctgacttg 5041 agcacgtggg agggcgatcg caatctggct cccaattttg tgaatgaaga tggcgtcgaa 5101 tgacgccgct ccatctactg atggtgcagc cggcctcgtg ccagaaagta acaatgaggt 5161 catggctctt gaacccgtgg ctggtgccgc cttggcagcc ccagtcaccg gtcaaacaaa 5221 tattatagac ccttggatta gagcaaattt tgtccaggcc cccaatggtg aatttacagt 5281 ctctccccga aatgcccctg gtgaagtgct actgaatcta gagttgggtc cagaattaaa 5341 tccttatctg gcacatttag caagaatgta caatgggtat gccggtggga tggaggtgca 5401 ggtcatgttg gctgggaacg cgttcacagc cggcaagttg gtcttcgccg ccgtgccacc 5461 ccacttcccg gttgaaaacc ttagcccaca gcaaatcacc atgttccctc atgtgatcat 5521 agatgtgagg accttggaac ctgttttgtt accactccct gatgttagga ataacttctt 5581 ccattataac cagaaagatg atcccaagat gagaattgtg gctatgcttt atacccccct 5641 caggtctaat ggttcaggtg atgatgtgtt tacagtctcc tgtagagtgt tgactagacc 5701 ttcccctgac tttgacttca catacctggt gccaccaaca gtggagtcta aaacaaagcc 5761 attcaccctc ccaatcctta cacttgggga actttccaat tccaggttcc cagtgtccat 5821 agaccagatg tacaccagtc ctaatgaaat tatatcagtg cagtgtcaaa atggtaggtg 5881 cacactggac ggggagctcc aagggacaac acaactccaa gtcagtggca tttgtgcttt 5941 caaaggtgaa gtgaccgccc acttgcatga caatgatcac ctatataatg tcaccatcac 6001 aaacttgaat gggtcccctt ttgatccctc cgaggatatc cctgcccctt tgggtgtgcc 6061 tgacttccag ggaagggtct ttggtatcat ctcccaaaga gataaacaca atagtcctgg 6121 gcataatgaa ccagcaaaca ggggacacga cgctgtggtc cctacttaca cagcacagta 6181 cactccaaaa cttggacaaa ttcaaattgg cacatggcaa actgacgacc ttacagtcaa 6241 ccaaccagtc aaattcaccc cagttggact caatgacact gaacacttta accaatgggt 6301 agtccctagg tatgctggtg cccttgcccc ttctgttgct ccagtatttc caggagagcg 6361 cctgctcttc ttcagatcat acattcccct caagggcggt tatggaaacc cagccattga 6421 ttgcctacta ccacaagagt gggtgcagca cttctatcag gaagcagccc cttcaatgag 6481 tgaggtggcc ctcgtcagat acatcaaccc ggacactggt cgggcactgt ttgaggccaa 6541 gctccataga gctggtttca tgacagtctc gagcaacacc agtgccccgg tggttgtgcc 6601 tgccaacggg tacttcagat ttgattcttg ggtgaaccaa ttttattctc ttgcccccat 6661 gggaactggg aatgggcgta gaagggttca ataatggctg gagcttttgt agctggtctt 6721 gcaggggacg tgctcagcaa tgggcttagt tcattaatca atgcaggtgc taatgcaata 6781 aatcagaggg cagaatttga ttttaaccaa aaattgcagc aaaattcttt taatcatgat 6841 aaggagatgt tgcaggctca gattcaggca actaagcagc tgcaggcaga catgatggct 6901 ataaaacaag gggtcttgac cgctggcggc ttttcccctg ctgatgcagc cagaggtgct 6961 gtgaacgcgc ccatgacaca ggcgctggat tggaatggca caaggtattg ggcaccaaac 7021 tccatgagga ccacatctta ttctggaaaa ttcacatcaa ccgccccagt gaggcaggct 7081 ggcttccagc acacccaaag ccggccttcg agtggctcct ctgtgtcttc ctttgccact 7141 cagtcttcaa ggccaactct gaccacaacc actggttcct cacatggcac aacctcatca 7201 aattcaactc gcagcacaag cctctcccaa tcaacggtct ccagagctac atctaggact 7261 agcgagtggg ttagggatca gaacagaaat ttggaaccct acatgcatgg tgccttacag 7321 acagcctttg tcaccccacc ttccagcagg gcatctgacg ggacagtctc aaccgtccct 7381 aaaggtgttt tggactcctg gacacctgcg tttaacactc gcaggcagcc gctttttgca 7441 cacctccgta agagggggga gtcacaagct tagtgaaaag gtgaaaattt gtttaggatt 7501 aattgatttc acctttt //