Typing tool | Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| OK562729 | GI.5 | GI.P4 |
ORF1: 1..5284 ORF2: 5268..6899 ORF3: 6899..7531LOCUS OK562729 7622 bp RNA linear VRL 25-OCT-2021 DEFINITION Norovirus GI isolate 2021GZ001 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION OK562729 VERSION OK562729.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7622) AUTHORS Xie,H., Su,W. and Liu,J. TITLE Direct Submission JOURNAL Submitted (19-OCT-2021) Derpartment of Virology and Immunology, Guangzhou Center for Disease Control and Prevention, Qide Road No. 1, Baiyun District, Guangzhou, Guangdong 510440, China COMMENT ##Assembly-Data-START## Sequencing Technology :: Sanger dideoxy sequencing ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7622 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="2021GZ001" /isolation_source="rectal swab" /host="Homo sapiens" /db_xref="taxon:122928" /country="China: Guangzhou" /collection_date="15-Mar-2021" /note="genotype: GI.5[P4]" gene <1..5284 /gene="ORF1" CDS <1..5284 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="UDE31774.1" /translation="IKNRFFTRLKNLGSNNKPIKIENTHMALNLISRGPSPIPQDKPP KDQRDKPPRNVAETQQAMGWVDPPMDQNLPTWEELSQTEKQEILENNSNWFDAGGLAP ASIPTGYVKNTEQQPPDHQVKWSASNGVDLGVGNLTTVSGPAWNLCPMPPIDQRNNGP AKEPLIGDMIEFYEGHIYHYAIYIGQGKTVGVHSPQAAFSIARITIHPIAAWWRVCYV PTPDQRLNYDQLKELENEPWPYAAVTNNCYEFCCRILNLQDSWLERRLVTSGRFNHPA QDWSRDTPDFQQDSKLEIVRDAVLAAINGLVSKPFKDLLNKLKPLNVLNLLSNCDWTF MGVVETIILLMELFGIFWNPPDVSSFIASLLPDMHLQGPEDLAKDLIPLVLGGVGLAI GFTKDKVTKVMKSAVEGLRAATQLGQYGLEIFAVLKKYFFGGDQTDKTLKDIETAVID MEVISTTAVTQLVRDKQAARTYMNILDAEEEKARKLSVRHADPHVVSTTNALISRISL ARSALAKAQAEMTNRVKPVVIMMCGPPGIGKTKAAEYLAKRLANEIRPGGKVGLVPRE AVDHWDGYHGEEVMLWDDYGMSKIADDCNKLQAIADTAPLTLNCDRIEKKGLQFVSDA IVITTNAPGPAPVDFVNLGPVCRRVDFLVYCTAPDVEQTRRTNPGDTGALKDCFKSDF 
SHLKMELAPQGGFDNQGNTPFGKGTMRPTTLNRLLIQAVALTMERQDEFQMQGQVYDF DADRISAFTSLARANGLGLISMASLGKKLRGVDSVHGLKNALSGYTITPCSIKWQARV YDIESDGSNVRIKENTSAQTQRQQSIDTAALALTRLKAARAAAYAACIQSAITTILQM AGSAIVINRAVKRMFGAHSSTIALEGPPREHRCRAHLAKAAGGGPIGHDDVIDKYGLC ETEEEGTSEEINVELPTATSEGKNKGKTKKGRGRRANYNAFSRRGLSDEEYEEYKKIR EEKNGNYSIQEYLEDRQRYEEELAEVQAGGDGGIGETEMEIRHKIFYKSKKGRQNERR QLGLVTGSDIRKRKPIDWTPPKNDWADDNREVDYNEKLSFEAPPTLWSRVVRFGSGWG FWVSPTVFITTTHVIPSGAREFFGEPIESIAVHRAGEFTQFRFSHKVRPDLTGMVLEE GCPEGVVCSILIKRDSGELLPLAVRMGAIASMKIQGRLVHGQSGMLLTGANAKGMDLG TLPGDCGAPYVYKRNNDWVVCGVHAAATKSGNTVVCAVQAGEGETTLEGGDKGHYAGH EIIKHSKGPALSTKTKFWRSTPEPLPPGVYEPAYLGGRDPRVKSGPSLQQVLRDQLKP FAEPRGRMPEPGLLEAAVETVTSMLEQTMDTPTPWSFSDACQSLDKTTSSGHPYHKRK NDDWNGTSFVGELGEQAAHANNMYELGKSMKPLYTAALKDELVKPEKIYQKIKKRLLW GADLSTVIRAARAFGPFCDAIKSHVIKLPIKVGMNSIEDGPMIYAEHAKYKNHFDADY SAWDSTQNRQIMTESFAIMCRLTASPELASVVAKDLLAPSEMDVGDYIIRVKEGLPSG FPCTSQVNSINHWLITLCAMSEVTGLSPDVIQSQSYFSFYGDDEIVSTDIDFDPARLT QVLKEYGLRPTRPDKSEGPIILRRQVDGLVFLRRTISKDAAGFQGRLDRGSIERQLWW TRGPNHDDPSETLIPHPQRKVQLISLLGEASLHGEKFYRKISSKVIQEIKTGGLEMYV PGWQAMFRWMRFHDLGLWTGDRNLLPEFVNDDGV" mat_peptide <1..1114 /gene="ORF1" /product="p48" mat_peptide 1115..2203 /gene="ORF1" /product="NTPase" mat_peptide 2204..2806 /gene="ORF1" /product="p22" mat_peptide 2807..3214 /gene="ORF1" /product="VPg" mat_peptide 3215..3757 /gene="ORF1" /product="Pro" mat_peptide 3758..5281 /gene="ORF1" /product="RdRp" gene 5268..6899 /gene="ORF2" CDS 5268..6899 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="UDE31775.1" /translation="MMMASKDAPPSADGANGAGQLVPEVNNAEPLPLDPVAGASTALA TAGQVNMIDPWIFNNFVQAPQGEFTISPNNTPGDILFDLQLGPHLNPFLAHLAQMYNG WVGNMRVRLILAGNAFTAGKVIICCVPPGFQSRTLSIAQATLFPHVIADVRTLEPIEI PLEDVRNTLYHNNDNQPTMRLLCMLYTPLRTGGGSGGTDAFVVAGRVLTCPSPDFNFL FLVPPTVEQKTRPFSLPNIPLHLQSNSRVPNPIQSMVISPDQAQNTQFQNGRCTTDGQ LLGTTPVSVSQILKFRGKVSPGSKVINLTELDGSPFLAFEAPAPTGFPDLGTCDWHIE LSLNSNSQSSGNPIVLRDIQPNSSDFVPHLGSVAVTVAIETAGDYTGTIQWISQPSNV 
SPVPNVNLWTIPSYGSSLAEASQLAPVVYPPGFGEAVVYFMSNIPGPNTEHKPNLVPC LLPQEFITHFVSEQAPPMGEAALIHYVDPDTNRNLGEFKLYPEGVITCVPNGTGPQQL PLNGVFVFASWVSRFYQLKPVGTASLARGRLGVRR" gene 6899..7531 /gene="ORF3" CDS 6899..7531 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="UDE31776.1" /translation="MAQAIIGAIAASAAGSALGAGIQAGAEAALQSQRFQQDLTLQQN SFKHDKEMMSYQVEMSNALLAKNLNTRYALLQSGGLSSADAARAVAGAPVTRIVDWNG TRVAAPNSSATTLRSGGFMSVPIPIQPKQKPPTISGFNNPAYGETMSRVSSWVQSQNS SVRSVPPFHSSALRTVWVTPPGSTSTSSSVSSVPMGVFNTDRLPLFANRR" ORIGIN 1 tataaagaac cgtttcttca caaggttgaa gaatttaggt tctaacaaca agcccataaa 61 aattgagaac acccacatgg cccttaactt aattagtagg ggcccatctc caattccaca 121 ggacaagccc ccaaaagacc agagggataa acccccaagg aatgtggctg agacccagca 181 ggcaatgggc tgggtcgacc cccccatgga tcagaatcta ccaacctggg aggagttgag 241 ccaaacagaa aaacaagaga tactcgaaaa caattcaaat tggtttgatg ctggtggttt 301 ggcaccagct agcatcccaa ctggttatgt aaagaacaca gaacaacaac cccctgatca 361 ccaagtcaaa tggagtgcca gtaatggggt tgaccttggg gtaggcaatt tgacaactgt 421 gtcaggtcca gcgtggaatc tgtgccccat gccaccaatt gatcaacgta acaatggccc 481 agcgaaggaa ccacttattg gtgacatgat agaattctat gaggggcaca tttaccatta 541 tgcaatatac attggacaag gaaaaactgt tggggttcac tcaccccaag cggccttttc 601 aatagccagg ataactatac accccatcgc cgcttggtgg agggtttgtt atgtgccaac 661 acctgatcag aggttgaatt atgaccaact caaagaactt gagaatgagc catggccata 721 tgctgcagtt actaataatt gttatgagtt ctgctgcagg attctcaatc ttcaggactc 781 ttggttggaa cgtcgcttgg tcacatctgg ccgtttcaac cacccagcac aagactggtc 841 cagggacacg cctgacttcc agcaagatag caagttagaa atagtcaggg atgccgtgct 901 tgctgctatc aatggcttgg tatcaaaacc ttttaaggac ctcctaaata aacttaagcc 961 actcaatgta ctcaacttac tctctaactg tgattggaca tttatggggg tggttgagac 1021 gataatatta cttatggagt tgtttgggat cttctggaat ccccctgatg tgtccagttt 1081 catagcctca ctattacctg atatgcactt gcagggacca gaagacctgg ctaaagactt 1141 gataccgttg gtattggggg gtgtggggct tgcaattggg ttcaccaaag acaaagttac 1201 aaaagtcatg aagagtgctg ttgaaggact 
cagggcagcc acacaattag gccagtatgg 1261 cctagagata tttgcagttt taaagaaata tttctttggt ggagatcaaa cagacaagac 1321 cttaaaggac attgagacag ccgtgattga catggaggtc atctcaacaa ctgctgtaac 1381 tcaactagtt agggacaaac aggctgcccg aacctacatg aatatcttag atgctgaaga 1441 agagaaggca agaaagcttt cagttagaca tgcagacccc catgttgtgt ccacgaccaa 1501 tgccctgatc tcacgtattt cattggctcg ttccgcactt gctaaggcac aagctgagat 1561 gactaatagg gttaaaccag tggtgataat gatgtgtggt cccccaggta taggcaaaac 1621 aaaggcagcc gagtaccttg ctaagagact ggccaatgag atacggcctg gaggaaaagt 1681 gggccttgtg ccaagggagg ctgtggacca ctgggatgga tatcatggtg aagaggttat 1741 gctatgggat gactatggta tgtcaaaaat agctgatgac tgcaacaaat tacaggctat 1801 agccgacaca gctccactca ctttgaattg tgaccggatt gagaagaaag ggttacaatt 1861 tgtctctgat gccatagtga taacaacaaa tgccccagga ccagccccag ttgactttgt 1921 caatcttggc ccagtgtgtc ggcgggtaga tttcctggtt tattgcactg cacccgatgt 1981 tgaacagaca cgtcgcacta accctggaga caccggagcc ctcaaggatt gctttaaaag 2041 tgacttctct catttaaaaa tggagttggc tccacagggt gggttcgaca accaaggaaa 2101 cactccattt ggcaaaggga ccatgcggcc taccacatta aaccgcctcc ttatacaggc 2161 cgtagcacta accatggagc gtcaggatga gttccagatg caaggtcagg tgtacgactt 2221 tgatgctgac aggatctctg ccttcaccag tttagcaaga gccaatggtt tgggtctcat 2281 tagcatggcc tccctaggaa agaaattgag gggtgtggat tcggttcatg gcctcaagaa 2341 tgcactatca ggctatacca taaccccctg tagcataaag tggcaagcta gagtgtatga 2401 cattgaatca gatggctcca atgtgcgcat aaaggaaaac acctcagccc agacacaacg 2461 ccagcaatca attgatacag cagccctggc acttaccaga cttaaagcag cacgggcagc 2521 agcatatgca gcttgcatac aaagcgcaat aacaactatc ctgcagatgg ctgggtcagc 2581 catcgttata aatcgtgctg ttaagaggat gttcggtgca cattcgagca ctatagccct 2641 cgaaggacct ccaagagaac atcggtgcag ggctcattta gccaaggctg caggtggtgg 2701 gccaataggc catgatgatg tcattgataa atatggccta tgtgagactg aagaagaggg 2761 taccagtgag gagataaatg tagagctacc caccgccact tccgaaggta agaacaaagg 2821 aaagaccaag aaaggaagag gcagaagggc caactacaat gctttttcac ggcgcggcct 2881 gagtgatgag gagtacgagg aatataagaa gatcagggaa 
gaaaagaatg gcaactacag 2941 cattcaggag taccttgaag acaggcaacg ctatgaagaa gagctcgcag aagtgcaagc 3001 tggtggtgat ggaggcattg gtgagacaga aatggagatc cgacataaaa tcttttacaa 3061 gtctaaaaag ggccgccaga atgaacgccg ccaactaggg ctagtcactg gtagtgatat 3121 tagaaagaga aagccaattg attggacccc ccccaaaaat gattgggcag atgataatag 3181 ggaagtggac tacaatgaaa aactcagttt tgaagcccca ccaacccttt ggagccgcgt 3241 tgtgagattt ggctctgggt ggggcttttg ggtgagcccg actgttttta tcaccaccac 3301 tcatgtcata ccatctgggg cgcgtgaatt ctttggggaa cccatcgagt ccattgcagt 3361 ccaccgtgct ggtgaattca cacaattcag attttcacac aaggtccgcc cggatctgac 3421 tggaatggtg ctggaggaag gttgtcctga gggtgtcgtg tgttctattc tcataaagcg 3481 tgactctggt gagctgctgc cccttgccgt ccgaatgggc gctatagcgt ccatgaagat 3541 tcagggaagg ctagtgcatg gccagtctgg gatgctgctc actggagcta acgctaaggg 3601 gatggatttg ggtaccctac caggtgattg tggtgcccct tatgtgtaca aaaggaataa 3661 cgactgggtg gtctgtggtg tacatgcagc cgctacaaag tcaggcaaca ctgtggtgtg 3721 tgctgtccaa gctggagagg gtgaaaccac cctagagggt ggtgataaag gccactatgc 3781 aggccatgaa ataatcaagc atagtaaggg cccagctcta tctactaaga ccaagttttg 3841 gaggtcaacc cctgaaccat taccaccagg tgtttatgaa ccagcatatc ttgggggtcg 3901 agatcccagg gtaaaaagtg ggccctctct gcagcaagtc ttgcgtgatc agttgaaacc 3961 atttgctgag ccgcgaggtc gaatgccaga gcctggtcta cttgaggcag ctgttgaaac 4021 cgtcacgtcc atgcttgagc aaacaatgga cacaccaacc ccctggtctt tctctgatgc 4081 atgtcaatca cttgacaaga ccacaagttc aggacacccg taccacaaaa gaaagaatga 4141 tgattggaat ggaacttcat tcgttggaga gcttggagaa caggcagcac atgctaacaa 4201 catgtatgaa ttgggcaaat ctatgaaacc actatacact gcagccctta aagacgagtt 4261 agtcaagccc gagaaaattt atcaaaagat caagaagagg cttctctggg gtgctgatct 4321 ttcaacagtc ataagggctg ccagggcctt tgggcctttt tgtgatgcta taaagtctca 4381 tgttattaaa ctccccatca aagttggtat gaactccata gaagatggcc caatgattta 4441 tgcagagcat gcaaaatata agaaccattt tgatgcagat tattcggcct gggattccac 4501 gcaaaataga cagatcatga ctgaatcctt tgcaatcatg tgccgtctca cagcatcacc 4561 agaattggct tctgttgtag ctaaagattt actagcccct tcagaaatgg 
atgttggtga 4621 ttacatcatc cgtgtaaaag agggcttgcc atcaggattc ccttgcacat cacaggtgaa 4681 cagcataaac cactggctga tcacactgtg tgccatgtct gaagtcacgg gtttgtcacc 4741 tgatgtcata caatcccagt cttacttttc cttctatggt gatgatgaga tagtctctac 4801 agatattgat tttgaccccg ctcgcctcac tcaagtgctc aaggaatatg ggttgagacc 4861 caccagacca gacaagtcag aagggcccat catactcaga agacaggtgg atggtcttgt 4921 gttcttgagg aggacaatct ctaaagatgc tgcgggcttc cagggaaggc tagatagagg 4981 ctcaattgaa agacagctct ggtggactcg aggccctaac catgatgacc ccagtgagac 5041 tctaatccca cacccccaga ggaaagttca gcttatatcc ctccttggtg aggcttccct 5101 ccatggtgag aagttttaca ggaagatctc cagtaaagtt atacaggaga taaaaactgg 5161 tggtttggaa atgtatgtgc caggctggca ggccatgttc cgctggatgc gcttccatga 5221 cctcggattg tggacgggag atcgcaatct cctgcccgaa ttcgtaaatg atgatggcgt 5281 ctaaggacgc cccaccaagc gcagatggcg cgaatggcgc cggtcagctt gtgccggagg 5341 ttaataatgc tgaaccactg ccacttgatc cagtggcggg ggcttccacc gcccttgcca 5401 ctgctggaca agttaatatg attgacccat ggatctttaa taattttgtc caggcccccc 5461 agggtgaatt tactatttct ccaaacaata cccccggcga tatcctgttt gatttacaat 5521 taggtccaca tctaaatcct tttctagcac atttggctca gatgtacaat ggctgggttg 5581 gcaacatgcg ggtcaggctc atcttggccg gtaatgcttt cactgctggt aaagttataa 5641 tttgttgtgt tccccctggt tttcaatcta gaaccctttc catagcccag gctacactct 5701 ttccccatgt gatcgctgac gtgagaactc tggaaccaat tgagatccct ttagaagatg 5761 ttaggaatac tctttatcat aataatgaca atcaaccaac aatgcgctta ttgtgcatgc 5821 tctacacgcc attacgcacc ggtggcggtt ctggtggcac agatgcgttt gttgtagcag 5881 gtcgagtttt aacttgccct agccctgatt tcaatttctt gttcttagtc ccacccactg 5941 tggagcagaa gacacgtccc tttagtctgc ccaatatacc tttacacttg cagtccaact 6001 cccgtgtccc caaccccata cagagtatgg tcatttcgcc tgatcaggcc cagaatactc 6061 aatttcagaa tggccgctgc accaccgatg ggcagctctt gggtacaaca cctgtctcag 6121 ttagtcagat cttaaagttt agaggaaagg tgtcacctgg ttctaaggtt atcaatctca 6181 ctgagttaga tggttccccc tttctggctt ttgaggcccc cgcgcccacg ggctttcctg 6241 acttgggaac ttgtgattgg catattgaat taagccttaa ctcaaatagt caaagttctg 
6301 gcaacccaat tgtactaagg gacatacaac caaattcttc agattttgtc ccacacttgg 6361 gcagtgtggc tgtcaccgtt gcaattgaaa cagctgggga ttatactggt acaatacaat 6421 ggatctctca gccatccaat gtctccccag ttcccaatgt caacctatgg actatcccca 6481 gctatggatc aagtttggca gaagcctccc aactggcccc cgtagtgtat cccccgggat 6541 ttggtgaagc tgtagtgtac tttatgtcta acataccagg cccaaacact gaacacaaac 6601 ccaatttggt accctgccta cttccacaag aatttataac tcattttgtc agtgaacaag 6661 ccccacccat gggtgaggca gctcttatcc attatgtgga cccggatacc aaccgtaacc 6721 taggtgagtt caaactgtat cctgaaggtg ttatcacctg tgtgccaaat ggcactggtc 6781 cacaacaact ccccctcaat ggggtctttg tctttgcttc ctgggtttct agattttacc 6841 aattaaagcc tgtgggaacg gccagtttgg ccagaggtag gcttggagtc agaagataat 6901 ggctcaggct atcattggag ccatcgctgc atcagcggct ggaagtgccc ttggagctgg 6961 tatccaagca ggtgctgagg ccgccctcca gagccagagg ttccagcagg accttaccct 7021 tcaacaaaat tcttttaaac atgataagga aatgatgtca tatcaagttg aaatgtctaa 7081 tgctttgtta gctaaaaatc ttaatactcg ctatgctttg ctccaatctg gaggtctatc 7141 tagtgctgat gctgctcgtg ctgtggcagg tgcgcctgtt acaagaattg ttgattggaa 7201 tggcacgaga gttgcagccc caaactcttc tgcaacaact ctcagatctg gtggatttat 7261 gtctgttcct atacctattc aacccaaaca aaagccccct accatctcag ggtttaataa 7321 tccagcatat ggagaaacta tgtcaagagt ctcctcctgg gtccaatccc agaattcaag 7381 tgttagaagt gtgcctcctt ttcatagttc tgccctcagg actgtttggg ttacaccacc 7441 aggttcaact tctacttctt catctgtttc atctgtgcct atgggtgtgt ttaacaccga 7501 taggctcccc ctattcgcaa ataggagata atattgttgt aataagaata acaatgtggg 7561 catcttattc aatttggtct aattaattat ataattaggt ttgatttgga caattgatgt 7621 tc //
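
The ORF coordinates listed at the top of this entry (ORF1 1..5284, ORF2 5268..6899, ORF3 6899..7531) can be checked against the CDS features in the record with a little arithmetic. The following is a minimal sketch, not part of the typing tool itself: the names `ORFS`, `coding_length`, `protein_length`, `overlap`, and `frame` are hypothetical helpers introduced here for illustration. It assumes coordinates are 1-based and inclusive, as in GenBank feature locations, and that each span ends with a stop codon.

```python
# 1-based, inclusive coordinates copied from the entry for OK562729.
# Third value is /codon_start from the CDS feature; ORF1 is 5'-partial
# (<1..5284) and the record gives /codon_start=2 for it.
ORFS = {
    "ORF1": (1, 5284, 2),
    "ORF2": (5268, 6899, 1),
    "ORF3": (6899, 7531, 1),
}


def coding_length(start, end, codon_start):
    """Nucleotides from the first complete codon through the end of the span."""
    first_codon_pos = start + codon_start - 1
    return end - first_codon_pos + 1


def protein_length(start, end, codon_start):
    """Residues encoded, assuming the span ends with a stop codon."""
    nt = coding_length(start, end, codon_start)
    if nt % 3 != 0:
        raise ValueError("coding span is not a whole number of codons")
    return nt // 3 - 1  # the final codon is the stop


def overlap(a, b):
    """Nucleotides shared by two inclusive (start, end, ...) intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def frame(start, codon_start):
    """Reading frame (0, 1 or 2) relative to position 1 of this sequence."""
    return (start + codon_start - 2) % 3


if __name__ == "__main__":
    for name, (start, end, codon_start) in ORFS.items():
        print(
            name,
            coding_length(start, end, codon_start),
            protein_length(start, end, codon_start),
            frame(start, codon_start),
        )
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]))
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]))
```

Under these assumptions, ORF2 spans 1632 nt and ORF3 spans 633 nt, giving 543 and 210 residues, which agrees with the lengths of the VP1 and VP2 translations in the record. ORF1 and ORF2 share 17 nt (5268..5284), and ORF2 and ORF3 share the single nucleotide at 6899. The frame values are relative to this partial sequence, not to a full-length genome.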