Typing tool
|
Complete norovirus genomes
OK376714 | GII.4 Den Haag | ||
---|---|---|---|
GII.P4 Den Haag |
ORF1: 1..5078 ORF2: 5059..6681 ORF3: 6681..7487LOCUS OK376714 7534 bp RNA linear VRL 01-JUL-2022 DEFINITION Norovirus GII isolate 0407 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION OK376714 VERSION OK376714.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7534) AUTHORS Roy,S., Tutill,H.J., Williams,R.J., Sheth,S., Celma,C., Allen,D. and Breuer,J. TITLE Direct Submission JOURNAL Submitted (05-OCT-2021) Infection Immunity & Inflammation, UCL, ICH, Guilford St, London WC1N 1EH, United Kingdom COMMENT ##Assembly-Data-START## Assembly Method :: CLC Genomics Workbench v. 11.01 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7534 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="0407" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="United Kingdom" /collection_date="21-Feb-2017" /note="genotype: GII.P4_Den_Haag_2006b_GII.4_Den_Haag_2006b" gene <1..5078 /gene="ORF1" CDS <1..5078 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="UBX38814.1" /translation="SAAAVANSNDDTTKSSSDKMFSNMAVTLKRALGARPKQPPPREI PQRPPRPPTPELVKKVPPPPPNGEDEVVVSYSVKDGVSGLPELSTVRQPEETNTAFSV PPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGIVLGVHKPPAAISLAKVELTP LSLYWRPVYTPQYLIAPDTLKKLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQ RTTGFFRPYQDWNRKPLPTMDDSKIKKVANIFLCALSSLFTRPIKDIIGKLRPLNIIN ILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDFELQGPEDLVVELVPV VMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANEL AMVRSIEDAVLDLEAIENNHMTTLLKDKDTLATYMRTLDLEEEKARKLSTKSASPDIV GTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGKPGIGKTHLAREVAKRIAASL TGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDR IENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQ PDMWKSAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARATGLLHERL DEFELQGPATTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGRQLKDVKTMPELKQALK NISIKKCQIVYGGCTYALESDGKGNVKVDRVQNASVQTNNELSGALHHLKCARIRYYV KCVQEALYSIIQIAGAAFVTTRIIKRVNIQDLWSRPQVEDTEETASKDGCPKPKDDEE FVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYL QDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKR NPEDFKPKGKLWADDDRSVDYNERLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTH VIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTILIK RPTGELMPLAARMGTHATMRIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYK RGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLS TKTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPNPG VLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGK LADQASKANLMYEQGKNMTPVYTGALKDELVKTDKIYDKIKKRLLWGSDLATMIRCAR AFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSKYRYHYDADYSRWDSTQQRAVL ATALEIMVKFSSEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCTSQWNSIAH WLLTLCALSEVTNLSPDVIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLKEYGLKPTR PDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHEDPSES MIPHSQRPIQLMSLLGEAALHGPTFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRF SDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..968 /gene="ORF1" /product="p48" mat_peptide 969..2066 /gene="ORF1" /product="NTPase" mat_peptide 2067..2603 /gene="ORF1" /product="p22" mat_peptide 2604..3002 /gene="ORF1" /product="VPg" mat_peptide 3003..3545 /gene="ORF1" /product="Pro" mat_peptide 3546..5075 /gene="ORF1" /product="RdRp" gene 5059..6681 /gene="ORF2" CDS 5059..6681 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="UBX38815.1" /translation="MKMASNDANPSDGSAANLVPGVNNEVMALEPVAGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLNPSQVTMFPHVITDVRQLEPVLIPLPD VRNNFYHYNQSNDSTLKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESKTKPFTIPILTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVAHIPGSRNYTMNLASLNWSNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRSDGSTRGHKATVYTGSADFTPKLGSVQLATDTENDFEARQNAKFTPIGVIQDG NTTHRNEPQQWVLPSYSGRNAHNVHLAPAVAPSFPGEQLLFFRSTLPGCSGYPNLDLD CLLPQEWVQHFYQQAAPAQSDVALLRFVNPDTSRVLFECKLHKAGYLTVSHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6681..7487 /gene="ORF3" CDS 6681..7487 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="UBX38816.1" /translation="MAGAVFAGMASDVLGSGLSSLINAGAGAINQKIDFENNKQLQQA SFQFSNNLQQTSFQHDKEMLQAQIEATKKLQQEMMKVKQAVLLEGGFSETDAARGAIN APMTKVLDWNGTRYWAPDARVTTYNSGRFSTPQPSGALPGRINPRTPTPARGSPSTSS NASTVTSVYSNQTASTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI SYVTPPSSRSSSQGTVSTVPKGVLDSWTGAFNTRRQPLFAHIRTRGESRV" ORIGIN 1 cttctgctgc cgctgttgct aacagcaacg acgacaccac aaaatcttca agtgacaaaa 61 tgttttctaa catggctgtc acccttaaac gagccctcgg ggcgcggcct aaacagcccc 121 ccccgaggga aataccacaa agacccccac ggccgcccac tccagaacta gtcaaaaagg 181 tccctcctcc cccgcccaac ggagaggatg aagtagtggt ctcttatagt gtcaaagatg 241 gcgtctccgg tttgcctgag ctatccaccg tcaggcaacc ggaagaaacc aacacggcct 301 tcagtgtccc tccactcaac cagagggaga acagggatgc taaggaacca ctgactggga 361 cgattcttga aatgtgggat ggagaaatct accattacgg cctgtacgtt gagcgaggta 421 ttgtactggg tgtgcacaaa ccaccagctg ccatcagcct cgcaaaggtc gagttaacac 481 cactctcctt gtactggaga cctgtgtaca cccctcagta cctcattgct ccagacactc 541 tcaaaaagtt acacggagag acattcccct acacagcctt cgacaacaac tgctacgcct 601 tttgttgctg ggtcctggac ctaaacgact cgtggctgag taggagaatg atccagagaa 661 caactggctt ctttagaccc taccaagatt ggaataggaa acccctcccc actatggatg 721 attccaagat aaagaaggta gctaacatat tcctgtgtgc cctgtcttcg ctgttcacca 781 ggcccataaa agacataata ggaaagttaa gacctctcaa catcatcaac atcctggctt 841 catgtgattg gacttttgca ggcatagtgg agtccttgat actcttggca gaactctttg 901 gagtcttctg gacaccccca gatgtgtctg cgatgatcgc ccccttactc ggtgacttcg 961 agctacaagg acctgaagac cttgtagtgg agctcgtccc tgtggtaatg ggggggattg 1021 gtttggtgct aggattcacc aaagagaaga ttggaaaaat gttgtcatct gctgcatcca 1081 ccctgagagc ttgcaaagac cttggtgcat atgggctgga gatcctaaag ctagtcatga 1141 agtggttctt cccgaagaaa gaggaagcga acgaactggc tatggtgaga tccatcgagg 1201 acgcagtgct ggacctcgag gcaattgaaa acaaccatat gaccaccttg ctcaaagata 1261 aagacaccct ggcaacctac atgaggaccc tcgacctcga ggaagaaaag gccagaaagc 1321 tctcaaccaa gtctgcctcg cctgacatcg tgggcacaat caacgcgctc ctggcgcgga 1381 ttgctgctgc acgctccctg gtacaccgag cgaaggagga actttccagc agaccaagac 1441 ccgtagtctt aatgatatcg ggcaagccag gaatagggaa gacccacctt gctagggagg 1501 tggctaagag aatcgcagcc tccctcacag gagatcagcg cgttggtcta atcccacgca 1561 atggcgtcga tcactgggat gcgtacaagg gggagagggt cgtcctgtgg gacgactatg 1621 gaatgagcaa tcccatccac gatgccctta ggctgcaaga actcgctgac acttgccccc 1681 tcaccctaaa ttgtgacagg attgagaaca aaggaaaggt ctttgacagc gacgtcatca 1741 taatcaccac taatctggcc aacccagcac cactggacta cgtcaacttt gaagcatgct 1801 cgaggcgtat cgatttcctc gtgtacgcag aagcccccga ggtcgaaaag gcgaaacgcg 1861 acttcccggg tcaacctgac atgtggaaaa gcgcttttag ttctgacttc tcacacataa 1921 aattggcact ggccccacaa ggtggttttg ataagaacgg gaacacccca cacgggaagg 1981 gcgtcatgaa gaccctcacc accggctccc tcattgcccg ggcaacaggg ctactccatg 2041 agagattaga tgagtttgaa ctgcagggcc cagccaccac caccttcaac ttcgaccgta 2101 acaaggtgct cgccttcagg cagcttgctg ctgaaaacaa gtacgggttg atggacacaa 2161 tgagagttgg gaggcagctc aaggatgtca agaccatgcc agaactcaaa caagcactca 2221 agaatatatc aatcaagaag tgccaaattg tgtatggtgg ctgcacctac gcacttgagt 2281 ctgatggcaa gggcaacgtg aaagttgaca gggttcagaa cgcctccgta cagaccaaca 2341 atgagttgtc tggcgccctg catcacctca agtgcgccag aatcaggtac tatgttaagt 2401 gtgtccagga ggccctgtac tctatcatcc aaattgctgg ggctgcattt gtcaccacgc 2461 gcatcatcaa gcgtgtgaac atccaagatc tgtggtccag gccacaggtg gaagacacag 2521 aggagactgc tagcaaggac gggtgcccga aacccaagga tgatgaggag ttcgtcattt 2581 catctgacga catcaaaact gagggtaaga aagggaagaa caagactggc cgtggcaaga 2641 aacacacagc cttctcaagc aaaggtctca gtgatgaaga gtatgatgaa tacaagagga 2701 ttagagagga aagaaatggc aagtactcca tagaagaata ccttcaggac agggacaagt 2761 actatgagga ggtggccatt gccagagcga ccgaggaaga cttctgtgaa gaggaggagg 2821 ccaagatccg gcaaaggatc ttcagaccaa caaggaagca acgcaaggaa gaaagggctt 2881 cactcggttt agtcacaggt tctgaaatca ggaaaaggaa tccagaagac ttcaagccta 2941 aggggaaact atgggctgac gatgacagaa gtgtggacta caatgaaaga ctcagctttg 3001 aggccccacc aagcatctgg tcaaggatag tcaactttgg ttcaggttgg ggcttctggg 3061 tctcccccag cctgttcata acatcaaccc acgtcatacc ccagggcgca aaggagttct 3121 tcggagtccc catcaaacaa attcaggtgc acaagtcagg cgaattctgt cgcttgagat 3181 ttccaaaacc aatcaggact gatgtgactg gcatgatcct ggaagaaggt gcgcccgaag 3241 gcaccgtggt cacaatactc atcaaaaggc ctactggaga actcatgccc ctagcagcca 3301 gaatgggaac acacgcaacc atgaggattc aaggacgcac tgtcggggga cagatgggca 3361 tgcttctgac aggttccaac gccaaaagca tggatttagg caccacacca ggcgactgcg 3421 gttgccccta catctacaag agaggaaatg actatgtggt cattggagtc cacacggctg 3481 ccgcccgtgg aggaaacact gtcatatgtg ccacccaggg gagtgagggg gaagctacac 3541 ttgaaggtgg agacagcaag ggaacatact gtggtgcgcc aattctaggt ccagggagtg 3601 ccccaaaact cagcactaaa accaaattct ggaggtcatc cacagcacca cttccacctg 3661 gcacctacga gccagcctac cttggtggca aagaccccag ggtcaagggt gggccctcgt 3721 tgcaacaagt catgagggac cagctgaaac catttacaga gcctaggggt aaaccaccaa 3781 acccaggtgt attagaggct gccaagaaga ccatcatcaa tgtccttgaa caaacaattg 3841 acccacctga gaagtggtcg ttcgcacaag cttgtgcgtc cctcgacaag accacttcta 3901 gcggccatcc gcaccacatg cggaaaaacg actgctggaa cggggagtcc ttcacaggca 3961 agctggcaga ccaggcttcc aaggccaacc tgatgtacga acaagggaag aacatgaccc 4021 cagtctacac aggtgcactc aaggatgaat tagtcaaaac tgacaaaatt tatgacaaga 4081 ttaagaagag gctcctctgg ggctcggatt tggcaaccat gatccgttgc gctcgagcat 4141 tcggaggtct gatggacgaa ctcaaagcac actgtgtcac acttcctgtc agagttggta 4201 tgaatatgaa tgaggatggc cccatcatct ttgagaagca ttccaagtat agataccact 4261 atgatgctga ttactctcgg tgggactcaa cacaacaaag ggccgtgctg gcaactgctc 4321 tagaaatcat ggttaaattc tcctcagaac cacatttggc ccaggtagta gctgaagacc 4381 ttctttctcc tagtgtagtg gatgtaggtg acttcacaat atcaatcaac gagggtcttc 4441 cctcgggagt gccctgcacc tcccagtgga attccatcgc ccactggctt ctcactctct 4501 gtgcactttc cgaagtcaca aacttgtctc cagacgtcat acaggctaat tctcttttct 4561 ccttctatgg tgatgatgaa attgtcagta cagacataaa attggaccca gaaaagttga 4621 caacaaagct taaggaatat ggattgaaac caacccgtcc tgacaaaact gaagggcctc 4681 ttgtcatttc tgaagactta gatggtttga ccttcctgcg gagaaccgtg acccgcgacc 4741 cagctggttg gtttggaaaa ctggaccaaa gttcaatact caggcaaatg tactggacta 4801 ggggccctaa ccatgaagac ccatctgaat caatgatccc acactctcag agacccatac 4861 aactgatgtc cttactggga gaggccgcac tccacggccc aacattctac agcaaaatta 4921 gcaaattggt catcgcagaa ctcaaagaag gtggtatgga tttttacgtg cccaggcaag 4981 aaccaatgtt caggtggatg agattctcgg atctgagcac gtgggagggc gatcgcaatc 5041 tggctcccag ttttgtgaat gaagatggcg tcgaatgacg ccaacccatc tgatgggtcc 5101 gcagccaacc tcgtcccagg tgtcaacaat gaggtcatgg ctctggagcc cgttgccggt 5161 gccgccatcg cggcgcctgt agcgggccaa caaaatgtaa ttgacccctg gattagaaat 5221 aattttgtcc aagcccctgg tggagagttc acagtgtccc ctagaaacgc tccaggtgaa 5281 atactatgga gtgcgcccct aggccctgat ttgaatccct acctatccca tttggccaga 5341 atgtataatg gttatgcagg tggttttgaa gtgcaggtaa tcctcgcggg gaacgcgttc 5401 accgccggga aaatcatatt tgcagcagtc ccaccaaatt tcccaactga gggtttaaat 5461 cccagccagg tcactatgtt cccccacgtg ataacagatg tcagacaact ggaacctgta 5521 ttgatcccct tacctgatgt taggaacaac ttctatcatt ataaccagtc aaatgattcc 5581 acccttaaat tgatagcaat gctgtataca ccacttaggg ccaataatgc cggggatgat 5641 gtcttcacag tctcttgtcg ggttctcacg aggccatccc ctgactttga tttcatattt 5701 ctggtgccac caacagtcga gtcaaaaact aagccattca ctatcccaat cttgactgtt 5761 gaagaaatga ccaattcaag attccccatt cctttggaaa aattgtttac gggccccagc 5821 agtgcctttg ttgtccaacc acaaaatggc aggtgcacga ctgatggcgt gctcttaggc 5881 accacccaac tgtctcctgt caacatttgc actttcaggg gggatgtcgc ccacatcccg 5941 ggttctcgca attacacaat gaatctggcc tctctaaatt ggagcaatta tgacccaaca 6001 gaagaaattc cagcccctct gggaacccca gatttcgtgg gaaagatcca aggtgtactc 6061 actcaaacca caaggagtga tggctcaacc cggggccaca aagctacagt ttacactggg 6121 agtgccgact tcactccaaa gctgggcagt gttcaacttg ctactgacac agaaaatgat 6181 tttgaagccc gccaaaatgc aaaattcact ccaatcggtg tcatccagga tggcaacacc 6241 acccaccgga atgaacccca acaatgggtg ctcccaagtt actcaggtag aaatgcccac 6301 aatgtacacc tagcccctgc tgtagccccc agttttccgg gtgagcaact tcttttcttc 6361 agatccacac tgcccggatg cagcgggtac cccaatctgg acctggactg cctactcccc 6421 caggagtggg tgcagcactt ctaccaacaa gcagccccag cacaatctga tgtggcttta 6481 ctaagatttg tgaacccaga cacgagtagg gtcttgtttg agtgcaaact ccataaagca 6541 ggctacctca cagtgtctca cactggtcaa catgatttgg ttatccctcc caatggctac 6601 tttaggtttg attcctgggt caaccagttc tacacactcg cccccatggg aaatggaacg 6661 gggcgtaggc gtgctttgta atggctggag ccgtctttgc aggaatggca tctgatgtcc 6721 tcggctccgg acttagttcc ctaatcaatg ctggggctgg ggctatcaac cagaagattg 6781 actttgagaa caataaacaa ttgcagcaag cctcctttca gtttagcaat aatctacaac 6841 aaacttcctt tcagcatgac aaagagatgc tccaagcaca aattgaggcc accaaaaagt 6901 tgcaacagga aatgatgaaa gtcaaacagg cagtgctctt agaaggtggg ttctctgaaa 6961 cagatgcggc ccgtggggca atcaacgccc ctatgacaaa ggttttggat tggaacggga 7021 caaggtactg ggcccctgat gccagggtca caacatacaa ctcaggccgc ttctccactc 7081 cccagccttc gggggcactg ccaggaagaa tcaatcccag aactcccacc cccgctcggg 7141 gttcccctag cacatcttct aatgcttcta ctgtgacttc tgtgtattca aatcaaactg 7201 cttcaacgag acttggttct acagctggtt ctggcaccag tgtctcgagt ctcccgtcaa 7261 ctgcaaggac taggagttgg gttgaggatc aaaatagaaa tctgtcacct ttcatgaggg 7321 gggcccacaa catatcgtat gtcaccccac catctagcag atcctccagc caaggcacag 7381 tctcaaccgt gcccaaaggt gttttggact cctggactgg cgctttcaac acgcgcaggc 7441 agcctctctt cgcccacatc cgtacgcgag gggagtcacg ggtataatgt gaaaagacaa 7501 aattgattat ctttcccttt ctttagtgtc tttt //