![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| OK376714 | GII.4 Den Haag | ||
|---|---|---|---|
| GII.P4 Den Haag |
ORF1: 1..5078
ORF2: 5059..6681
ORF3: 6681..7487
LOCUS OK376714 7534 bp RNA linear VRL 01-JUL-2022
DEFINITION Norovirus GII isolate 0407 nonstructural polyprotein (ORF1) gene,
partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION OK376714
VERSION OK376714.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7534)
AUTHORS Roy,S., Tutill,H.J., Williams,R.J., Sheth,S., Celma,C., Allen,D.
and Breuer,J.
TITLE Direct Submission
JOURNAL Submitted (05-OCT-2021) Infection Immunity & Inflammation, UCL,
ICH, Guilford St, London WC1N 1EH, United Kingdom
COMMENT ##Assembly-Data-START##
Assembly Method :: CLC Genomics Workbench v. 11.01
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7534
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="0407"
/isolation_source="feces"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="United Kingdom"
/collection_date="21-Feb-2017"
/note="genotype:
GII.P4_Den_Haag_2006b_GII.4_Den_Haag_2006b"
gene <1..5078
/gene="ORF1"
CDS <1..5078
/gene="ORF1"
/codon_start=3
/product="nonstructural polyprotein"
/protein_id="UBX38814.1"
/translation="SAAAVANSNDDTTKSSSDKMFSNMAVTLKRALGARPKQPPPREI
PQRPPRPPTPELVKKVPPPPPNGEDEVVVSYSVKDGVSGLPELSTVRQPEETNTAFSV
PPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGIVLGVHKPPAAISLAKVELTP
LSLYWRPVYTPQYLIAPDTLKKLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQ
RTTGFFRPYQDWNRKPLPTMDDSKIKKVANIFLCALSSLFTRPIKDIIGKLRPLNIIN
ILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDFELQGPEDLVVELVPV
VMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANEL
AMVRSIEDAVLDLEAIENNHMTTLLKDKDTLATYMRTLDLEEEKARKLSTKSASPDIV
GTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGKPGIGKTHLAREVAKRIAASL
TGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDR
IENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQ
PDMWKSAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARATGLLHERL
DEFELQGPATTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGRQLKDVKTMPELKQALK
NISIKKCQIVYGGCTYALESDGKGNVKVDRVQNASVQTNNELSGALHHLKCARIRYYV
KCVQEALYSIIQIAGAAFVTTRIIKRVNIQDLWSRPQVEDTEETASKDGCPKPKDDEE
FVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYL
QDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKR
NPEDFKPKGKLWADDDRSVDYNERLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTH
VIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTILIK
RPTGELMPLAARMGTHATMRIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYK
RGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLS
TKTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPNPG
VLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGK
LADQASKANLMYEQGKNMTPVYTGALKDELVKTDKIYDKIKKRLLWGSDLATMIRCAR
AFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSKYRYHYDADYSRWDSTQQRAVL
ATALEIMVKFSSEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCTSQWNSIAH
WLLTLCALSEVTNLSPDVIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLKEYGLKPTR
PDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHEDPSES
MIPHSQRPIQLMSLLGEAALHGPTFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRF
SDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide <1..968
/gene="ORF1"
/product="p48"
mat_peptide 969..2066
/gene="ORF1"
/product="NTPase"
mat_peptide 2067..2603
/gene="ORF1"
/product="p22"
mat_peptide 2604..3002
/gene="ORF1"
/product="VPg"
mat_peptide 3003..3545
/gene="ORF1"
/product="Pro"
mat_peptide 3546..5075
/gene="ORF1"
/product="RdRp"
gene 5059..6681
/gene="ORF2"
CDS 5059..6681
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="UBX38815.1"
/translation="MKMASNDANPSDGSAANLVPGVNNEVMALEPVAGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLNPSQVTMFPHVITDVRQLEPVLIPLPD
VRNNFYHYNQSNDSTLKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESKTKPFTIPILTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVAHIPGSRNYTMNLASLNWSNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRSDGSTRGHKATVYTGSADFTPKLGSVQLATDTENDFEARQNAKFTPIGVIQDG
NTTHRNEPQQWVLPSYSGRNAHNVHLAPAVAPSFPGEQLLFFRSTLPGCSGYPNLDLD
CLLPQEWVQHFYQQAAPAQSDVALLRFVNPDTSRVLFECKLHKAGYLTVSHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6681..7487
/gene="ORF3"
CDS 6681..7487
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="UBX38816.1"
/translation="MAGAVFAGMASDVLGSGLSSLINAGAGAINQKIDFENNKQLQQA
SFQFSNNLQQTSFQHDKEMLQAQIEATKKLQQEMMKVKQAVLLEGGFSETDAARGAIN
APMTKVLDWNGTRYWAPDARVTTYNSGRFSTPQPSGALPGRINPRTPTPARGSPSTSS
NASTVTSVYSNQTASTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI
SYVTPPSSRSSSQGTVSTVPKGVLDSWTGAFNTRRQPLFAHIRTRGESRV"
ORIGIN
1 cttctgctgc cgctgttgct aacagcaacg acgacaccac aaaatcttca agtgacaaaa
61 tgttttctaa catggctgtc acccttaaac gagccctcgg ggcgcggcct aaacagcccc
121 ccccgaggga aataccacaa agacccccac ggccgcccac tccagaacta gtcaaaaagg
181 tccctcctcc cccgcccaac ggagaggatg aagtagtggt ctcttatagt gtcaaagatg
241 gcgtctccgg tttgcctgag ctatccaccg tcaggcaacc ggaagaaacc aacacggcct
301 tcagtgtccc tccactcaac cagagggaga acagggatgc taaggaacca ctgactggga
361 cgattcttga aatgtgggat ggagaaatct accattacgg cctgtacgtt gagcgaggta
421 ttgtactggg tgtgcacaaa ccaccagctg ccatcagcct cgcaaaggtc gagttaacac
481 cactctcctt gtactggaga cctgtgtaca cccctcagta cctcattgct ccagacactc
541 tcaaaaagtt acacggagag acattcccct acacagcctt cgacaacaac tgctacgcct
601 tttgttgctg ggtcctggac ctaaacgact cgtggctgag taggagaatg atccagagaa
661 caactggctt ctttagaccc taccaagatt ggaataggaa acccctcccc actatggatg
721 attccaagat aaagaaggta gctaacatat tcctgtgtgc cctgtcttcg ctgttcacca
781 ggcccataaa agacataata ggaaagttaa gacctctcaa catcatcaac atcctggctt
841 catgtgattg gacttttgca ggcatagtgg agtccttgat actcttggca gaactctttg
901 gagtcttctg gacaccccca gatgtgtctg cgatgatcgc ccccttactc ggtgacttcg
961 agctacaagg acctgaagac cttgtagtgg agctcgtccc tgtggtaatg ggggggattg
1021 gtttggtgct aggattcacc aaagagaaga ttggaaaaat gttgtcatct gctgcatcca
1081 ccctgagagc ttgcaaagac cttggtgcat atgggctgga gatcctaaag ctagtcatga
1141 agtggttctt cccgaagaaa gaggaagcga acgaactggc tatggtgaga tccatcgagg
1201 acgcagtgct ggacctcgag gcaattgaaa acaaccatat gaccaccttg ctcaaagata
1261 aagacaccct ggcaacctac atgaggaccc tcgacctcga ggaagaaaag gccagaaagc
1321 tctcaaccaa gtctgcctcg cctgacatcg tgggcacaat caacgcgctc ctggcgcgga
1381 ttgctgctgc acgctccctg gtacaccgag cgaaggagga actttccagc agaccaagac
1441 ccgtagtctt aatgatatcg ggcaagccag gaatagggaa gacccacctt gctagggagg
1501 tggctaagag aatcgcagcc tccctcacag gagatcagcg cgttggtcta atcccacgca
1561 atggcgtcga tcactgggat gcgtacaagg gggagagggt cgtcctgtgg gacgactatg
1621 gaatgagcaa tcccatccac gatgccctta ggctgcaaga actcgctgac acttgccccc
1681 tcaccctaaa ttgtgacagg attgagaaca aaggaaaggt ctttgacagc gacgtcatca
1741 taatcaccac taatctggcc aacccagcac cactggacta cgtcaacttt gaagcatgct
1801 cgaggcgtat cgatttcctc gtgtacgcag aagcccccga ggtcgaaaag gcgaaacgcg
1861 acttcccggg tcaacctgac atgtggaaaa gcgcttttag ttctgacttc tcacacataa
1921 aattggcact ggccccacaa ggtggttttg ataagaacgg gaacacccca cacgggaagg
1981 gcgtcatgaa gaccctcacc accggctccc tcattgcccg ggcaacaggg ctactccatg
2041 agagattaga tgagtttgaa ctgcagggcc cagccaccac caccttcaac ttcgaccgta
2101 acaaggtgct cgccttcagg cagcttgctg ctgaaaacaa gtacgggttg atggacacaa
2161 tgagagttgg gaggcagctc aaggatgtca agaccatgcc agaactcaaa caagcactca
2221 agaatatatc aatcaagaag tgccaaattg tgtatggtgg ctgcacctac gcacttgagt
2281 ctgatggcaa gggcaacgtg aaagttgaca gggttcagaa cgcctccgta cagaccaaca
2341 atgagttgtc tggcgccctg catcacctca agtgcgccag aatcaggtac tatgttaagt
2401 gtgtccagga ggccctgtac tctatcatcc aaattgctgg ggctgcattt gtcaccacgc
2461 gcatcatcaa gcgtgtgaac atccaagatc tgtggtccag gccacaggtg gaagacacag
2521 aggagactgc tagcaaggac gggtgcccga aacccaagga tgatgaggag ttcgtcattt
2581 catctgacga catcaaaact gagggtaaga aagggaagaa caagactggc cgtggcaaga
2641 aacacacagc cttctcaagc aaaggtctca gtgatgaaga gtatgatgaa tacaagagga
2701 ttagagagga aagaaatggc aagtactcca tagaagaata ccttcaggac agggacaagt
2761 actatgagga ggtggccatt gccagagcga ccgaggaaga cttctgtgaa gaggaggagg
2821 ccaagatccg gcaaaggatc ttcagaccaa caaggaagca acgcaaggaa gaaagggctt
2881 cactcggttt agtcacaggt tctgaaatca ggaaaaggaa tccagaagac ttcaagccta
2941 aggggaaact atgggctgac gatgacagaa gtgtggacta caatgaaaga ctcagctttg
3001 aggccccacc aagcatctgg tcaaggatag tcaactttgg ttcaggttgg ggcttctggg
3061 tctcccccag cctgttcata acatcaaccc acgtcatacc ccagggcgca aaggagttct
3121 tcggagtccc catcaaacaa attcaggtgc acaagtcagg cgaattctgt cgcttgagat
3181 ttccaaaacc aatcaggact gatgtgactg gcatgatcct ggaagaaggt gcgcccgaag
3241 gcaccgtggt cacaatactc atcaaaaggc ctactggaga actcatgccc ctagcagcca
3301 gaatgggaac acacgcaacc atgaggattc aaggacgcac tgtcggggga cagatgggca
3361 tgcttctgac aggttccaac gccaaaagca tggatttagg caccacacca ggcgactgcg
3421 gttgccccta catctacaag agaggaaatg actatgtggt cattggagtc cacacggctg
3481 ccgcccgtgg aggaaacact gtcatatgtg ccacccaggg gagtgagggg gaagctacac
3541 ttgaaggtgg agacagcaag ggaacatact gtggtgcgcc aattctaggt ccagggagtg
3601 ccccaaaact cagcactaaa accaaattct ggaggtcatc cacagcacca cttccacctg
3661 gcacctacga gccagcctac cttggtggca aagaccccag ggtcaagggt gggccctcgt
3721 tgcaacaagt catgagggac cagctgaaac catttacaga gcctaggggt aaaccaccaa
3781 acccaggtgt attagaggct gccaagaaga ccatcatcaa tgtccttgaa caaacaattg
3841 acccacctga gaagtggtcg ttcgcacaag cttgtgcgtc cctcgacaag accacttcta
3901 gcggccatcc gcaccacatg cggaaaaacg actgctggaa cggggagtcc ttcacaggca
3961 agctggcaga ccaggcttcc aaggccaacc tgatgtacga acaagggaag aacatgaccc
4021 cagtctacac aggtgcactc aaggatgaat tagtcaaaac tgacaaaatt tatgacaaga
4081 ttaagaagag gctcctctgg ggctcggatt tggcaaccat gatccgttgc gctcgagcat
4141 tcggaggtct gatggacgaa ctcaaagcac actgtgtcac acttcctgtc agagttggta
4201 tgaatatgaa tgaggatggc cccatcatct ttgagaagca ttccaagtat agataccact
4261 atgatgctga ttactctcgg tgggactcaa cacaacaaag ggccgtgctg gcaactgctc
4321 tagaaatcat ggttaaattc tcctcagaac cacatttggc ccaggtagta gctgaagacc
4381 ttctttctcc tagtgtagtg gatgtaggtg acttcacaat atcaatcaac gagggtcttc
4441 cctcgggagt gccctgcacc tcccagtgga attccatcgc ccactggctt ctcactctct
4501 gtgcactttc cgaagtcaca aacttgtctc cagacgtcat acaggctaat tctcttttct
4561 ccttctatgg tgatgatgaa attgtcagta cagacataaa attggaccca gaaaagttga
4621 caacaaagct taaggaatat ggattgaaac caacccgtcc tgacaaaact gaagggcctc
4681 ttgtcatttc tgaagactta gatggtttga ccttcctgcg gagaaccgtg acccgcgacc
4741 cagctggttg gtttggaaaa ctggaccaaa gttcaatact caggcaaatg tactggacta
4801 ggggccctaa ccatgaagac ccatctgaat caatgatccc acactctcag agacccatac
4861 aactgatgtc cttactggga gaggccgcac tccacggccc aacattctac agcaaaatta
4921 gcaaattggt catcgcagaa ctcaaagaag gtggtatgga tttttacgtg cccaggcaag
4981 aaccaatgtt caggtggatg agattctcgg atctgagcac gtgggagggc gatcgcaatc
5041 tggctcccag ttttgtgaat gaagatggcg tcgaatgacg ccaacccatc tgatgggtcc
5101 gcagccaacc tcgtcccagg tgtcaacaat gaggtcatgg ctctggagcc cgttgccggt
5161 gccgccatcg cggcgcctgt agcgggccaa caaaatgtaa ttgacccctg gattagaaat
5221 aattttgtcc aagcccctgg tggagagttc acagtgtccc ctagaaacgc tccaggtgaa
5281 atactatgga gtgcgcccct aggccctgat ttgaatccct acctatccca tttggccaga
5341 atgtataatg gttatgcagg tggttttgaa gtgcaggtaa tcctcgcggg gaacgcgttc
5401 accgccggga aaatcatatt tgcagcagtc ccaccaaatt tcccaactga gggtttaaat
5461 cccagccagg tcactatgtt cccccacgtg ataacagatg tcagacaact ggaacctgta
5521 ttgatcccct tacctgatgt taggaacaac ttctatcatt ataaccagtc aaatgattcc
5581 acccttaaat tgatagcaat gctgtataca ccacttaggg ccaataatgc cggggatgat
5641 gtcttcacag tctcttgtcg ggttctcacg aggccatccc ctgactttga tttcatattt
5701 ctggtgccac caacagtcga gtcaaaaact aagccattca ctatcccaat cttgactgtt
5761 gaagaaatga ccaattcaag attccccatt cctttggaaa aattgtttac gggccccagc
5821 agtgcctttg ttgtccaacc acaaaatggc aggtgcacga ctgatggcgt gctcttaggc
5881 accacccaac tgtctcctgt caacatttgc actttcaggg gggatgtcgc ccacatcccg
5941 ggttctcgca attacacaat gaatctggcc tctctaaatt ggagcaatta tgacccaaca
6001 gaagaaattc cagcccctct gggaacccca gatttcgtgg gaaagatcca aggtgtactc
6061 actcaaacca caaggagtga tggctcaacc cggggccaca aagctacagt ttacactggg
6121 agtgccgact tcactccaaa gctgggcagt gttcaacttg ctactgacac agaaaatgat
6181 tttgaagccc gccaaaatgc aaaattcact ccaatcggtg tcatccagga tggcaacacc
6241 acccaccgga atgaacccca acaatgggtg ctcccaagtt actcaggtag aaatgcccac
6301 aatgtacacc tagcccctgc tgtagccccc agttttccgg gtgagcaact tcttttcttc
6361 agatccacac tgcccggatg cagcgggtac cccaatctgg acctggactg cctactcccc
6421 caggagtggg tgcagcactt ctaccaacaa gcagccccag cacaatctga tgtggcttta
6481 ctaagatttg tgaacccaga cacgagtagg gtcttgtttg agtgcaaact ccataaagca
6541 ggctacctca cagtgtctca cactggtcaa catgatttgg ttatccctcc caatggctac
6601 tttaggtttg attcctgggt caaccagttc tacacactcg cccccatggg aaatggaacg
6661 gggcgtaggc gtgctttgta atggctggag ccgtctttgc aggaatggca tctgatgtcc
6721 tcggctccgg acttagttcc ctaatcaatg ctggggctgg ggctatcaac cagaagattg
6781 actttgagaa caataaacaa ttgcagcaag cctcctttca gtttagcaat aatctacaac
6841 aaacttcctt tcagcatgac aaagagatgc tccaagcaca aattgaggcc accaaaaagt
6901 tgcaacagga aatgatgaaa gtcaaacagg cagtgctctt agaaggtggg ttctctgaaa
6961 cagatgcggc ccgtggggca atcaacgccc ctatgacaaa ggttttggat tggaacggga
7021 caaggtactg ggcccctgat gccagggtca caacatacaa ctcaggccgc ttctccactc
7081 cccagccttc gggggcactg ccaggaagaa tcaatcccag aactcccacc cccgctcggg
7141 gttcccctag cacatcttct aatgcttcta ctgtgacttc tgtgtattca aatcaaactg
7201 cttcaacgag acttggttct acagctggtt ctggcaccag tgtctcgagt ctcccgtcaa
7261 ctgcaaggac taggagttgg gttgaggatc aaaatagaaa tctgtcacct ttcatgaggg
7321 gggcccacaa catatcgtat gtcaccccac catctagcag atcctccagc caaggcacag
7381 tctcaaccgt gcccaaaggt gttttggact cctggactgg cgctttcaac acgcgcaggc
7441 agcctctctt cgcccacatc cgtacgcgag gggagtcacg ggtataatgt gaaaagacaa
7501 aattgattat ctttcccttt ctttagtgtc tttt
//