Typing tool

Complete norovirus genomes

OK376714  GII.4 Den Haag | GII.P4 Den Haag

Length: 7,534 bp | 3 CDS

ORF1: <1..5078 (5' partial)
ORF2: 5059..6681
ORF3: 6681..7487
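
The ORF coordinates above are 1-based and inclusive, so lengths and overlaps follow from simple interval arithmetic. The sketch below is illustrative only: the names ORFS, orf_length and overlap are hypothetical helpers, not part of the typing tool. Note that ORF1 is 5'-partial in this record, so its nucleotide length need not be a multiple of three.

```python
# ORF coordinates taken from the summary (1-based, inclusive).
ORFS = {
    "ORF1": (1, 5078),
    "ORF2": (5059, 6681),
    "ORF3": (6681, 7487),
}


def orf_length(start, end):
    """Length in nucleotides of a 1-based inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Number of nucleotides shared by two 1-based inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


lengths = {name: orf_length(*coords) for name, coords in ORFS.items()}
orf1_orf2_overlap = overlap(ORFS["ORF1"], ORFS["ORF2"])
orf2_orf3_overlap = overlap(ORFS["ORF2"], ORFS["ORF3"])

for name, n in lengths.items():
    print(f"{name}: {n} nt")
print(f"ORF1/ORF2 overlap: {orf1_orf2_overlap} nt")
print(f"ORF2/ORF3 overlap: {orf2_orf3_overlap} nt")
```

With these coordinates ORF2 spans 1,623 nt and ORF3 spans 807 nt; ORF1 and ORF2 share 20 nt, and ORF2 and ORF3 share the single nucleotide at position 6681.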
LOCUS       OK376714                7534 bp    RNA     linear   VRL 01-JUL-2022
DEFINITION  Norovirus GII isolate 0407 nonstructural polyprotein (ORF1) gene,
            partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION   OK376714
VERSION     OK376714.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7534)
  AUTHORS   Roy,S., Tutill,H.J., Williams,R.J., Sheth,S., Celma,C., Allen,D.
            and Breuer,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-OCT-2021) Infection Immunity & Inflammation, UCL,
            ICH, Guilford St, London WC1N 1EH, United Kingdom
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics Workbench v. 11.01
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7534
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="0407"
                     /isolation_source="feces"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="United Kingdom"
                     /collection_date="21-Feb-2017"
                     /note="genotype:
                     GII.P4_Den_Haag_2006b_GII.4_Den_Haag_2006b"
     gene            <1..5078
                     /gene="ORF1"
     CDS             <1..5078
                     /gene="ORF1"
                     /codon_start=3
                     /product="nonstructural polyprotein"
                     /protein_id="UBX38814.1"
                     /translation="SAAAVANSNDDTTKSSSDKMFSNMAVTLKRALGARPKQPPPREI
                     PQRPPRPPTPELVKKVPPPPPNGEDEVVVSYSVKDGVSGLPELSTVRQPEETNTAFSV
                     PPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGIVLGVHKPPAAISLAKVELTP
                     LSLYWRPVYTPQYLIAPDTLKKLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQ
                     RTTGFFRPYQDWNRKPLPTMDDSKIKKVANIFLCALSSLFTRPIKDIIGKLRPLNIIN
                     ILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDFELQGPEDLVVELVPV
                     VMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANEL
                     AMVRSIEDAVLDLEAIENNHMTTLLKDKDTLATYMRTLDLEEEKARKLSTKSASPDIV
                     GTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGKPGIGKTHLAREVAKRIAASL
                     TGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDR
                     IENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQ
                     PDMWKSAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARATGLLHERL
                     DEFELQGPATTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGRQLKDVKTMPELKQALK
                     NISIKKCQIVYGGCTYALESDGKGNVKVDRVQNASVQTNNELSGALHHLKCARIRYYV
                     KCVQEALYSIIQIAGAAFVTTRIIKRVNIQDLWSRPQVEDTEETASKDGCPKPKDDEE
                     FVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYL
                     QDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKR
                     NPEDFKPKGKLWADDDRSVDYNERLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTH
                     VIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTILIK
                     RPTGELMPLAARMGTHATMRIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYK
                     RGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLS
                     TKTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPNPG
                     VLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGK
                     LADQASKANLMYEQGKNMTPVYTGALKDELVKTDKIYDKIKKRLLWGSDLATMIRCAR
                     AFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSKYRYHYDADYSRWDSTQQRAVL
                     ATALEIMVKFSSEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCTSQWNSIAH
                     WLLTLCALSEVTNLSPDVIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLKEYGLKPTR
                     PDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHEDPSES
                     MIPHSQRPIQLMSLLGEAALHGPTFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRF
                     SDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..968
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     969..2066
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2067..2603
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2604..3002
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3003..3545
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3546..5075
                     /gene="ORF1"
                     /product="RdRp"
     gene            5059..6681
                     /gene="ORF2"
     CDS             5059..6681
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="UBX38815.1"
                     /translation="MKMASNDANPSDGSAANLVPGVNNEVMALEPVAGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLNPSQVTMFPHVITDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDSTLKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESKTKPFTIPILTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVAHIPGSRNYTMNLASLNWSNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRSDGSTRGHKATVYTGSADFTPKLGSVQLATDTENDFEARQNAKFTPIGVIQDG
                     NTTHRNEPQQWVLPSYSGRNAHNVHLAPAVAPSFPGEQLLFFRSTLPGCSGYPNLDLD
                     CLLPQEWVQHFYQQAAPAQSDVALLRFVNPDTSRVLFECKLHKAGYLTVSHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
     gene            6681..7487
                     /gene="ORF3"
     CDS             6681..7487
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="UBX38816.1"
                     /translation="MAGAVFAGMASDVLGSGLSSLINAGAGAINQKIDFENNKQLQQA
                     SFQFSNNLQQTSFQHDKEMLQAQIEATKKLQQEMMKVKQAVLLEGGFSETDAARGAIN
                     APMTKVLDWNGTRYWAPDARVTTYNSGRFSTPQPSGALPGRINPRTPTPARGSPSTSS
                     NASTVTSVYSNQTASTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI
                     SYVTPPSSRSSSQGTVSTVPKGVLDSWTGAFNTRRQPLFAHIRTRGESRV"
ORIGIN      
        1 cttctgctgc cgctgttgct aacagcaacg acgacaccac aaaatcttca agtgacaaaa
       61 tgttttctaa catggctgtc acccttaaac gagccctcgg ggcgcggcct aaacagcccc
      121 ccccgaggga aataccacaa agacccccac ggccgcccac tccagaacta gtcaaaaagg
      181 tccctcctcc cccgcccaac ggagaggatg aagtagtggt ctcttatagt gtcaaagatg
      241 gcgtctccgg tttgcctgag ctatccaccg tcaggcaacc ggaagaaacc aacacggcct
      301 tcagtgtccc tccactcaac cagagggaga acagggatgc taaggaacca ctgactggga
      361 cgattcttga aatgtgggat ggagaaatct accattacgg cctgtacgtt gagcgaggta
      421 ttgtactggg tgtgcacaaa ccaccagctg ccatcagcct cgcaaaggtc gagttaacac
      481 cactctcctt gtactggaga cctgtgtaca cccctcagta cctcattgct ccagacactc
      541 tcaaaaagtt acacggagag acattcccct acacagcctt cgacaacaac tgctacgcct
      601 tttgttgctg ggtcctggac ctaaacgact cgtggctgag taggagaatg atccagagaa
      661 caactggctt ctttagaccc taccaagatt ggaataggaa acccctcccc actatggatg
      721 attccaagat aaagaaggta gctaacatat tcctgtgtgc cctgtcttcg ctgttcacca
      781 ggcccataaa agacataata ggaaagttaa gacctctcaa catcatcaac atcctggctt
      841 catgtgattg gacttttgca ggcatagtgg agtccttgat actcttggca gaactctttg
      901 gagtcttctg gacaccccca gatgtgtctg cgatgatcgc ccccttactc ggtgacttcg
      961 agctacaagg acctgaagac cttgtagtgg agctcgtccc tgtggtaatg ggggggattg
     1021 gtttggtgct aggattcacc aaagagaaga ttggaaaaat gttgtcatct gctgcatcca
     1081 ccctgagagc ttgcaaagac cttggtgcat atgggctgga gatcctaaag ctagtcatga
     1141 agtggttctt cccgaagaaa gaggaagcga acgaactggc tatggtgaga tccatcgagg
     1201 acgcagtgct ggacctcgag gcaattgaaa acaaccatat gaccaccttg ctcaaagata
     1261 aagacaccct ggcaacctac atgaggaccc tcgacctcga ggaagaaaag gccagaaagc
     1321 tctcaaccaa gtctgcctcg cctgacatcg tgggcacaat caacgcgctc ctggcgcgga
     1381 ttgctgctgc acgctccctg gtacaccgag cgaaggagga actttccagc agaccaagac
     1441 ccgtagtctt aatgatatcg ggcaagccag gaatagggaa gacccacctt gctagggagg
     1501 tggctaagag aatcgcagcc tccctcacag gagatcagcg cgttggtcta atcccacgca
     1561 atggcgtcga tcactgggat gcgtacaagg gggagagggt cgtcctgtgg gacgactatg
     1621 gaatgagcaa tcccatccac gatgccctta ggctgcaaga actcgctgac acttgccccc
     1681 tcaccctaaa ttgtgacagg attgagaaca aaggaaaggt ctttgacagc gacgtcatca
     1741 taatcaccac taatctggcc aacccagcac cactggacta cgtcaacttt gaagcatgct
     1801 cgaggcgtat cgatttcctc gtgtacgcag aagcccccga ggtcgaaaag gcgaaacgcg
     1861 acttcccggg tcaacctgac atgtggaaaa gcgcttttag ttctgacttc tcacacataa
     1921 aattggcact ggccccacaa ggtggttttg ataagaacgg gaacacccca cacgggaagg
     1981 gcgtcatgaa gaccctcacc accggctccc tcattgcccg ggcaacaggg ctactccatg
     2041 agagattaga tgagtttgaa ctgcagggcc cagccaccac caccttcaac ttcgaccgta
     2101 acaaggtgct cgccttcagg cagcttgctg ctgaaaacaa gtacgggttg atggacacaa
     2161 tgagagttgg gaggcagctc aaggatgtca agaccatgcc agaactcaaa caagcactca
     2221 agaatatatc aatcaagaag tgccaaattg tgtatggtgg ctgcacctac gcacttgagt
     2281 ctgatggcaa gggcaacgtg aaagttgaca gggttcagaa cgcctccgta cagaccaaca
     2341 atgagttgtc tggcgccctg catcacctca agtgcgccag aatcaggtac tatgttaagt
     2401 gtgtccagga ggccctgtac tctatcatcc aaattgctgg ggctgcattt gtcaccacgc
     2461 gcatcatcaa gcgtgtgaac atccaagatc tgtggtccag gccacaggtg gaagacacag
     2521 aggagactgc tagcaaggac gggtgcccga aacccaagga tgatgaggag ttcgtcattt
     2581 catctgacga catcaaaact gagggtaaga aagggaagaa caagactggc cgtggcaaga
     2641 aacacacagc cttctcaagc aaaggtctca gtgatgaaga gtatgatgaa tacaagagga
     2701 ttagagagga aagaaatggc aagtactcca tagaagaata ccttcaggac agggacaagt
     2761 actatgagga ggtggccatt gccagagcga ccgaggaaga cttctgtgaa gaggaggagg
     2821 ccaagatccg gcaaaggatc ttcagaccaa caaggaagca acgcaaggaa gaaagggctt
     2881 cactcggttt agtcacaggt tctgaaatca ggaaaaggaa tccagaagac ttcaagccta
     2941 aggggaaact atgggctgac gatgacagaa gtgtggacta caatgaaaga ctcagctttg
     3001 aggccccacc aagcatctgg tcaaggatag tcaactttgg ttcaggttgg ggcttctggg
     3061 tctcccccag cctgttcata acatcaaccc acgtcatacc ccagggcgca aaggagttct
     3121 tcggagtccc catcaaacaa attcaggtgc acaagtcagg cgaattctgt cgcttgagat
     3181 ttccaaaacc aatcaggact gatgtgactg gcatgatcct ggaagaaggt gcgcccgaag
     3241 gcaccgtggt cacaatactc atcaaaaggc ctactggaga actcatgccc ctagcagcca
     3301 gaatgggaac acacgcaacc atgaggattc aaggacgcac tgtcggggga cagatgggca
     3361 tgcttctgac aggttccaac gccaaaagca tggatttagg caccacacca ggcgactgcg
     3421 gttgccccta catctacaag agaggaaatg actatgtggt cattggagtc cacacggctg
     3481 ccgcccgtgg aggaaacact gtcatatgtg ccacccaggg gagtgagggg gaagctacac
     3541 ttgaaggtgg agacagcaag ggaacatact gtggtgcgcc aattctaggt ccagggagtg
     3601 ccccaaaact cagcactaaa accaaattct ggaggtcatc cacagcacca cttccacctg
     3661 gcacctacga gccagcctac cttggtggca aagaccccag ggtcaagggt gggccctcgt
     3721 tgcaacaagt catgagggac cagctgaaac catttacaga gcctaggggt aaaccaccaa
     3781 acccaggtgt attagaggct gccaagaaga ccatcatcaa tgtccttgaa caaacaattg
     3841 acccacctga gaagtggtcg ttcgcacaag cttgtgcgtc cctcgacaag accacttcta
     3901 gcggccatcc gcaccacatg cggaaaaacg actgctggaa cggggagtcc ttcacaggca
     3961 agctggcaga ccaggcttcc aaggccaacc tgatgtacga acaagggaag aacatgaccc
     4021 cagtctacac aggtgcactc aaggatgaat tagtcaaaac tgacaaaatt tatgacaaga
     4081 ttaagaagag gctcctctgg ggctcggatt tggcaaccat gatccgttgc gctcgagcat
     4141 tcggaggtct gatggacgaa ctcaaagcac actgtgtcac acttcctgtc agagttggta
     4201 tgaatatgaa tgaggatggc cccatcatct ttgagaagca ttccaagtat agataccact
     4261 atgatgctga ttactctcgg tgggactcaa cacaacaaag ggccgtgctg gcaactgctc
     4321 tagaaatcat ggttaaattc tcctcagaac cacatttggc ccaggtagta gctgaagacc
     4381 ttctttctcc tagtgtagtg gatgtaggtg acttcacaat atcaatcaac gagggtcttc
     4441 cctcgggagt gccctgcacc tcccagtgga attccatcgc ccactggctt ctcactctct
     4501 gtgcactttc cgaagtcaca aacttgtctc cagacgtcat acaggctaat tctcttttct
     4561 ccttctatgg tgatgatgaa attgtcagta cagacataaa attggaccca gaaaagttga
     4621 caacaaagct taaggaatat ggattgaaac caacccgtcc tgacaaaact gaagggcctc
     4681 ttgtcatttc tgaagactta gatggtttga ccttcctgcg gagaaccgtg acccgcgacc
     4741 cagctggttg gtttggaaaa ctggaccaaa gttcaatact caggcaaatg tactggacta
     4801 ggggccctaa ccatgaagac ccatctgaat caatgatccc acactctcag agacccatac
     4861 aactgatgtc cttactggga gaggccgcac tccacggccc aacattctac agcaaaatta
     4921 gcaaattggt catcgcagaa ctcaaagaag gtggtatgga tttttacgtg cccaggcaag
     4981 aaccaatgtt caggtggatg agattctcgg atctgagcac gtgggagggc gatcgcaatc
     5041 tggctcccag ttttgtgaat gaagatggcg tcgaatgacg ccaacccatc tgatgggtcc
     5101 gcagccaacc tcgtcccagg tgtcaacaat gaggtcatgg ctctggagcc cgttgccggt
     5161 gccgccatcg cggcgcctgt agcgggccaa caaaatgtaa ttgacccctg gattagaaat
     5221 aattttgtcc aagcccctgg tggagagttc acagtgtccc ctagaaacgc tccaggtgaa
     5281 atactatgga gtgcgcccct aggccctgat ttgaatccct acctatccca tttggccaga
     5341 atgtataatg gttatgcagg tggttttgaa gtgcaggtaa tcctcgcggg gaacgcgttc
     5401 accgccggga aaatcatatt tgcagcagtc ccaccaaatt tcccaactga gggtttaaat
     5461 cccagccagg tcactatgtt cccccacgtg ataacagatg tcagacaact ggaacctgta
     5521 ttgatcccct tacctgatgt taggaacaac ttctatcatt ataaccagtc aaatgattcc
     5581 acccttaaat tgatagcaat gctgtataca ccacttaggg ccaataatgc cggggatgat
     5641 gtcttcacag tctcttgtcg ggttctcacg aggccatccc ctgactttga tttcatattt
     5701 ctggtgccac caacagtcga gtcaaaaact aagccattca ctatcccaat cttgactgtt
     5761 gaagaaatga ccaattcaag attccccatt cctttggaaa aattgtttac gggccccagc
     5821 agtgcctttg ttgtccaacc acaaaatggc aggtgcacga ctgatggcgt gctcttaggc
     5881 accacccaac tgtctcctgt caacatttgc actttcaggg gggatgtcgc ccacatcccg
     5941 ggttctcgca attacacaat gaatctggcc tctctaaatt ggagcaatta tgacccaaca
     6001 gaagaaattc cagcccctct gggaacccca gatttcgtgg gaaagatcca aggtgtactc
     6061 actcaaacca caaggagtga tggctcaacc cggggccaca aagctacagt ttacactggg
     6121 agtgccgact tcactccaaa gctgggcagt gttcaacttg ctactgacac agaaaatgat
     6181 tttgaagccc gccaaaatgc aaaattcact ccaatcggtg tcatccagga tggcaacacc
     6241 acccaccgga atgaacccca acaatgggtg ctcccaagtt actcaggtag aaatgcccac
     6301 aatgtacacc tagcccctgc tgtagccccc agttttccgg gtgagcaact tcttttcttc
     6361 agatccacac tgcccggatg cagcgggtac cccaatctgg acctggactg cctactcccc
     6421 caggagtggg tgcagcactt ctaccaacaa gcagccccag cacaatctga tgtggcttta
     6481 ctaagatttg tgaacccaga cacgagtagg gtcttgtttg agtgcaaact ccataaagca
     6541 ggctacctca cagtgtctca cactggtcaa catgatttgg ttatccctcc caatggctac
     6601 tttaggtttg attcctgggt caaccagttc tacacactcg cccccatggg aaatggaacg
     6661 gggcgtaggc gtgctttgta atggctggag ccgtctttgc aggaatggca tctgatgtcc
     6721 tcggctccgg acttagttcc ctaatcaatg ctggggctgg ggctatcaac cagaagattg
     6781 actttgagaa caataaacaa ttgcagcaag cctcctttca gtttagcaat aatctacaac
     6841 aaacttcctt tcagcatgac aaagagatgc tccaagcaca aattgaggcc accaaaaagt
     6901 tgcaacagga aatgatgaaa gtcaaacagg cagtgctctt agaaggtggg ttctctgaaa
     6961 cagatgcggc ccgtggggca atcaacgccc ctatgacaaa ggttttggat tggaacggga
     7021 caaggtactg ggcccctgat gccagggtca caacatacaa ctcaggccgc ttctccactc
     7081 cccagccttc gggggcactg ccaggaagaa tcaatcccag aactcccacc cccgctcggg
     7141 gttcccctag cacatcttct aatgcttcta ctgtgacttc tgtgtattca aatcaaactg
     7201 cttcaacgag acttggttct acagctggtt ctggcaccag tgtctcgagt ctcccgtcaa
     7261 ctgcaaggac taggagttgg gttgaggatc aaaatagaaa tctgtcacct ttcatgaggg
     7321 gggcccacaa catatcgtat gtcaccccac catctagcag atcctccagc caaggcacag
     7381 tctcaaccgt gcccaaaggt gttttggact cctggactgg cgctttcaac acgcgcaggc
     7441 agcctctctt cgcccacatc cgtacgcgag gggagtcacg ggtataatgt gaaaagacaa
     7501 aattgattat ctttcccttt ctttagtgtc tttt
//
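
As a rough consistency check on the record above, the following sketch translates the first bases of ORF1 (honouring /codon_start=3) and the first bases of ORF2 (positions 5059 onward) and compares them with the start of each /translation qualifier. It assumes the standard genetic code; translate and CODON_TABLE are hypothetical names introduced here for illustration, and the two short sequences are copied from the ORIGIN block.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, codon_start=1):
    """Translate a nucleotide string, starting at 1-based offset codon_start.

    Unknown or ambiguous codons are rendered as 'X'; a trailing partial
    codon is ignored.
    """
    seq = seq.upper().replace("U", "T")
    first = codon_start - 1
    protein = []
    for i in range(first, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# First 35 bases of the record (ORIGIN positions 1..35).
orf1_head = "cttctgctgccgctgttgctaacagcaacgacgac"
# ORIGIN positions 5059..5079, the start of ORF2.
orf2_head = "atgaagatggcgtcgaatgac"

orf1_peptide = translate(orf1_head, codon_start=3)
orf2_peptide = translate(orf2_head, codon_start=1)
print(orf1_peptide)
print(orf2_peptide)
```

The two printed peptides should match the opening residues of the ORF1 and ORF2 /translation qualifiers (SAAAVANSNDD... and MKMASND...).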