Typing tool

Complete norovirus genomes

OR051927  GII.4 Den Haag
 GII.P4 Den Haag

Length: 7,509 | 3 CDS

ORF1: 1..5100
ORF2: 5081..6703
ORF3: 6703..7509
LOCUS       OR051927                7509 bp    RNA     linear   VRL 06-OCT-2023
DEFINITION  Norovirus GII isolate GII/Hu/US/2014/GII.4DenHaag[P4]/NIH37.1
            nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes,
            complete cds.
ACCESSION   OR051927
VERSION     OR051927.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7509)
  AUTHORS   Chaimongkol,N., Dabilla,N., Tohma,K., Matsushima,Y., Behrle
            Yardley,A., Levenson,E.A., Johnson,J.A., Ahorrio,C., Oler,A.J.,
            Kim,D.Y., Souza,M., Sosnovtsev,S.V., Parra,G.I. and Green,K.Y.
  TITLE     Norovirus Evolves as One or More Distinct Clonal Populations in
            Immunocompromised Hosts
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7509)
  AUTHORS   Chaimongkol,N., Dabilla,N., Sosnovtsev,S.V. and Green,K.Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-MAY-2023) LID, NIAID, 50 South Drive/R6318, Bethesda,
            MD 20892, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: HIVE Hexagon/Heptagon v. 2
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7509
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="GII/Hu/US/2014/GII.4DenHaag[P4]/NIH37.1"
                     /isolation_source="feces"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="USA"
                     /collection_date="Nov-2014"
                     /note="genotype: GII.4"
     gene            1..5100
                     /gene="ORF1"
     CDS             1..5100
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="WHW97613.1"
                     /translation="MKMASNDASAAAVANSNDDTAKSSSDKMFSNMAVTFKRALGARP
                     KQPPPREIPQRPPRPPTPELIKKIPPPPPNGEGEVVVSYSAKDGVSGLPELSTVRQPE
                     ETNTAFSVPPLNQRESRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAIS
                     LAKVELTPLSLYWRPVYTPQYLISPDTLKRLSGETFPYTAFDNNCYAFCCWVLDLNDS
                     WLSRRMIHRTTGFFRPYQDWNRKPLPTMDDSKLKKAADIFLCALSSLFTRPIKDIIGK
                     LKPLNIINILASCDWTFAGIVESLILLAELFGGFWTPPDVSAMIAPLLGDFELQGPED
                     LVVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
                     KKEEVDELAMVRSIEDAILDLEAIENNHMTTLLKDKDSLATYMKTLDIEEEKARKLST
                     KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREI
                     AKRIATSLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
                     PLTLNCDRIENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
                     AKRDFPGQPDMWKNAFSSDFSHIKLTLAPQGGFDKNGNTPHGKGVMKTLTMGSLTARA
                     SGLLHERLDEFELQGPTLTTFNFDRNKVLAFRQLAAENKYGLMDTMKVGKRLKDVKTM
                     PELKQALKNTSIKKCQIVYSGCTYTLESDGNGNVKIDKVQSTSVQTNNELTGALHHLR
                     CARVRYYVKCVQEALYSILQIAGAAFVTARIIKRVNIQDLWSKPQVENTEEATSKDGC
                     PRPKDDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
                     KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
                     TGSEIRKRNPEDFKPRGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP
                     SLFITSSHVIPQGAKEFFGVPIKQIQVHKSGEFCHLRFPKPIRTDVTGMILEEGAPEG
                     TVVTILIKRSTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
                     CGCPYVYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
                     PGSAPKLSTKTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
                     RGKPPKPSVLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCW
                     NGESFTGKLADQASKANLMFEQGKNMTPIYTGALKDELVKTDKIYGTIKKRLLWGSDL
                     ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD
                     STQQRAVLAAALEIMVKFSSEPHLAQVVAEDLLSPSVMDVGDFTISINEGLPSGVPCT
                     SQWNSIAHWLLTLCAISETTNLSPDIVQANSQFSFYGDDEIVSTDIKLDPEKLTAKLK
                     EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
                     NHEDPSESMIPHSQRPVQLMSLLGEAALHGPTFYSKISKLVITELKEGGMDFYVPRQE
                     PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     1..990
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     991..2088
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2089..2625
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2626..3024
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3025..3567
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3568..5097
                     /gene="ORF1"
                     /product="RdRp"
     gene            5081..6703
                     /gene="ORF2"
     CDS             5081..6703
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="WHW97614.1"
                     /translation="MKMASNDANPSDGSAANLVPEVNNEVMALEPVAGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
                     VRNNFYHYNQSKDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFTVPILTVEEMTNSRFPISLDGLFTGPSNALVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFKGDVTHIAGSHNFTMNLASLNWNNYDPTEEIPAPLGTPDFVGKIHGML
                     TQTTRGDGSTRGHKATVYTGSADFTPKLGSVQFNTDTENDFDPHQNTKFTPVGVIQDG
                     NTTHRNEPQQWELPSYSGRGAQNVHLAPAVAPTFPGEQLLFFRSTLPGCSGYPNMDLD
                     CLLPQEWVQHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYITVAHTGQHNFV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
     gene            6703..7509
                     /gene="ORF3"
     CDS             6703..7509
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="WHW97615.1"
                     /translation="MAGALFAGMASDVLSSGLGSLINAGAGAINQKIDLENNKELQQA
                     SFQYSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAVLLEGGFSETDAARGAIN
                     APMTKALDWNGTRYWAPNANTTTYNTGHFSTPQSSGALSGRFNPRIPTPARGSSNTSS
                     NASATTSVYSNQTVSTRLGSTAGSGTNVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI
                     SFVTPPSSKSSSQGTVSTVPKEILDSWTGAFNTRKQPLFAQLRRRGESRV"
ORIGIN      
        1 atgaagatgg cgtctaacga cgcttccgct gccgctgttg ctaacagcaa cgacgacacc
       61 gcaaaatctt caagtgacaa aatgttttct aacatggctg tcacttttaa acgagccctc
      121 ggggcacggc ctaaacagcc ccccccgagg gaaataccac aaagaccccc acgaccacct
      181 actccagagc tgatcaaaaa gatccctcct cccccaccca acggggaggg tgaagtagtg
      241 gtttcttaca gtgccaaaga tggtgtttcc ggtttgcctg agctttccac cgtcaggcaa
      301 ccggaagaaa ccaacacggc cttcagtgtc cctccgctca accagaggga aagcagggat
      361 gctaaggagc cactgactgg aacaattctg gaaatgtggg atggggaaat ctaccattat
      421 ggcctgtatg ttgagcgagg tcttgtactg ggtgtgcaca aaccaccagc tgccattagc
      481 ctcgccaagg tcgaactaac accactctcc ttgtactgga gacctgtgta caccccccag
      541 tacctcatct ctccagacac tctcaagagg ttaagcggag aaacatttcc ctatacagcc
      601 tttgacaaca actgctatgc tttttgttgc tgggtcctgg atctaaacga ctcatggctg
      661 agtaggagaa tgatccatag aacaactggc ttcttcagac cctaccaaga ttggaataga
      721 aaacccctcc ccactatgga tgattccaaa ttgaagaagg cagctgacat attcctgtgt
      781 gccctgtctt cgctgttcac tagacccata aaagacataa taggaaagtt aaagcctctc
      841 aatatcatca acatcctggc ttcatgtgat tggactttcg caggcatagt ggaatcctta
      901 atactcttgg cagagctctt tggaggtttc tggacacccc cagatgtgtc tgcgatgatc
      961 gcccccttac tcggtgattt cgagttacaa ggacctgagg accttgtagt ggaactcgtc
     1021 cccgtagtga tgggggggat tggtttggtg ctgggattca ccaaagagaa gattgggaaa
     1081 atgttgtcat ctgcagcatc caccttgaga gcttgtaaag accttggtgc atatgggtta
     1141 gagatcctaa agttagtcat gaaatggttc ttcccgaaga aagaggaagt ggatgaactg
     1201 gctatggtga gatctattga ggatgcaata ttggaccttg aggcgattga gaacaaccat
     1261 atgaccacct tgctcaaaga caaagacagc cttgcaacat acatgaaaac ccttgacatc
     1321 gaggaagaaa aagccaggaa actctcaacc aagtctgctt cacctgacat cgtgggcaca
     1381 atcaacgccc tcctggcgag aatcgccgct gcacgttccc tggtgcaccg agcgaaggag
     1441 gagctttcca gcagaccaag acccgtggtc ttgatgatat caggcagacc aggaataggg
     1501 aagacccacc ttgctaggga aatagccaag agaatcgcaa cctccctcac aggagaccag
     1561 cgcgtgggcc tcatcccacg caacggcgtc gatcactggg atgcgtacaa gggggagagg
     1621 gtcgtcctgt gggacgacta tggcatgagc aaccccatcc acgacgccct caggctgcaa
     1681 gaactcgctg acacttgccc cctcactcta aattgtgaca ggattgagaa taaaggaaag
     1741 gtttttgaca gcgatgtcat catcataact actaatctgg ccaacccagc accactggac
     1801 tatgtcaact ttgaagcatg ctcaaggcgc attgacttcc tcgtgtatgc agaggccccc
     1861 gaggtcgaaa aggcaaagcg tgacttcccg ggccaacctg acatgtggaa aaacgctttt
     1921 agttcagatt tctcccacat aaaactgaca ctagctccac aaggtggctt tgacaagaac
     1981 gggaacaccc cacacgggaa gggcgtcatg aagactctca ccatgggctc cctcactgcc
     2041 cgggcatcag ggctgctcca tgagagactg gatgagtttg aactacaagg cccaactctc
     2101 accaccttca actttgaccg caacaaagtg cttgccttca ggcagcttgc tgctgaaaac
     2161 aaatatgggt tgatggacac aatgaaagtc gggaagcggc tcaaggatgt caaaaccatg
     2221 ccagaactta aacaagcact caagaacacc tcaatcaaga aatgccaaat tgtgtacagt
     2281 ggttgcacct acacactcga gtctgatggc aatggtaatg tgaaaattga caaagttcag
     2341 agcacctccg tccagaccaa caatgagctg actggcgccc tgcaccatct aaggtgcgcc
     2401 agagtcaggt actatgtcaa gtgtgttcag gaagccctct attctatcct ccagattgct
     2461 ggggctgcat ttgttaccgc gcgcatcatc aagcgtgtga acattcaaga cttatggtcc
     2521 aagccacaag tggaaaacac agaggaggct accagcaagg acgggtgccc aaggcccaaa
     2581 gacgatgagg agttcgtcat ttcatctgac gacattaaaa ctgagggtaa gaaagggaag
     2641 aataaaactg gtcgtggcaa gaagcacaca gccttctcga gtaaaggtct cagtgatgaa
     2701 gagtatgatg agtacaagag aattagagag gaaagaaatg gcaagtattc catagaagag
     2761 taccttcagg acagggacaa atactatgag gaggtggcca tcgccagggc gaccgaggaa
     2821 gacttctgtg aagaggagga ggccaagatc cggcaaagga tcttcaggcc gaccaggaaa
     2881 caacgcaagg aagaaagggc ttctctcggt ttagtcacag gttctgaaat taggaaaaga
     2941 aacccagaag atttcaagcc cagggggaaa ctatgggctg acgatgacag gagtgtggac
     3001 tataatgaaa aactcagttt tgaggcccca ccaagcatct ggtcaagaat agtcaacttt
     3061 ggctcaggtt ggggcttctg ggtctcccct agcctgttca taacgtcatc ccacgtcata
     3121 ccccagggtg caaaggagtt ctttggagtc ccaataaaac aaatccaggt gcacaagtcg
     3181 ggcgaattct gtcacttgag attcccaaaa ccaatcagga ctgatgtgac tggtatgatc
     3241 ttggaagaag gtgcgcccga aggcaccgtg gtcacaatac tcatcaaaag atccactgga
     3301 gaactcatgc ccctagcagc tagaatgggg acccatgcaa ccatgaaaat ccaaggacgc
     3361 accgttggag gccaaatggg catgcttcta acaggatcca acgccaaaag catggatcta
     3421 ggcaccacac caggtgattg cggctgtccc tatgtctaca agagaggaaa tgactacgtg
     3481 gtcattggag tccacacggc tgctgctcgt gggggaaaca ctgtcatatg tgcaacccaa
     3541 gggagtgagg gggaagccac acttgaaggt ggtgacagta aggggacata ctgtggtgca
     3601 ccaatcctag gcccagggag tgccccaaaa cttagcacca aaaccaagtt ctggagatca
     3661 tccacagcac cgctcccacc tggcacctat gaaccagcct accttggtgg caaagacccc
     3721 agagtcaagg gtggcccctc gctgcagcaa gttatgaggg accagttaaa accatttaca
     3781 gagcctaggg gtaaaccacc aaagccaagt gtgttggaag ctgccaagaa aaccatcatc
     3841 aatgtccttg agcagacaat tgacccacct gagaagtggt cgttcgcaca agcttgcgcg
     3901 tcccttgaca agaccacttc tagcggccat ccgcaccaca tgcggaaaaa cgactgctgg
     3961 aacggggaat ccttcacagg caagctggca gaccaggctt ccaaggccaa cctgatgttt
     4021 gaacagggaa agaacatgac cccaatctac actggtgcac ttaaggatga gttagtcaaa
     4081 actgacaaaa tttatggcac tatcaagaag aggcttctct ggggctcgga cttagcaacc
     4141 atgatccggt gcgctcgagc atttgggggc ttgatggatg aactcaaagc acactgtgtc
     4201 acactccctg tcagagttgg tatgaacatg aatgaagatg gccccatcat cttcgagaag
     4261 cactccaggt acaaatacca ttatgatgct gattactctc ggtgggattc aacacaacag
     4321 agggccgtgc tggcagctgc tctagaaatc atggttaaat tctcctcaga accacatttg
     4381 gctcaggtgg tggcagaaga tcttctctcc cctagcgtga tggatgtagg tgacttcaca
     4441 atatcaatca acgagggtct tccctctggg gtgccctgca cctcccaatg gaactccatc
     4501 gcccactggc tcctcaccct ctgtgcaatc tccgagacca caaatttgtc cccagacatc
     4561 gtgcaggcta actctcaatt ctccttctat ggtgatgatg aaattgtcag tacagatata
     4621 aaattggacc cagaaaagtt aacagcaaag ctcaaggaat atgggctgaa accaacccgc
     4681 cctgacaaaa ctgaaggacc tctcgtcatt tctgaagact tagacggttt gactttcctg
     4741 cggagaactg tgacccgcga cccagctggt tggtttggaa aactggaaca gagttcaata
     4801 ctcaggcaaa tgtactggac taggggtccc aaccacgaag acccatctga atcaatgatc
     4861 ccacactctc aaaggcccgt acaattgatg tccttgctgg gagaggccgc actccacggc
     4921 ccaacattct acagtaaaat cagcaaattg gtcattacag agctcaaaga aggtggcatg
     4981 gatttttacg tgcccagaca agaaccaatg ttcaggtgga tgagattctc agatctgagc
     5041 acgtgggagg gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgaatga
     5101 cgccaaccca tctgatgggt ccgcagccaa cctcgtccca gaggtcaaca atgaggttat
     5161 ggctttggag cccgtcgccg gtgccgctat tgcggcacct gtagcgggcc aacaaaatgt
     5221 aattgacccc tggattagaa ataattttgt acaagcccct ggtggagaat tcacagtatc
     5281 ccctagaaac gctccgggtg aaatactatg gagcgcgccc ttaggtcctg atctgaatcc
     5341 ctacctatct catttggcca gaatgtataa tggttatgca ggtggttttg aagtgcaggt
     5401 gatcctcgcg gggaacgcgt tcaccgcggg aaaaattata tttgcagcag tcccaccaaa
     5461 ttttccaact gaaggcttga gtcccagtca ggttactatg ttcccccaca taatagtaga
     5521 tgttaggcaa ttagaacctg tgttaattcc cttacctgat gttaggaaca atttctacca
     5581 ttataatcag tcaaaagatt ccaccatcaa attgatagca atgctgtata caccacttag
     5641 ggccaataat gctggggatg atgtcttcac agtctcctgt cgagttctca cgagaccatc
     5701 ccctgatttt gactttatat tcttggtacc acccacagtt gagtcaagaa ctaaaccttt
     5761 cactgtccca atcctgactg ttgaagaaat gaccaattca agattcccca tttctctgga
     5821 cgggttgttt acgggtccca gcaatgccct tgttgtccaa ccacaaaatg gcagatgcac
     5881 gactgatggc gtgcttttgg gcaccaccca actgtctcct gttaacatct gtactttcaa
     5941 aggggacgtc acccacattg caggttctca caatttcaca atgaatttgg cttctctaaa
     6001 ttggaacaat tatgacccaa cagaagaaat tccagcccct ttgggaactc cagatttcgt
     6061 gggcaagatc catggtatgc tcactcaaac tacaagagga gatggctcga cccggggcca
     6121 caaggccaca gtatacactg gaagtgctga cttcactcca aagctgggca gtgttcaatt
     6181 taacactgat acagaaaatg attttgaccc tcaccaaaac acaaaattca ccccagtcgg
     6241 tgtcatccag gatggtaaca ccacccaccg aaatgaaccc caacaatggg agctcccaag
     6301 ttattcaggt agaggtgccc aaaatgtaca cctagcccct gctgtggccc ccactttccc
     6361 gggtgaacaa cttcttttct tcagatccac tctccccgga tgcagcgggt atcccaacat
     6421 ggatttagat tgcctactcc cccaagagtg ggtgcagcac ttctaccagg aagcagctcc
     6481 agcacaatct gatgtggctc tattaagatt tgtgaatcca gacacgggta gggtcctgtt
     6541 tgaatgcaaa cttcataaat caggctatat cacagtggct cacaccggcc agcataattt
     6601 tgtcatcccc cccaatggct acttcaggtt tgactcctgg gttaatcaat tctacacact
     6661 tgcccccatg ggaaatggaa cggggcgtag gcgtgctcta taatggcagg agctctcttt
     6721 gctggaatgg catctgatgt ccttagctct ggacttggtt ccctaatcaa tgctggggct
     6781 ggggctatca accaaaagat tgatcttgaa aataacaaag aattgcagca agcttccttc
     6841 cagtatagca gtaacctgca gcaggcctcc tttcaacatg ataaagagat gctccaagca
     6901 caaattgaag ccactaaaag gttgcaacag gaaatgatga aagtcaaaca ggcagtgctc
     6961 ttagagggtg gattctctga aacagatgca gcccgtgggg caatcaacgc ccccatgaca
     7021 aaggctttgg attggaacgg aacgaggtac tgggccccta atgctaatac cacaacatac
     7081 aatacaggcc acttttccac tccgcaatct tcgggggcgc tgtcaggaag atttaatccc
     7141 aggattccca cccccgctcg gggctcctct aatacatctt ctaatgcttc tgctaccact
     7201 tctgtgtatt caaatcaaac tgtttcaacg agacttggtt ctacagctgg ttctggcacc
     7261 aatgtctcga gtctcccgtc aactgcaagg actaggagtt gggttgagga tcaaaacaga
     7321 aacttgtcac ctttcatgag gggggcccac aacatatcgt ttgtcacccc accatctagc
     7381 aaatcctcta gccaaggcac agtctcaacc gtgcctaaag aaattttgga ctcctggact
     7441 ggcgctttca acacgcgcaa gcagcctctc ttcgcccaac ttcgtaggcg aggggagtca
     7501 cgggtgtaa
//