Typing tool

Complete norovirus genomes

OL943799  GII.4 Den Haag
 GII.P4 Den Haag

Length: 7,558 | 3 CDS

ORF1: 5..5104
ORF2: 5085..6707
ORF3: 6707..7513
LOCUS       OL943799                7558 bp    RNA     linear   VRL 25-DEC-2021
DEFINITION  Norovirus GII isolate Hu/US/2012/GII.4 Den Haag[P4 Den
            Haag]/Guatemala0054, complete genome.
ACCESSION   OL943799
VERSION     OL943799.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7558)
  AUTHORS   Montmayeur,A.M., Lopez,M.R., Chhabra,P. and Vinje,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-DEC-2021) Viral Gastroenteritis Branch, Centers for
            Disease Control and Prevention, 1600 Clifton Road NE, Atlanta, GA
            30329, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: Geneious v. v. 11.1.2; SPAdes v. v. 3.6
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7558
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="Hu/US/2012/GII.4 Den Haag[P4 Den
                     Haag]/Guatemala0054"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Guatemala"
                     /collection_date="21-May-2012"
                     /note="genotype: GII.4 Den Haag[P4 Den Haag]"
     gene            5..5104
                     /gene="ORF1"
     CDS             5..5104
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="UHH89709.1"
                     /translation="MKMASNDASAAAVANSNNDAAKSSSDKMFSNMAVTLKRALGARP
                     KQPPPREIPQRPPRPPTPELVKKIPPPPPNGEDEVVVSYSVKDGVSGLPELSTVRQPE
                     ETNTAFSVPPLNQRESRDAKEPLTGTILEMWDGEIYHYGLYVEQGFVLGVHKPPAAIS
                     LAKVELTPLSLFWRPVYTPQYLISPDTLKKLHGETFPYTAFDNNCYAFCCWVLDLNDS
                     WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCALSSLFTRPIKDLIGK
                     LRPLNIINILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDFELQGPED
                     LVVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
                     KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
                     KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREV
                     AKRIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
                     PLTLNCDRIENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
                     AKRDFPGQPDMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
                     SGLLHERLDEFELQGPTLTTFNFDRNKVLAFRQLAAENKYGLMDTMKVGRQLKDVKTM
                     PELKQALKNISIKKCQIVYSGCTYTLESDGKGNVRVDRVQSTSVQTNNELAGALHHLR
                     CARIRYYVKCVQEALYSIIQIAGAAFVTTRIIKRVNIQDLWSKPQVENTEEATSKDGC
                     PKPKDDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
                     KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRRQRKEERASLGLV
                     TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP
                     SLFITSTHVIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVAGMILEEGAPEG
                     TVVSLLIKRSTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
                     CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGGEGEATLEGGDNKGTYCGAPILG
                     PGSAPKLSTKTKFWRSSTAPLPPGTYEPAYLGGKDPRIKGGPSLQQVMRDQLKPFTEP
                     RGKPPKPSVLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCW
                     NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL
                     ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
                     STQQRAVLAAALEIMVKFSSEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCT
                     SQWNSIAHWLLTLCALSEVTNLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
                     EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
                     NHEDPSESMIPHSQRPIQLMSLLGEAALHGPTFYSKISKLVIAELKEGGMDFYVPRQE
                     PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     5..994
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     995..2092
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2093..2629
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2630..3028
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3029..3571
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3572..5101
                     /gene="ORF1"
                     /product="RdRp"
     gene            5085..6707
                     /gene="ORF2"
     CDS             5085..6707
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="UHH89710.1"
                     /translation="MKMASNDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPILTVEEMTNSRFPIPLEKLFTGPSSTFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHIAGSRNYTMNLASQNWNSYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFSPKLGRVQFATDTDNDFVANQNTKFTPVGVIQDG
                     DTAHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
     gene            6707..7513
                     /gene="ORF3"
     CDS             6707..7513
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="UHH89711.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQSQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN
                     APMTKVLDWSGTRYWAPDARTTTYNAGRFSTPQPSGAPPGRTNLRAAAPARGSSSTSS
                     NSSIATSVYSSQTTSTRLGSTAGSGTSVSSLPSSARTRSWVEDQNRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN      
        1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgctaaca gcaacaacga
       61 tgccgcaaaa tcttcaagtg acaaaatgtt ttctaacatg gctgtcactc ttaaacgagc
      121 cctcggggcg cggcctaaac agcccccccc gagggaaata ccacaaagac ccccacgacc
      181 acccactcca gaactggtca aaaagatccc tcctcccccg cccaacggag aggatgaagt
      241 ggtggtttct tatagtgtca aagatggtgt ttccggtttg cctgagcttt ccaccgtcag
      301 gcaaccggaa gaaaccaata cggccttcag tgtccctcca ctcaatcaga gggagagtag
      361 ggatgctaag gaaccactaa ctgggacaat tctggaaatg tgggatggag aaatctacca
      421 ttatggtctg tatgttgagc aaggttttgt gctgggcgta cacaagccac cagctgccat
      481 tagcctcgcc aaggtcgaat taacaccact ctccttgttc tggagacctg tgtacactcc
      541 tcagtacctc atttctccag acaccctcaa gaaattacac ggagaaacat ttccctacac
      601 agcctttgac aacaattgct atgccttttg ttgctgggtc ctggatttaa acgattcgtg
      661 gctgagtagg agaatgatcc agagaacaac aggcttcttc agaccctacc aagattggaa
      721 taggaaaccc ctccccacta tggatgactc caaattaaag aaggtggcta acatattcct
      781 gtgcgccctg tcttcgttgt tcaccaggcc cataaaagac ttaataggaa aattaaggcc
      841 tctcaacatc atcaacatcc tggcttcatg tgattggact ttcgcgggca tagtggagtc
      901 cttgatactc ttggcggagc tctttggagt cttctggaca cccccagatg tgtctgcgat
      961 gatcgccccc ttactcggtg atttcgagtt acaaggacct gaggaccttg tagtggagct
     1021 cgtccctgtg gtaatggggg ggattggttt ggtgctagga ttcaccaaag aaaagattgg
     1081 aaaaatgttg tcatctgctg catccacctt gagagcttgc aaagaccttg gtgcatatgg
     1141 gctagagatc ttaaagttag tcatgaagtg gttcttcccg aagaaagagg aagcgaatga
     1201 actggccatg gtgagatcca tcgaggatgc agtactggac cttgaggcaa ttgaaaacaa
     1261 ccatatgacc accttgctca aagacaaaga tagcctggca acctacatga gaaccctcga
     1321 cctcgaggaa gagaaagcca gaaaactctc aaccaagtct gcttcacctg acatcgtggg
     1381 cacaattaac gcacttctgg cgagaatcgc cgctgcacgc tccctggtac acagagcaaa
     1441 ggaggagctt tccagcagac caagacctgt agtcttgatg atatcaggca ggccaggaat
     1501 agggaagacc caccttgcta gggaagtggc taagagaatc gcagcctccc tcacaggaga
     1561 ccagcgtgta ggcctcatcc cacgcaatgg cgttgatcac tgggatgcgt acaaggggga
     1621 aagggtcgtc ctatgggacg actatggaat gagcaacccc atccatgacg ctctcaggct
     1681 gcaagaactc gctgacactt gccccctcac tctaaattgt gacaggattg agaataaagg
     1741 aaaggtcttt gacagcgatg tcatcattat cactactaat ctggccaacc cagcaccact
     1801 ggactatgtc aactttgagg catgctcgag acgcatcgac ttcctcgtgt acgcagaagc
     1861 ccccgaggtc gaaaaggcaa agcgcgactt cccgggccag cctgacatgt ggaaaaacgc
     1921 ttttagttct gatttctcac acataaaatt ggcactggct ccacaaggtg gctttgataa
     1981 gaacgggaac accccacacg ggaagggcgt catgaagact ctcaccactg gctccctcat
     2041 tgctcgggca tcagggctgc tccatgagag attggatgag ttcgagctac agggtccaac
     2101 acttaccacc ttcaacttcg accgcaacaa agtgcttgcc ttcaggcagc ttgccgctga
     2161 aaataaatat ggattgatgg acacaatgaa agttgggagg cagctcaagg atgtcaaaac
     2221 catgccagaa cttaagcaag cactcaagaa tatctcaatc aaaaagtgcc agattgtgta
     2281 cagtggttgc acctacacac ttgagtctga tggcaagggc aacgtgagag ttgacagagt
     2341 acagagcacc tccgttcaga ccaacaatga gctggctggc gccctgcacc atctaaggtg
     2401 cgccaggatc aggtattatg ttaagtgtgt tcaggaagcc ctgtattcta tcatccagat
     2461 tgctggggct gcatttgtca ccacgcgcat catcaagcgt gtgaacattc aagacttatg
     2521 gtccaagcca caagtggaaa acacagagga ggctaccagc aaggacgggt gcccaaaacc
     2581 caaagatgat gaagagttcg tcatttcatc tgacgacatt aaaactgagg gtaagaaagg
     2641 gaagaacaag actggccgtg gcaagaagca cacagccttc tcaagtaaag gtctcagtga
     2701 tgaagagtat gatgaataca agagaattag agaggaaaga aatggcaagt actccataga
     2761 ggagtacctc caggacaggg acaaatacta tgaagaggtg gccattgcca gggcgaccga
     2821 ggaagacttc tgtgaagagg aggaggccaa gatccggcaa aggatcttta gaccaacaag
     2881 gagacaacgc aaggaagaaa gagcttctct cggtttagtc acaggttctg aaattaggaa
     2941 aagaaatcca gaagacttca agcccaaggg gaaactatgg gctgacgatg acagaagtgt
     3001 ggactataat gaaaaactca gttttgaggc cccaccaagc atctggtcaa ggatagtcaa
     3061 cttcggctca ggttggggct tctgggtctc ccccagcctg ttcataacat caacccacgt
     3121 cataccccag ggcgcaaagg agttctttgg agtccccatc aaacaaattc aggtgcacaa
     3181 gtcaggtgaa ttctgtcgct tgaggttccc aaaaccaatc aggaccgatg tggctggcat
     3241 gatcttggaa gaaggtgcgc ccgaaggcac cgtggtctca ctactcatca aaaggtctac
     3301 tggagaactc atgcccctag cagctagaat gggaacccac gcaaccatga agattcaagg
     3361 acgcactgtt ggaggccaga tgggcatgct tctgacagga tccaacgcga aaagcatgga
     3421 tctaggcacc acaccaggcg attgcggctg tccctacatc tacaaaagag gaaacgacta
     3481 tgtagttatt ggggtccaca cggctgccgc tcgtggggga aacactgtca tatgtgccac
     3541 ccaggggggt gagggggaag ctacacttga aggtggtgac aataagggaa catactgtgg
     3601 tgcaccaatc ctaggtccag ggagtgcccc aaaactcagc accaaaacca aattctggag
     3661 atcatccaca gcaccactcc cacctggcac ctatgaacca gcctaccttg gtggcaaaga
     3721 ccccagaatc aagggtggcc cctcgctgca gcaagtcatg agggaccaac tgaaaccatt
     3781 cacggagcct aggggtaagc caccaaagcc aagtgtgtta gaagctgcca agaaaaccat
     3841 catcaatgtc cttgagcaga caattgatcc acctgagaag tggtcgttcg cacaagcttg
     3901 cgcgtccctt gataagacca cttctagcgg ccatccgcac cacatgcgga aaaatgactg
     3961 ctggaacggg gagtccttca caggtaagct ggcagaccag gcttccaaag ccaatctgat
     4021 gtttgaagag gggaagaaca tgaccccagt ctacacaggt gcacttaagg atgaattagt
     4081 caaaactgac aaaatttatg gtaagatcaa gaagaggctt ctctggggct cggacctggc
     4141 aaccatgatc cggtgtgctc gagcgttcgg aggtctgatg gatgagctca aagcacactg
     4201 tgtcacactt cctgtcagag ttggcatgaa tatgaatgag gacggtccca tcatcttcga
     4261 gaagcattcc aggtacagat accattatga cgctgattac tctcggtggg actcaacaca
     4321 acagagagcc gtgctggcag ctgctctaga aatcatggtt aaattctcct cagaaccaca
     4381 tttggctcag gtagtagcag aagaccttct ttctcctagc gtagtggatg tgggtgactt
     4441 cacaatatca atcaacgagg gtcttccctc tggggtgccc tgcacctccc aatggaactc
     4501 catcgcccac tggcttctca ctctttgtgc gctctccgaa gtcacaaatt tgtctccaga
     4561 catcatacag gctaattctc tcttctcctt ctatggtgat gatgaaattg ttagcacaga
     4621 cataaaatta gacccagaaa aattaacagc aaaactcaag gaatatgggt tgaaaccaac
     4681 ccgccctgac aaaactgaag ggcctcttgt tatttctgaa gacttagacg gtttgacttt
     4741 cctgcggaga actgtgaccc gcgacccagc tggttggttt ggaaaactgg agcaaagctc
     4801 aatactcagg caaatgtact ggaccagggg ccccaatcat gaagacccat ctgaatcaat
     4861 gatcccgcac tctcaaagac ccatacaatt gatgtcctta ctgggagagg ccgcactcca
     4921 cggcccaaca ttctacagta aaatcagcaa attagttatt gctgagctta aagaaggtgg
     4981 catggatttt tacgtgccca ggcaagagcc aatgttcaga tggatgagat tctcggatct
     5041 gagcacgtgg gagggcgatc gcaatctggc tcccagcttt gtgaatgaag atggcgtcga
     5101 atgacgccaa cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg
     5161 ttatggcttt ggagcccgtt gttggtgccg ctattgcggc gcctgtagcg ggccaacaaa
     5221 atgtaattga cccctggatt agaaataatt ttgtacaagc ccctggtgga gagttcacag
     5281 tatcccctag aaacgctcca ggtgaaatac tatggagcgc gcccttaggc cctgatctaa
     5341 atccctacct atctcatttg gccagaatgt ataatggtta tgcaggtggt tttgaagtgc
     5401 aggtgattct cgcggggaac gcgttcaccg ccggaaaaat tatatttgca gcagtcccac
     5461 caaattttcc aactgaaggc ttaagcccca gccaggttac tatgttcccc cacataatag
     5521 tagatgttag gcaactggaa cctgtgttga tccccttacc tgatgttaga aataacttct
     5581 atcattataa tcagtcaaat gattctacca ttaaattgat agcaatgctg tacacaccac
     5641 ttagggctaa taatgctggg gatgatgtct ttacagtttc ttgtcgagtt ctcacgagac
     5701 catcccccga ttttgatttc atatttttag taccacctac agttgaatca agaactaaac
     5761 cattctctgt cccaatttta accgttgaag aaatgaccaa ttcaaggttc cccattcctt
     5821 tggaaaagtt gttcacgggc cccagtagta cctttgttgt tcaaccacaa aatggcaggt
     5881 gcacgactga tggcgtgctc ctaggtacca cccaactgtc tcctgtcaac atctgcacct
     5941 tcagagggga tgtcacccac attgcaggca gtcgtaacta cacaatgaat ctggcttccc
     6001 aaaattggaa cagttatgac ccaacagaag aaatcccagc ccctctagga actccagatt
     6061 tcgtggggaa gattcaaggt gtgctcaccc aaaccacaag gacagatggc tcgacccgcg
     6121 gtcacaaagc tacagtgtac actgggagcg ccgacttttc tccaaaactg ggtagagtcc
     6181 aatttgccac tgacacagac aatgatttcg ttgctaacca aaacacaaag ttcaccccag
     6241 tcggtgttat ccaggatggt gatactgccc accgaaatga accccaacaa tgggtgctcc
     6301 caagctactc aggcagaaac acccataatg tgcacctggc ccccgctgta gcccccactt
     6361 ttccgggtga gcaactcctc ttcttcaggt ctaccatgcc cggatgcagc gggtacccca
     6421 acatggattt ggactgtctg ctcccccagg aatgggtgca gtatttctac caggaagcag
     6481 ccccagcaca atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggttc
     6541 tgtttgagtg taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg
     6601 atttggttat cccccccaat ggttatttta gatttgattc ctgggtcaac cagttctaca
     6661 cacttgcccc catgggaaat gggacggggc gtaggcgtgc attataatgg ctggagcttt
     6721 ctttgctgga ttggcatctg acgtccttgg ctctggcctt ggttccctaa tcaatgctgg
     6781 ggctggggcc atcaaccaaa aagttgaatt tgaaaataac agaaaattgc aacaggcttc
     6841 cttccaattt agtagcaacc tacaacaagc ttcctttcaa catgacaaag agatgctcca
     6901 atcacaaatt gaggccacca aaaagttgca acaggaaatg atgagagtta aacaagcaat
     6961 gctcctagag ggtggatttt ctgagacaga tgcagctcgt ggggcaatca acgcccccat
     7021 gacaaaagtt ttggactgga gcgggacaag gtactgggct cccgatgcta ggactacaac
     7081 atataatgca ggccgctttt ccacccctca accctcgggg gcaccaccag gaagaactaa
     7141 tcttagggct gctgcccccg cccggggttc ctccagcaca tcttctaact cttctattgc
     7201 tacttctgtg tattcaagtc aaaccacttc aacgagactt ggttctacag ctggttctgg
     7261 taccagtgtc tcgagcctcc cgtcatctgc aaggactagg agctgggtcg aggatcaaaa
     7321 taggaatttg tcacctttca tgaggggggc ccacaacatc tcgtttgtca ccccaccatc
     7381 tagcagatcc tctagccaag gcacagtctc aaccgtgccc aaagaagttt tggactcctg
     7441 gactggcgct tttaatacgc gcaggcagcc tctcttcgct cacatccgta agcgagggga
     7501 gtcacgggtg taatgtgaaa agacaaattg attatctttc ttttctttag tgtctttt
//