Complete norovirus genomes
| OL943799 | GII.4 Den Haag | GII.P4 Den Haag |
|---|---|---|
ORF1: 5..5104
ORF2: 5085..6707
ORF3: 6707..7513
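
The three ORF spans above can be checked with simple coordinate arithmetic. The following is a minimal sketch, not part of the typing tool itself: the helper names (`span_length`, `overlap`, `tiles_without_gaps`) are hypothetical, and the coordinates are copied by hand from this record, including the `mat_peptide` spans listed in its FEATURES table. It assumes GenBank's 1-based, inclusive coordinate convention.

```python
# Coordinates copied from record OL943799 (1-based, inclusive).
ORFS = {"ORF1": (5, 5104), "ORF2": (5085, 6707), "ORF3": (6707, 7513)}

# mat_peptide spans of ORF1, in the order they appear in the FEATURES table.
MAT_PEPTIDES = {
    "p48": (5, 994),
    "NTPase": (995, 2092),
    "p22": (2093, 2629),
    "VPg": (2630, 3028),
    "Pro": (3029, 3571),
    "RdRp": (3572, 5101),
}


def span_length(start, end):
    """Length in nucleotides of an inclusive span."""
    return end - start + 1


def overlap(a, b):
    """Number of bases shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def tiles_without_gaps(spans):
    """True if each span starts exactly one base after the previous one ends."""
    ordered = sorted(spans)
    return all(nxt[0] == cur[1] + 1 for cur, nxt in zip(ordered, ordered[1:]))


for name, (start, end) in ORFS.items():
    nt = span_length(start, end)
    # A CDS with codon_start=1 should span a whole number of codons.
    print(name, nt, "nt,", nt // 3, "codons, remainder", nt % 3)

print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
print("mat_peptides contiguous:", tiles_without_gaps(MAT_PEPTIDES.values()))
```

With these coordinates, ORF1 spans 5100 nt (1700 codons including the stop), ORF1 and ORF2 share 20 nt, and ORF2 and ORF3 share a single base at position 6707. The six mature peptides tile ORF1 from 5 to 5101 without gaps, which leaves the final three bases of ORF1 for the stop codon.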
LOCUS OL943799 7558 bp RNA linear VRL 25-DEC-2021
DEFINITION Norovirus GII isolate Hu/US/2012/GII.4 Den Haag[P4 Den
Haag]/Guatemala0054, complete genome.
ACCESSION OL943799
VERSION OL943799.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7558)
AUTHORS Montmayeur,A.M., Lopez,M.R., Chhabra,P. and Vinje,J.
TITLE Direct Submission
JOURNAL Submitted (17-DEC-2021) Viral Gastroenteritis Branch, Centers for
Disease Control and Prevention, 1600 Clifton Road NE, Atlanta, GA
30329, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Geneious v. 11.1.2; SPAdes v. 3.6
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7558
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="Hu/US/2012/GII.4 Den Haag[P4 Den
Haag]/Guatemala0054"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="Guatemala"
/collection_date="21-May-2012"
/note="genotype: GII.4 Den Haag[P4 Den Haag]"
gene 5..5104
/gene="ORF1"
CDS 5..5104
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="UHH89709.1"
/translation="MKMASNDASAAAVANSNNDAAKSSSDKMFSNMAVTLKRALGARP
KQPPPREIPQRPPRPPTPELVKKIPPPPPNGEDEVVVSYSVKDGVSGLPELSTVRQPE
ETNTAFSVPPLNQRESRDAKEPLTGTILEMWDGEIYHYGLYVEQGFVLGVHKPPAAIS
LAKVELTPLSLFWRPVYTPQYLISPDTLKKLHGETFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCALSSLFTRPIKDLIGK
LRPLNIINILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDFELQGPED
LVVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREV
AKRIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEFELQGPTLTTFNFDRNKVLAFRQLAAENKYGLMDTMKVGRQLKDVKTM
PELKQALKNISIKKCQIVYSGCTYTLESDGKGNVRVDRVQSTSVQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIIKRVNIQDLWSKPQVENTEEATSKDGC
PKPKDDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRRQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVAGMILEEGAPEG
TVVSLLIKRSTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGGEGEATLEGGDNKGTYCGAPILG
PGSAPKLSTKTKFWRSSTAPLPPGTYEPAYLGGKDPRIKGGPSLQQVMRDQLKPFTEP
RGKPPKPSVLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
STQQRAVLAAALEIMVKFSSEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTNLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPSESMIPHSQRPIQLMSLLGEAALHGPTFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 5..994
/gene="ORF1"
/product="p48"
mat_peptide 995..2092
/gene="ORF1"
/product="NTPase"
mat_peptide 2093..2629
/gene="ORF1"
/product="p22"
mat_peptide 2630..3028
/gene="ORF1"
/product="VPg"
mat_peptide 3029..3571
/gene="ORF1"
/product="Pro"
mat_peptide 3572..5101
/gene="ORF1"
/product="RdRp"
gene 5085..6707
/gene="ORF2"
CDS 5085..6707
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="UHH89710.1"
/translation="MKMASNDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPILTVEEMTNSRFPIPLEKLFTGPSSTFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHIAGSRNYTMNLASQNWNSYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFSPKLGRVQFATDTDNDFVANQNTKFTPVGVIQDG
DTAHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6707..7513
/gene="ORF3"
CDS 6707..7513
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="UHH89711.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQSQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN
APMTKVLDWSGTRYWAPDARTTTYNAGRFSTPQPSGAPPGRTNLRAAAPARGSSSTSS
NSSIATSVYSSQTTSTRLGSTAGSGTSVSSLPSSARTRSWVEDQNRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgctaaca gcaacaacga
61 tgccgcaaaa tcttcaagtg acaaaatgtt ttctaacatg gctgtcactc ttaaacgagc
121 cctcggggcg cggcctaaac agcccccccc gagggaaata ccacaaagac ccccacgacc
181 acccactcca gaactggtca aaaagatccc tcctcccccg cccaacggag aggatgaagt
241 ggtggtttct tatagtgtca aagatggtgt ttccggtttg cctgagcttt ccaccgtcag
301 gcaaccggaa gaaaccaata cggccttcag tgtccctcca ctcaatcaga gggagagtag
361 ggatgctaag gaaccactaa ctgggacaat tctggaaatg tgggatggag aaatctacca
421 ttatggtctg tatgttgagc aaggttttgt gctgggcgta cacaagccac cagctgccat
481 tagcctcgcc aaggtcgaat taacaccact ctccttgttc tggagacctg tgtacactcc
541 tcagtacctc atttctccag acaccctcaa gaaattacac ggagaaacat ttccctacac
601 agcctttgac aacaattgct atgccttttg ttgctgggtc ctggatttaa acgattcgtg
661 gctgagtagg agaatgatcc agagaacaac aggcttcttc agaccctacc aagattggaa
721 taggaaaccc ctccccacta tggatgactc caaattaaag aaggtggcta acatattcct
781 gtgcgccctg tcttcgttgt tcaccaggcc cataaaagac ttaataggaa aattaaggcc
841 tctcaacatc atcaacatcc tggcttcatg tgattggact ttcgcgggca tagtggagtc
901 cttgatactc ttggcggagc tctttggagt cttctggaca cccccagatg tgtctgcgat
961 gatcgccccc ttactcggtg atttcgagtt acaaggacct gaggaccttg tagtggagct
1021 cgtccctgtg gtaatggggg ggattggttt ggtgctagga ttcaccaaag aaaagattgg
1081 aaaaatgttg tcatctgctg catccacctt gagagcttgc aaagaccttg gtgcatatgg
1141 gctagagatc ttaaagttag tcatgaagtg gttcttcccg aagaaagagg aagcgaatga
1201 actggccatg gtgagatcca tcgaggatgc agtactggac cttgaggcaa ttgaaaacaa
1261 ccatatgacc accttgctca aagacaaaga tagcctggca acctacatga gaaccctcga
1321 cctcgaggaa gagaaagcca gaaaactctc aaccaagtct gcttcacctg acatcgtggg
1381 cacaattaac gcacttctgg cgagaatcgc cgctgcacgc tccctggtac acagagcaaa
1441 ggaggagctt tccagcagac caagacctgt agtcttgatg atatcaggca ggccaggaat
1501 agggaagacc caccttgcta gggaagtggc taagagaatc gcagcctccc tcacaggaga
1561 ccagcgtgta ggcctcatcc cacgcaatgg cgttgatcac tgggatgcgt acaaggggga
1621 aagggtcgtc ctatgggacg actatggaat gagcaacccc atccatgacg ctctcaggct
1681 gcaagaactc gctgacactt gccccctcac tctaaattgt gacaggattg agaataaagg
1741 aaaggtcttt gacagcgatg tcatcattat cactactaat ctggccaacc cagcaccact
1801 ggactatgtc aactttgagg catgctcgag acgcatcgac ttcctcgtgt acgcagaagc
1861 ccccgaggtc gaaaaggcaa agcgcgactt cccgggccag cctgacatgt ggaaaaacgc
1921 ttttagttct gatttctcac acataaaatt ggcactggct ccacaaggtg gctttgataa
1981 gaacgggaac accccacacg ggaagggcgt catgaagact ctcaccactg gctccctcat
2041 tgctcgggca tcagggctgc tccatgagag attggatgag ttcgagctac agggtccaac
2101 acttaccacc ttcaacttcg accgcaacaa agtgcttgcc ttcaggcagc ttgccgctga
2161 aaataaatat ggattgatgg acacaatgaa agttgggagg cagctcaagg atgtcaaaac
2221 catgccagaa cttaagcaag cactcaagaa tatctcaatc aaaaagtgcc agattgtgta
2281 cagtggttgc acctacacac ttgagtctga tggcaagggc aacgtgagag ttgacagagt
2341 acagagcacc tccgttcaga ccaacaatga gctggctggc gccctgcacc atctaaggtg
2401 cgccaggatc aggtattatg ttaagtgtgt tcaggaagcc ctgtattcta tcatccagat
2461 tgctggggct gcatttgtca ccacgcgcat catcaagcgt gtgaacattc aagacttatg
2521 gtccaagcca caagtggaaa acacagagga ggctaccagc aaggacgggt gcccaaaacc
2581 caaagatgat gaagagttcg tcatttcatc tgacgacatt aaaactgagg gtaagaaagg
2641 gaagaacaag actggccgtg gcaagaagca cacagccttc tcaagtaaag gtctcagtga
2701 tgaagagtat gatgaataca agagaattag agaggaaaga aatggcaagt actccataga
2761 ggagtacctc caggacaggg acaaatacta tgaagaggtg gccattgcca gggcgaccga
2821 ggaagacttc tgtgaagagg aggaggccaa gatccggcaa aggatcttta gaccaacaag
2881 gagacaacgc aaggaagaaa gagcttctct cggtttagtc acaggttctg aaattaggaa
2941 aagaaatcca gaagacttca agcccaaggg gaaactatgg gctgacgatg acagaagtgt
3001 ggactataat gaaaaactca gttttgaggc cccaccaagc atctggtcaa ggatagtcaa
3061 cttcggctca ggttggggct tctgggtctc ccccagcctg ttcataacat caacccacgt
3121 cataccccag ggcgcaaagg agttctttgg agtccccatc aaacaaattc aggtgcacaa
3181 gtcaggtgaa ttctgtcgct tgaggttccc aaaaccaatc aggaccgatg tggctggcat
3241 gatcttggaa gaaggtgcgc ccgaaggcac cgtggtctca ctactcatca aaaggtctac
3301 tggagaactc atgcccctag cagctagaat gggaacccac gcaaccatga agattcaagg
3361 acgcactgtt ggaggccaga tgggcatgct tctgacagga tccaacgcga aaagcatgga
3421 tctaggcacc acaccaggcg attgcggctg tccctacatc tacaaaagag gaaacgacta
3481 tgtagttatt ggggtccaca cggctgccgc tcgtggggga aacactgtca tatgtgccac
3541 ccaggggggt gagggggaag ctacacttga aggtggtgac aataagggaa catactgtgg
3601 tgcaccaatc ctaggtccag ggagtgcccc aaaactcagc accaaaacca aattctggag
3661 atcatccaca gcaccactcc cacctggcac ctatgaacca gcctaccttg gtggcaaaga
3721 ccccagaatc aagggtggcc cctcgctgca gcaagtcatg agggaccaac tgaaaccatt
3781 cacggagcct aggggtaagc caccaaagcc aagtgtgtta gaagctgcca agaaaaccat
3841 catcaatgtc cttgagcaga caattgatcc acctgagaag tggtcgttcg cacaagcttg
3901 cgcgtccctt gataagacca cttctagcgg ccatccgcac cacatgcgga aaaatgactg
3961 ctggaacggg gagtccttca caggtaagct ggcagaccag gcttccaaag ccaatctgat
4021 gtttgaagag gggaagaaca tgaccccagt ctacacaggt gcacttaagg atgaattagt
4081 caaaactgac aaaatttatg gtaagatcaa gaagaggctt ctctggggct cggacctggc
4141 aaccatgatc cggtgtgctc gagcgttcgg aggtctgatg gatgagctca aagcacactg
4201 tgtcacactt cctgtcagag ttggcatgaa tatgaatgag gacggtccca tcatcttcga
4261 gaagcattcc aggtacagat accattatga cgctgattac tctcggtggg actcaacaca
4321 acagagagcc gtgctggcag ctgctctaga aatcatggtt aaattctcct cagaaccaca
4381 tttggctcag gtagtagcag aagaccttct ttctcctagc gtagtggatg tgggtgactt
4441 cacaatatca atcaacgagg gtcttccctc tggggtgccc tgcacctccc aatggaactc
4501 catcgcccac tggcttctca ctctttgtgc gctctccgaa gtcacaaatt tgtctccaga
4561 catcatacag gctaattctc tcttctcctt ctatggtgat gatgaaattg ttagcacaga
4621 cataaaatta gacccagaaa aattaacagc aaaactcaag gaatatgggt tgaaaccaac
4681 ccgccctgac aaaactgaag ggcctcttgt tatttctgaa gacttagacg gtttgacttt
4741 cctgcggaga actgtgaccc gcgacccagc tggttggttt ggaaaactgg agcaaagctc
4801 aatactcagg caaatgtact ggaccagggg ccccaatcat gaagacccat ctgaatcaat
4861 gatcccgcac tctcaaagac ccatacaatt gatgtcctta ctgggagagg ccgcactcca
4921 cggcccaaca ttctacagta aaatcagcaa attagttatt gctgagctta aagaaggtgg
4981 catggatttt tacgtgccca ggcaagagcc aatgttcaga tggatgagat tctcggatct
5041 gagcacgtgg gagggcgatc gcaatctggc tcccagcttt gtgaatgaag atggcgtcga
5101 atgacgccaa cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg
5161 ttatggcttt ggagcccgtt gttggtgccg ctattgcggc gcctgtagcg ggccaacaaa
5221 atgtaattga cccctggatt agaaataatt ttgtacaagc ccctggtgga gagttcacag
5281 tatcccctag aaacgctcca ggtgaaatac tatggagcgc gcccttaggc cctgatctaa
5341 atccctacct atctcatttg gccagaatgt ataatggtta tgcaggtggt tttgaagtgc
5401 aggtgattct cgcggggaac gcgttcaccg ccggaaaaat tatatttgca gcagtcccac
5461 caaattttcc aactgaaggc ttaagcccca gccaggttac tatgttcccc cacataatag
5521 tagatgttag gcaactggaa cctgtgttga tccccttacc tgatgttaga aataacttct
5581 atcattataa tcagtcaaat gattctacca ttaaattgat agcaatgctg tacacaccac
5641 ttagggctaa taatgctggg gatgatgtct ttacagtttc ttgtcgagtt ctcacgagac
5701 catcccccga ttttgatttc atatttttag taccacctac agttgaatca agaactaaac
5761 cattctctgt cccaatttta accgttgaag aaatgaccaa ttcaaggttc cccattcctt
5821 tggaaaagtt gttcacgggc cccagtagta cctttgttgt tcaaccacaa aatggcaggt
5881 gcacgactga tggcgtgctc ctaggtacca cccaactgtc tcctgtcaac atctgcacct
5941 tcagagggga tgtcacccac attgcaggca gtcgtaacta cacaatgaat ctggcttccc
6001 aaaattggaa cagttatgac ccaacagaag aaatcccagc ccctctagga actccagatt
6061 tcgtggggaa gattcaaggt gtgctcaccc aaaccacaag gacagatggc tcgacccgcg
6121 gtcacaaagc tacagtgtac actgggagcg ccgacttttc tccaaaactg ggtagagtcc
6181 aatttgccac tgacacagac aatgatttcg ttgctaacca aaacacaaag ttcaccccag
6241 tcggtgttat ccaggatggt gatactgccc accgaaatga accccaacaa tgggtgctcc
6301 caagctactc aggcagaaac acccataatg tgcacctggc ccccgctgta gcccccactt
6361 ttccgggtga gcaactcctc ttcttcaggt ctaccatgcc cggatgcagc gggtacccca
6421 acatggattt ggactgtctg ctcccccagg aatgggtgca gtatttctac caggaagcag
6481 ccccagcaca atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggttc
6541 tgtttgagtg taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg
6601 atttggttat cccccccaat ggttatttta gatttgattc ctgggtcaac cagttctaca
6661 cacttgcccc catgggaaat gggacggggc gtaggcgtgc attataatgg ctggagcttt
6721 ctttgctgga ttggcatctg acgtccttgg ctctggcctt ggttccctaa tcaatgctgg
6781 ggctggggcc atcaaccaaa aagttgaatt tgaaaataac agaaaattgc aacaggcttc
6841 cttccaattt agtagcaacc tacaacaagc ttcctttcaa catgacaaag agatgctcca
6901 atcacaaatt gaggccacca aaaagttgca acaggaaatg atgagagtta aacaagcaat
6961 gctcctagag ggtggatttt ctgagacaga tgcagctcgt ggggcaatca acgcccccat
7021 gacaaaagtt ttggactgga gcgggacaag gtactgggct cccgatgcta ggactacaac
7081 atataatgca ggccgctttt ccacccctca accctcgggg gcaccaccag gaagaactaa
7141 tcttagggct gctgcccccg cccggggttc ctccagcaca tcttctaact cttctattgc
7201 tacttctgtg tattcaagtc aaaccacttc aacgagactt ggttctacag ctggttctgg
7261 taccagtgtc tcgagcctcc cgtcatctgc aaggactagg agctgggtcg aggatcaaaa
7321 taggaatttg tcacctttca tgaggggggc ccacaacatc tcgtttgtca ccccaccatc
7381 tagcagatcc tctagccaag gcacagtctc aaccgtgccc aaagaagttt tggactcctg
7441 gactggcgct tttaatacgc gcaggcagcc tctcttcgct cacatccgta agcgagggga
7501 gtcacgggtg taatgtgaaa agacaaattg attatctttc ttttctttag tgtctttt
//
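
The ORIGIN block above stores the sequence as numbered lines of 10-base groups. The following is a minimal sketch of turning such lines back into one contiguous string and slicing it with the record's 1-based coordinates. The function names (`parse_origin_lines`, `slice_1based`) are hypothetical, and only the first and last ORIGIN lines of this record are embedded, so the checks are limited to what those two lines cover.

```python
def parse_origin_lines(lines):
    """Join GenBank ORIGIN lines into one sequence string."""
    chunks = []
    for line in lines:
        parts = line.split()
        # Skip blank lines and the '//' record terminator.
        if not parts or parts[0] == "//":
            continue
        # The first token is the 1-based position of the line's first base;
        # the remaining tokens are groups of up to 10 bases.
        chunks.extend(parts[1:])
    return "".join(chunks)


def slice_1based(seq, start, end, offset=1):
    """Return seq[start..end] (1-based, inclusive).

    `offset` is the 1-based position of seq[0] in the full record, which
    allows slicing a fragment that does not start at base 1.
    """
    return seq[start - offset:end - offset + 1]


# First and last ORIGIN lines, copied from the record above.
FIRST_LINE = "1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgctaaca gcaacaacga"
LAST_LINE = "7501 gtcacgggtg taatgtgaaa agacaaattg attatctttc ttttctttag tgtctttt"

head = parse_origin_lines([FIRST_LINE])
tail = parse_origin_lines([LAST_LINE, "//"])

# ORF1 is annotated as 5..5104, so bases 5..7 should be its start codon.
print("ORF1 start codon:", slice_1based(head, 5, 7))
# ORF3 is annotated as 6707..7513, so bases 7511..7513 should be its stop codon.
print("ORF3 stop codon:", slice_1based(tail, 7511, 7513, offset=7501))
# The last line starts at 7501, so its length gives the total record length.
print("record length:", 7500 + len(tail))
```

For a complete record, a maintained parser such as Biopython's `SeqIO` is usually a better choice than hand-rolled string handling. The sketch only illustrates how the line layout maps onto the coordinates.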