Typing tool
|
Complete norovirus genomes
OR051927 | GII.4 Den Haag | ||
---|---|---|---|
GII.P4 Den Haag |
ORF1: 1..5100 ORF2: 5081..6703 ORF3: 6703..7509LOCUS OR051927 7509 bp RNA linear VRL 06-OCT-2023 DEFINITION Norovirus GII isolate GII/Hu/US/2014/GII.4DenHaag[P4]/NIH37.1 nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds. ACCESSION OR051927 VERSION OR051927.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7509) AUTHORS Chaimongkol,N., Dabilla,N., Tohma,K., Matsushima,Y., Behrle Yardley,A., Levenson,E.A., Johnson,J.A., Ahorrio,C., Oler,A.J., Kim,D.Y., Souza,M., Sosnovtsev,S.V., Parra,G.I. and Green,K.Y. TITLE Norovirus Evolves as One or More Distinct Clonal Populations in Immunocompromised Hosts JOURNAL Unpublished REFERENCE 2 (bases 1 to 7509) AUTHORS Chaimongkol,N., Dabilla,N., Sosnovtsev,S.V. and Green,K.Y. TITLE Direct Submission JOURNAL Submitted (26-MAY-2023) LID, NIAID, 50 South Drive/R6318, Bethesda, MD 20892, USA COMMENT ##Assembly-Data-START## Assembly Method :: HIVE Hexagon/Heptagon v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7509 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="GII/Hu/US/2014/GII.4DenHaag[P4]/NIH37.1" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="Nov-2014" /note="genotype: GII.4" gene 1..5100 /gene="ORF1" CDS 1..5100 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="WHW97613.1" /translation="MKMASNDASAAAVANSNDDTAKSSSDKMFSNMAVTFKRALGARP KQPPPREIPQRPPRPPTPELIKKIPPPPPNGEGEVVVSYSAKDGVSGLPELSTVRQPE ETNTAFSVPPLNQRESRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAIS LAKVELTPLSLYWRPVYTPQYLISPDTLKRLSGETFPYTAFDNNCYAFCCWVLDLNDS WLSRRMIHRTTGFFRPYQDWNRKPLPTMDDSKLKKAADIFLCALSSLFTRPIKDIIGK LKPLNIINILASCDWTFAGIVESLILLAELFGGFWTPPDVSAMIAPLLGDFELQGPED LVVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KKEEVDELAMVRSIEDAILDLEAIENNHMTTLLKDKDSLATYMKTLDIEEEKARKLST KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREI AKRIATSLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC PLTLNCDRIENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK AKRDFPGQPDMWKNAFSSDFSHIKLTLAPQGGFDKNGNTPHGKGVMKTLTMGSLTARA SGLLHERLDEFELQGPTLTTFNFDRNKVLAFRQLAAENKYGLMDTMKVGKRLKDVKTM PELKQALKNTSIKKCQIVYSGCTYTLESDGNGNVKIDKVQSTSVQTNNELTGALHHLR CARVRYYVKCVQEALYSILQIAGAAFVTARIIKRVNIQDLWSKPQVENTEEATSKDGC PRPKDDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV TGSEIRKRNPEDFKPRGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP SLFITSSHVIPQGAKEFFGVPIKQIQVHKSGEFCHLRFPKPIRTDVTGMILEEGAPEG TVVTILIKRSTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD CGCPYVYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG PGSAPKLSTKTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP RGKPPKPSVLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCW NGESFTGKLADQASKANLMFEQGKNMTPIYTGALKDELVKTDKIYGTIKKRLLWGSDL ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD STQQRAVLAAALEIMVKFSSEPHLAQVVAEDLLSPSVMDVGDFTISINEGLPSGVPCT SQWNSIAHWLLTLCAISETTNLSPDIVQANSQFSFYGDDEIVSTDIKLDPEKLTAKLK EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP NHEDPSESMIPHSQRPVQLMSLLGEAALHGPTFYSKISKLVITELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide 1..990 /gene="ORF1" /product="p48" mat_peptide 991..2088 /gene="ORF1" /product="NTPase" mat_peptide 2089..2625 /gene="ORF1" /product="p22" mat_peptide 2626..3024 /gene="ORF1" /product="VPg" mat_peptide 3025..3567 /gene="ORF1" /product="Pro" mat_peptide 3568..5097 /gene="ORF1" /product="RdRp" gene 5081..6703 /gene="ORF2" CDS 5081..6703 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WHW97614.1" /translation="MKMASNDANPSDGSAANLVPEVNNEVMALEPVAGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD VRNNFYHYNQSKDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFTVPILTVEEMTNSRFPISLDGLFTGPSNALVVQPQNGRCTTDGVLLGTT QLSPVNICTFKGDVTHIAGSHNFTMNLASLNWNNYDPTEEIPAPLGTPDFVGKIHGML TQTTRGDGSTRGHKATVYTGSADFTPKLGSVQFNTDTENDFDPHQNTKFTPVGVIQDG NTTHRNEPQQWELPSYSGRGAQNVHLAPAVAPTFPGEQLLFFRSTLPGCSGYPNMDLD CLLPQEWVQHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYITVAHTGQHNFV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6703..7509 /gene="ORF3" CDS 6703..7509 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WHW97615.1" /translation="MAGALFAGMASDVLSSGLGSLINAGAGAINQKIDLENNKELQQA SFQYSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAVLLEGGFSETDAARGAIN APMTKALDWNGTRYWAPNANTTTYNTGHFSTPQSSGALSGRFNPRIPTPARGSSNTSS NASATTSVYSNQTVSTRLGSTAGSGTNVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI SFVTPPSSKSSSQGTVSTVPKEILDSWTGAFNTRKQPLFAQLRRRGESRV" ORIGIN 1 atgaagatgg cgtctaacga cgcttccgct gccgctgttg ctaacagcaa cgacgacacc 61 gcaaaatctt caagtgacaa aatgttttct aacatggctg tcacttttaa acgagccctc 121 ggggcacggc ctaaacagcc ccccccgagg gaaataccac aaagaccccc acgaccacct 181 actccagagc tgatcaaaaa gatccctcct cccccaccca acggggaggg tgaagtagtg 241 gtttcttaca gtgccaaaga tggtgtttcc ggtttgcctg agctttccac cgtcaggcaa 301 ccggaagaaa ccaacacggc cttcagtgtc cctccgctca accagaggga aagcagggat 361 gctaaggagc cactgactgg aacaattctg gaaatgtggg atggggaaat ctaccattat 421 ggcctgtatg ttgagcgagg tcttgtactg ggtgtgcaca aaccaccagc tgccattagc 481 ctcgccaagg tcgaactaac accactctcc ttgtactgga gacctgtgta caccccccag 541 tacctcatct ctccagacac tctcaagagg ttaagcggag aaacatttcc ctatacagcc 601 tttgacaaca actgctatgc tttttgttgc tgggtcctgg atctaaacga ctcatggctg 661 agtaggagaa tgatccatag aacaactggc ttcttcagac cctaccaaga ttggaataga 721 aaacccctcc ccactatgga tgattccaaa ttgaagaagg cagctgacat attcctgtgt 781 gccctgtctt cgctgttcac tagacccata aaagacataa taggaaagtt aaagcctctc 841 aatatcatca acatcctggc ttcatgtgat tggactttcg caggcatagt ggaatcctta 901 atactcttgg cagagctctt tggaggtttc tggacacccc cagatgtgtc tgcgatgatc 961 gcccccttac tcggtgattt cgagttacaa ggacctgagg accttgtagt ggaactcgtc 1021 cccgtagtga tgggggggat tggtttggtg ctgggattca ccaaagagaa gattgggaaa 1081 atgttgtcat ctgcagcatc caccttgaga gcttgtaaag accttggtgc atatgggtta 1141 gagatcctaa agttagtcat gaaatggttc ttcccgaaga aagaggaagt ggatgaactg 1201 gctatggtga gatctattga ggatgcaata ttggaccttg aggcgattga gaacaaccat 1261 atgaccacct tgctcaaaga caaagacagc cttgcaacat acatgaaaac ccttgacatc 1321 gaggaagaaa aagccaggaa actctcaacc aagtctgctt cacctgacat cgtgggcaca 1381 atcaacgccc tcctggcgag aatcgccgct gcacgttccc tggtgcaccg agcgaaggag 1441 gagctttcca gcagaccaag acccgtggtc ttgatgatat caggcagacc aggaataggg 1501 aagacccacc ttgctaggga aatagccaag agaatcgcaa cctccctcac aggagaccag 1561 cgcgtgggcc tcatcccacg caacggcgtc gatcactggg atgcgtacaa gggggagagg 1621 gtcgtcctgt gggacgacta tggcatgagc aaccccatcc acgacgccct caggctgcaa 1681 gaactcgctg acacttgccc cctcactcta aattgtgaca ggattgagaa taaaggaaag 1741 gtttttgaca gcgatgtcat catcataact actaatctgg ccaacccagc accactggac 1801 tatgtcaact ttgaagcatg ctcaaggcgc attgacttcc tcgtgtatgc agaggccccc 1861 gaggtcgaaa aggcaaagcg tgacttcccg ggccaacctg acatgtggaa aaacgctttt 1921 agttcagatt tctcccacat aaaactgaca ctagctccac aaggtggctt tgacaagaac 1981 gggaacaccc cacacgggaa gggcgtcatg aagactctca ccatgggctc cctcactgcc 2041 cgggcatcag ggctgctcca tgagagactg gatgagtttg aactacaagg cccaactctc 2101 accaccttca actttgaccg caacaaagtg cttgccttca ggcagcttgc tgctgaaaac 2161 aaatatgggt tgatggacac aatgaaagtc gggaagcggc tcaaggatgt caaaaccatg 2221 ccagaactta aacaagcact caagaacacc tcaatcaaga aatgccaaat tgtgtacagt 2281 ggttgcacct acacactcga gtctgatggc aatggtaatg tgaaaattga caaagttcag 2341 agcacctccg tccagaccaa caatgagctg actggcgccc tgcaccatct aaggtgcgcc 2401 agagtcaggt actatgtcaa gtgtgttcag gaagccctct attctatcct ccagattgct 2461 ggggctgcat ttgttaccgc gcgcatcatc aagcgtgtga acattcaaga cttatggtcc 2521 aagccacaag tggaaaacac agaggaggct accagcaagg acgggtgccc aaggcccaaa 2581 gacgatgagg agttcgtcat ttcatctgac gacattaaaa ctgagggtaa gaaagggaag 2641 aataaaactg gtcgtggcaa gaagcacaca gccttctcga gtaaaggtct cagtgatgaa 2701 gagtatgatg agtacaagag aattagagag gaaagaaatg gcaagtattc catagaagag 2761 taccttcagg acagggacaa atactatgag gaggtggcca tcgccagggc gaccgaggaa 2821 gacttctgtg aagaggagga ggccaagatc cggcaaagga tcttcaggcc gaccaggaaa 2881 caacgcaagg aagaaagggc ttctctcggt ttagtcacag gttctgaaat taggaaaaga 2941 aacccagaag atttcaagcc cagggggaaa ctatgggctg acgatgacag gagtgtggac 3001 tataatgaaa aactcagttt tgaggcccca ccaagcatct ggtcaagaat agtcaacttt 3061 ggctcaggtt ggggcttctg ggtctcccct agcctgttca taacgtcatc ccacgtcata 3121 ccccagggtg caaaggagtt ctttggagtc ccaataaaac aaatccaggt gcacaagtcg 3181 ggcgaattct gtcacttgag attcccaaaa ccaatcagga ctgatgtgac tggtatgatc 3241 ttggaagaag gtgcgcccga aggcaccgtg gtcacaatac tcatcaaaag atccactgga 3301 gaactcatgc ccctagcagc tagaatgggg acccatgcaa ccatgaaaat ccaaggacgc 3361 accgttggag gccaaatggg catgcttcta acaggatcca acgccaaaag catggatcta 3421 ggcaccacac caggtgattg cggctgtccc tatgtctaca agagaggaaa tgactacgtg 3481 gtcattggag tccacacggc tgctgctcgt gggggaaaca ctgtcatatg tgcaacccaa 3541 gggagtgagg gggaagccac acttgaaggt ggtgacagta aggggacata ctgtggtgca 3601 ccaatcctag gcccagggag tgccccaaaa cttagcacca aaaccaagtt ctggagatca 3661 tccacagcac cgctcccacc tggcacctat gaaccagcct accttggtgg caaagacccc 3721 agagtcaagg gtggcccctc gctgcagcaa gttatgaggg accagttaaa accatttaca 3781 gagcctaggg gtaaaccacc aaagccaagt gtgttggaag ctgccaagaa aaccatcatc 3841 aatgtccttg agcagacaat tgacccacct gagaagtggt cgttcgcaca agcttgcgcg 3901 tcccttgaca agaccacttc tagcggccat ccgcaccaca tgcggaaaaa cgactgctgg 3961 aacggggaat ccttcacagg caagctggca gaccaggctt ccaaggccaa cctgatgttt 4021 gaacagggaa agaacatgac cccaatctac actggtgcac ttaaggatga gttagtcaaa 4081 actgacaaaa tttatggcac tatcaagaag aggcttctct ggggctcgga cttagcaacc 4141 atgatccggt gcgctcgagc atttgggggc ttgatggatg aactcaaagc acactgtgtc 4201 acactccctg tcagagttgg tatgaacatg aatgaagatg gccccatcat cttcgagaag 4261 cactccaggt acaaatacca ttatgatgct gattactctc ggtgggattc aacacaacag 4321 agggccgtgc tggcagctgc tctagaaatc atggttaaat tctcctcaga accacatttg 4381 gctcaggtgg tggcagaaga tcttctctcc cctagcgtga tggatgtagg tgacttcaca 4441 atatcaatca acgagggtct tccctctggg gtgccctgca cctcccaatg gaactccatc 4501 gcccactggc tcctcaccct ctgtgcaatc tccgagacca caaatttgtc cccagacatc 4561 gtgcaggcta actctcaatt ctccttctat ggtgatgatg aaattgtcag tacagatata 4621 aaattggacc cagaaaagtt aacagcaaag ctcaaggaat atgggctgaa accaacccgc 4681 cctgacaaaa ctgaaggacc tctcgtcatt tctgaagact tagacggttt gactttcctg 4741 cggagaactg tgacccgcga cccagctggt tggtttggaa aactggaaca gagttcaata 4801 ctcaggcaaa tgtactggac taggggtccc aaccacgaag acccatctga atcaatgatc 4861 ccacactctc aaaggcccgt acaattgatg tccttgctgg gagaggccgc actccacggc 4921 ccaacattct acagtaaaat cagcaaattg gtcattacag agctcaaaga aggtggcatg 4981 gatttttacg tgcccagaca agaaccaatg ttcaggtgga tgagattctc agatctgagc 5041 acgtgggagg gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgaatga 5101 cgccaaccca tctgatgggt ccgcagccaa cctcgtccca gaggtcaaca atgaggttat 5161 ggctttggag cccgtcgccg gtgccgctat tgcggcacct gtagcgggcc aacaaaatgt 5221 aattgacccc tggattagaa ataattttgt acaagcccct ggtggagaat tcacagtatc 5281 ccctagaaac gctccgggtg aaatactatg gagcgcgccc ttaggtcctg atctgaatcc 5341 ctacctatct catttggcca gaatgtataa tggttatgca ggtggttttg aagtgcaggt 5401 gatcctcgcg gggaacgcgt tcaccgcggg aaaaattata tttgcagcag tcccaccaaa 5461 ttttccaact gaaggcttga gtcccagtca ggttactatg ttcccccaca taatagtaga 5521 tgttaggcaa ttagaacctg tgttaattcc cttacctgat gttaggaaca atttctacca 5581 ttataatcag tcaaaagatt ccaccatcaa attgatagca atgctgtata caccacttag 5641 ggccaataat gctggggatg atgtcttcac agtctcctgt cgagttctca cgagaccatc 5701 ccctgatttt gactttatat tcttggtacc acccacagtt gagtcaagaa ctaaaccttt 5761 cactgtccca atcctgactg ttgaagaaat gaccaattca agattcccca tttctctgga 5821 cgggttgttt acgggtccca gcaatgccct tgttgtccaa ccacaaaatg gcagatgcac 5881 gactgatggc gtgcttttgg gcaccaccca actgtctcct gttaacatct gtactttcaa 5941 aggggacgtc acccacattg caggttctca caatttcaca atgaatttgg cttctctaaa 6001 ttggaacaat tatgacccaa cagaagaaat tccagcccct ttgggaactc cagatttcgt 6061 gggcaagatc catggtatgc tcactcaaac tacaagagga gatggctcga cccggggcca 6121 caaggccaca gtatacactg gaagtgctga cttcactcca aagctgggca gtgttcaatt 6181 taacactgat acagaaaatg attttgaccc tcaccaaaac acaaaattca ccccagtcgg 6241 tgtcatccag gatggtaaca ccacccaccg aaatgaaccc caacaatggg agctcccaag 6301 ttattcaggt agaggtgccc aaaatgtaca cctagcccct gctgtggccc ccactttccc 6361 gggtgaacaa cttcttttct tcagatccac tctccccgga tgcagcgggt atcccaacat 6421 ggatttagat tgcctactcc cccaagagtg ggtgcagcac ttctaccagg aagcagctcc 6481 agcacaatct gatgtggctc tattaagatt tgtgaatcca gacacgggta gggtcctgtt 6541 tgaatgcaaa cttcataaat caggctatat cacagtggct cacaccggcc agcataattt 6601 tgtcatcccc cccaatggct acttcaggtt tgactcctgg gttaatcaat tctacacact 6661 tgcccccatg ggaaatggaa cggggcgtag gcgtgctcta taatggcagg agctctcttt 6721 gctggaatgg catctgatgt ccttagctct ggacttggtt ccctaatcaa tgctggggct 6781 ggggctatca accaaaagat tgatcttgaa aataacaaag aattgcagca agcttccttc 6841 cagtatagca gtaacctgca gcaggcctcc tttcaacatg ataaagagat gctccaagca 6901 caaattgaag ccactaaaag gttgcaacag gaaatgatga aagtcaaaca ggcagtgctc 6961 ttagagggtg gattctctga aacagatgca gcccgtgggg caatcaacgc ccccatgaca 7021 aaggctttgg attggaacgg aacgaggtac tgggccccta atgctaatac cacaacatac 7081 aatacaggcc acttttccac tccgcaatct tcgggggcgc tgtcaggaag atttaatccc 7141 aggattccca cccccgctcg gggctcctct aatacatctt ctaatgcttc tgctaccact 7201 tctgtgtatt caaatcaaac tgtttcaacg agacttggtt ctacagctgg ttctggcacc 7261 aatgtctcga gtctcccgtc aactgcaagg actaggagtt gggttgagga tcaaaacaga 7321 aacttgtcac ctttcatgag gggggcccac aacatatcgt ttgtcacccc accatctagc 7381 aaatcctcta gccaaggcac agtctcaacc gtgcctaaag aaattttgga ctcctggact 7441 ggcgctttca acacgcgcaa gcagcctctc ttcgcccaac ttcgtaggcg aggggagtca 7501 cgggtgtaa //