Typing tool
|
Complete norovirus genomes
OR051787 | GII.4 New Orleans | ||
---|---|---|---|
GII.P4 New Orleans |
ORF1: 1..5100 ORF2: 5081..6700 ORF3: 6700..7506LOCUS OR051787 7551 bp RNA linear VRL 06-OCT-2023 DEFINITION Norovirus GII isolate GII/Hu/US/2011/GII.4NewOrleans[P4]/NIH11.1 nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds. ACCESSION OR051787 VERSION OR051787.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7551) AUTHORS Chaimongkol,N., Dabilla,N., Tohma,K., Matsushima,Y., Behrle Yardley,A., Levenson,E.A., Johnson,J.A., Ahorrio,C., Oler,A.J., Kim,D.Y., Souza,M., Sosnovtsev,S.V., Parra,G.I. and Green,K.Y. TITLE Norovirus Evolves as One or More Distinct Clonal Populations in Immunocompromised Hosts JOURNAL Unpublished REFERENCE 2 (bases 1 to 7551) AUTHORS Chaimongkol,N., Dabilla,N., Sosnovtsev,S.V. and Green,K.Y. TITLE Direct Submission JOURNAL Submitted (26-MAY-2023) LID, NIAID, 50 South Drive/R6318, Bethesda, MD 20892, USA COMMENT ##Assembly-Data-START## Assembly Method :: HIVE Hexagon/Heptagon v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7551 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="GII/Hu/US/2011/GII.4NewOrleans[P4]/NIH11.1" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="Nov-2011" /note="genotype: GII.4" gene 1..5100 /gene="ORF1" CDS 1..5100 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="WHW76521.1" /translation="MKMASNDASAAAVANSNNDTAKSSSDGVLSSMAVTFKRALGARP KQPPPREKSQRPPRPPTPELVKSIPPPPPNGEDEIVVSYSVKDGVSGLPDLSTVRQPE ESNTAFSVPPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAIS LARIELAPLSLYWRPVYTPQYLISPDTLKKLSGETFPYTAFDNNCYAFCCWVLDLNDS WLSRRMIQRTTGFFRPYQDWNRKPLPTTDDSKIKKVANIFLCALSSLFTRPIKDIIGK IRPLNILNILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED LAVELVPVVMGGIGLVLGFTKEKVGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KKEEANELAIVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREV AKRIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC PLTLNCDRIENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK AKRDFPGQPDMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA SGLLHERLDEFELQGPALTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGKQLKDVKTM PELKQALKNVSIKKCQIVYSGCTYTLESDGKGNVKVDRIQSAAVQTNNELTGALHHLR SARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQDLWSKPQVENTEETTSKDGC PKPKDDEEFVISSDDIKAEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV TGSEIRKRNPDDFKPKGKLWADDDRNVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP SLFITSTHVIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG TVATLLIKRSTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILG PGSAPTLSTKTKFWRSSTASLPPGTYEPAYLGGKDPRVKGGPSLQQVMREQLKPFTEP RGKPPKPSVLEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCW NGESFTGKLADQASKANLMFEEGKNMTPVYTAALKDELVKTDKIYGKIKKRLLWGSDL ATMIRCARAFGGLMDELKAHCVTLPIRVGMNMNEDGPIIFERHSRYTYHYDADYSRWD STQQRAVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCT SQWNSIAHWLLTLCALSEVTNLSPDTIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLR EYGLKPTRPDKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP NHGDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide 1..990 /gene="ORF1" /product="p48" mat_peptide 991..2088 /gene="ORF1" /product="NTPase" mat_peptide 2089..2625 /gene="ORF1" /product="p22" mat_peptide 2626..3024 /gene="ORF1" /product="VPg" mat_peptide 3025..3567 /gene="ORF1" /product="Pro" mat_peptide 3568..5097 /gene="ORF1" /product="RdRp" gene 5081..6700 /gene="ORF2" CDS 5081..6700 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WHW76522.1" /translation="MKMASSDANPSDGSTANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLDPVLIPLPD VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHISGSHNYTMNLASQNWNSYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFSPKLGRVQFATDTDNDFETNQNTKFTPVGVIQDG GTHQNEPQQWELPSYSGRNIHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMNLDC LLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHRSGYVTVAHTGQHDLVI PPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6700..7506 /gene="ORF3" CDS 6700..7506 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WHW76523.1" /translation="MAGAIFAGLASDVLSSGLSSLINAGAGAINQTIEFENNRKLQQA SFQFSSNLQQASFQHDKEMLHAQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN APMTKTLDWSGTRYWAPDARTTTYNAGRFSTPQPSGAPPGRANLRATVPARGSSSTSS NSSIATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRV" ORIGIN 1 atgaagatgg cgtctaacga cgcttccgct gccgctgttg ctaacagcaa caacgacacc 61 gcaaaatctt caagtgacgg agtgctctct agcatggctg tcacttttaa acgagccctc 121 ggggcgcggc ctaaacagcc tcccccgagg gaaaaatcac aaagaccccc acggccacct 181 actccagaac tggttaaaag tattcctcct cccccaccca acggagagga tgaaatagtg 241 gtttcttata gtgtcaaaga tggtgtttcc ggcttgcctg acctttccac cgtcaggcaa 301 ccggaagaat ctaacacggc cttcagtgtc cctccactca atcagaggga gaacagagat 361 gctaaggaac cactcactgg aacaattctg gaaatgtggg acggggaaat ctaccattat 421 ggcctgtatg tggagcgagg tcttgtactg ggtgtgcaca aaccaccagc tgccatcagc 481 ctcgctagga ttgagctagc accactctcc ttatactgga gacctgtgta cactcctcag 541 tacctcatct ccccagacac tctcaagaaa ttgtccggag aaacgttccc ctacacagcc 601 tttgacaaca actgttatgc cttttgttgc tgggtcctgg acctaaatga ctcgtggctg 661 agcaggagaa tgatccagag aacaactggt ttcttcaggc cctaccaaga ctggaatagg 721 aaaccccttc ccactacgga tgactccaaa ataaagaagg tagctaacat attcctgtgt 781 gctctgtcct cgctgttcac cagacccata aaagatataa tagggaagat aagacctctt 841 aacatcctca acatcttagc ctcatgtgat tggacttttg caggtatagt ggagtccctg 901 atactcttgg cagaactctt tggagttttc tggacacccc cagatgtgtc tgcgatgatt 961 gcccccttac tcggtgacta cgagctacaa ggacctgagg accttgcagt tgagctcgtc 1021 cccgtggtga tggggggaat tggtttggtg ctaggattca ccaaagagaa ggttgggaaa 1081 atgttgtcat ctgctgcgtc caccttgaga gcttgtaaag accttggtgc atatgggctg 1141 gagatcctaa agttggtcat gaagtggttc ttcccgaaga aggaagaggc aaatgagctg 1201 gctatagtga gatctatcga ggatgcagtc ttggatctcg aggcaattga aaacaaccat 1261 atgaccacct tgctcaaaga caaagacagt ctggcaacct atatgagaac acttgacctt 1321 gaggaggaga aagccaggaa actctcaacc aagtctgcct cacccgatat cgtgggcaca 1381 atcaacgccc tcctggcgag aatcgctgcc gcacgctctc tggtacaccg agcgaaggag 1441 gagctttcca gcagaccaag acctgtggtg ttgatgatat caggcaggcc aggaataggg 1501 aagacccacc tcgctaggga agtggctaag agaatcgcag cctcccttac aggagaccag 1561 cgtgtgggcc tcatcccacg caatggcgtc gaccattggg atgcgtacaa gggggagagg 1621 gtcgtcctat gggacgatta tggaatgagc aaccctattc acgatgccct caggctgcaa 1681 gaactcgctg acacttgtcc cctcactcta aactgtgaca ggatcgaaaa caaaggaaag 1741 gtctttgaca gcgatgtcat cattatcacc actaatctgg ccaacccagc accactggac 1801 tatgtcaact ttgaagcatg ctcgaggcgc atcgacttcc tcgtgtatgc agaagcccct 1861 gaagtcgaaa aggcgaagcg tgacttccca ggccagcctg acatgtggaa gaacgctttc 1921 agttctgact tctcacacat aaaattagca ctggccccac agggtggttt cgacaagaac 1981 gggaacaccc cacacggaaa gggcgttatg aagactctca ccactggctc ccttattgcc 2041 cgggcatcag ggctactcca tgagaggtta gatgaatttg aactgcaggg cccagccctc 2101 accaccttca atttcgatcg caataaggtg cttgccttta gacagcttgc tgctgaaaac 2161 aaatatggat tgatggacac aatgagagtt gggaaacagc tcaaggatgt caaaaccatg 2221 ccagagctca aacaagcact caagaatgtc tcaatcaaga agtgccaaat agtgtatagt 2281 ggttgcacct acacacttga gtctgatggc aagggcaatg tgaaagttga cagaatccaa 2341 agcgccgccg tgcagaccaa caatgagctg actggtgccc tacaccattt gaggagcgcc 2401 agaatcagat actatgtcaa gtgtgtccag gaggccctgt attccatcat ccaaatcgct 2461 ggggctgcat ttgtcaccac gcgcattgcc aagcgcatga acatacaaga cctatggtcc 2521 aagccacaag tggaaaacac agaggagact accagcaagg atgggtgccc aaaacctaag 2581 gacgatgagg agtttgtcat ttcatccgac gacatcaaag ctgagggtaa gaaagggaag 2641 aacaagactg gccgcggcaa gaagcacaca gcattttcaa gcaaaggcct tagtgatgaa 2701 gagtacgatg agtacaagag gattagagaa gaaaggaatg gcaagtactc catagaagag 2761 taccttcagg acagggacaa atactatgag gaggtggcca ttgccagggc gactgaggaa 2821 gacttctgtg aagaggagga ggccaagatc cggcaaagga tctttaggcc aacaaggaaa 2881 caacgcaagg aggaaagagc ctccctcggt ctggtcacag gctctgaaat taggaaaaga 2941 aacccagatg acttcaaacc caaagggaaa ttgtgggctg acgatgacag gaatgtagac 3001 tacaatgaga aactcagttt tgaggcccca ccaagcatct ggtcgagaat agtcaacttt 3061 ggttcaggct ggggattttg ggtctccccc agtctgttca taacatcaac ccatgtcata 3121 ccccagggcg cgaaggagtt ctttggagtc cccatcaaac aaatacaggt acacaagtcg 3181 ggcgagttct gtcgccttag attcccaaaa ccaatcagga ctgatgtgac gggcatgatc 3241 ttagaagaag gcgcacctga gggcaccgtg gccacgctac tcatcaaaag gtccactggg 3301 gaactcatgc ccctagcagc taggatgggg acccatgcga ccatgaagat ccaagggcgc 3361 actgttggag gccagatggg tatgcttctg acaggttcca acgccaagag catggacctg 3421 ggtactacac caggtgattg tggctgcccc tatatctaca agagaggtaa tgactatgtg 3481 gtcattgggg tccacacggc tgccgcacgt ggggggaaca ctgtcatatg tgccacccag 3541 gggagtgagg gagaggctac acttgaaggt ggtgacaaca agggaacata ctgtggtgca 3601 ccaatcctag gcccagggag tgccccaaca cttagcacca aaaccaaatt ctggagatcg 3661 tccacagcat cactcccacc tggcacctat gaaccagcct atcttggtgg caaggaccct 3721 agggtcaagg gtggtccttc actgcagcaa gtcatgaggg aacagttgaa gccattcaca 3781 gagcccaggg gcaagccacc aaaaccaagt gtattagaag ctgccaagaa gaccatcatc 3841 aatgtccttg agcaaacaat tgatccacct gagaaatggt cgttcgcaca agcttgcgcg 3901 tcccttgaca agaccacttc cagtggtcat ccgcaccaca tgcggaaaaa cgactgctgg 3961 aacggggagt ccttcacagg caagctggca gaccaagctt ccaaggccaa cctgatgttt 4021 gaagaaggga agaacatgac cccagtctac acagctgcgc tcaaggatga gttagttaaa 4081 actgacaaaa tttatggtaa gatcaagaag aggcttctct ggggctcgga cttggcgacc 4141 atgatccggt gtgctcgagc attcggaggc ctaatggatg aactcaaagc acactgtgtc 4201 acacttccta ttagagttgg catgaatatg aatgaggatg gccccatcat cttcgagagg 4261 cattccaggt acacatatca ctatgatgct gattactctc gatgggattc aacacaacag 4321 agagccgtgt tggcagcagc tctagaaatc atggttaaat tctccccaga accacacttg 4381 gctcaggtag tcgcggagga ccttctttct cctagcgtgg tggacgtggg cgacttcaca 4441 atatcaatca acgagggtct tccctctggg gtgccctgca cctcccaatg gaactccatc 4501 gcccactggc ttctcactct ctgtgcactc tctgaagtca caaacctgtc ccctgatacc 4561 atacaggcta actccctctt ctctttttat ggtgatgatg aaattgttag cacagacata 4621 aaattggacc cagaaaaatt aacagcaaag ctcagagaat atgggttaaa accaacccgc 4681 cctgacaaaa ctgaaggacc ccttgtcatc tctgaagacc tgaatggcct aactttcctg 4741 cggagaaccg tgacccgcga cccagctggt tggtttggaa aactggagca gagttcaata 4801 ctcaggcaaa tgtactggac taggggtccc aaccatggag acccatctga aacaatgatc 4861 ccacactccc aaagacccat acaattgatg tccctactgg gggaggccgc tctccacggc 4921 ccagcatttt acagcaaaat cagcaaattg gtcattgcag agctaaaaga aggtggcatg 4981 gatttttacg tgcccagaca agagccaatg ttcagatgga tgagattctc agatctgagc 5041 acgtgggagg gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgagtga 5101 cgccaaccca tctgatgggt ccacagccaa ccttgtccca gaggtcaaca atgaggttat 5161 ggctttggag cccgtagttg gtgccgccat tgcggcacct gtagcgggcc aacaaaatgt 5221 aattgacccc tggattagaa ataattttgt acaagcccct ggtggagagt ttacagtatc 5281 ccctagaaac gctccaggtg aaatactatg gagcgcgccc ttaggccctg atttgaatcc 5341 ctacctttcc catttggcca gaatgtacaa tggttatgca ggtggttttg aagtgcaggt 5401 aatcctcgcg gggaacgcgt tcaccgccgg gaaaatcata tttgcagcag tcccaccaaa 5461 tttcccaact gaaggtttga gccccagcca ggtcactatg ttcccccaca taatagtaga 5521 tgttaggcaa ttggatcctg tgttgattcc cttacccgat gttaggaata atttctacca 5581 ttataatcaa tcaaatgatt ccaccatcaa attgatagca atgttgtata caccacttag 5641 ggctaataat gccggggacg atgtcttcac agtttcttgt cgagtcctca cgagaccatc 5701 ccccgatttt gattttatat ttttggtgcc acccacagtt gaatcaagaa ccaaaccatt 5761 ctctgtccca gttttaactg ttgaggagat gaccaattca aggttcccca ttcctttgga 5821 aaagttgttc acgggcccca gtagtgcctt tgtcgttcaa ccacaaaacg gcaggtgcac 5881 gactgatggc gtgctcctag gtaccaccca actgtctccc gtcaacatct gcaccttcag 5941 aggggatgtc acccatattt caggcagtca taactacaca atgaatttgg cctcccaaaa 6001 ttggaacagt tacgatccaa cagaagaaat cccagcccct ctaggaactc cagatttcgt 6061 gggaaagatt caaggtgtgc tcacccaaac cacaaggaca gatggctcga cccgcggcca 6121 caaagctaca gtgtacactg ggagcgccga cttctctcca aaactgggta gagttcaatt 6181 tgccactgac acagacaatg attttgaaac taaccaaaac acaaagttca ccccagtcgg 6241 tgttatccag gatggtggta cccaccaaaa tgaaccccaa caatgggagc tcccaagtta 6301 ctcaggcaga aacattcaca atgtgcacct ggcccccgct gtagccccca ctttcccggg 6361 cgagcagctc ctcttcttca gatctactat gcccggatgc agcgggtacc ccaacatgaa 6421 cttggattgt ctgctccccc aggagtgggt gcagtatttc taccaggagg cagccccagc 6481 acaatctgat gtggctctgc taagatttgt gaatccggat acaggtaggg ttttgtttga 6541 gtgtaagctt catagatcag gctatgtcac agtggctcac actggccaac atgatttggt 6601 tatccccccc aatggttatt ttagatttga ttcctgggtc aaccagttct acacacttgc 6661 ccccatggga aatggaacgg ggcgtagacg tgcattataa tggctggagc tatctttgct 6721 ggattggcat ctgacgtcct tagctctgga cttagttccc taatcaatgc tggggctggg 6781 gccatcaacc aaacaattga atttgaaaat aacagaaaac tgcaacaagc ttccttccag 6841 tttagtagca atctacaaca ggcttccttt caacatgaca aagagatgct ccatgcacaa 6901 attgaggcca ccaaaaagtt gcaacaggaa atgatgagag ttaaacaagc aatgctccta 6961 gagggtggat tctctgagac agatgcagcc cgcggggcaa tcaacgcccc catgacaaaa 7021 actttggact ggagcggaac aaggtactgg gctcccgatg ctaggactac aacatataat 7081 gcaggccgct tttccacccc ccaaccctcg ggggcaccac caggaagagc taatcttagg 7141 gctactgtcc ccgcccgggg ttcctctagc acatcttcta attcttctat tgctacttct 7201 gtgtattcaa atcaaaccac ctcaacgaga cttggttcta cagctggttc tggtaccagt 7261 gtctcgagcc tcccgtcaac tgcaaggact aggagttggg ttgaggatca aaataggaat 7321 ttgtcacctt tcatgagggg ggcccataac atctcgtttg tcaccccacc atctagcaga 7381 tcctctagcc aaggcacagt ctcaaccgtg cccaaagaag ttttggactc ctggactggc 7441 gctttcaaca cgcgcaggca gcctctcttt gctcacattc gtaagcgagg ggagtcacgg 7501 gtgtaatgtg aaaagaccaa attgattatc tttcttttct ttagtgtctt t //