Typing tool
|
Complete norovirus genomes
OR069406 | GII.14 | ||
---|---|---|---|
GII.P7 |
ORF1: 1..5094 ORF2: 5075..6685 ORF3: 6685..7479LOCUS OR069406 7525 bp RNA linear VRL 06-OCT-2023 DEFINITION Norovirus GII isolate GII/Hu/US/2019/GII.14[P7]/NIH129.1 nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds. ACCESSION OR069406 VERSION OR069406.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7525) AUTHORS Chaimongkol,N., Dabilla,N., Tohma,K., Matsushima,Y., Behrle Yardley,A., Levenson,E.A., Johnson,J.A., Ahorrio,C., Oler,A.J., Kim,D.Y., Souza,M., Sosnovtsev,S.V., Parra,G.I. and Green,K.Y. TITLE Norovirus Evolves as One or More Distinct Clonal Populations in Immunocompromised Hosts JOURNAL Unpublished REFERENCE 2 (bases 1 to 7525) AUTHORS Chaimongkol,N., Dabilla,N., Sosnovtsev,S.V. and Green,K.Y. TITLE Direct Submission JOURNAL Submitted (26-MAY-2023) LID, NIAID, 50 South Drive/R6318, Bethesda, MD 20892, USA COMMENT ##Assembly-Data-START## Assembly Method :: HIVE Hexagon/Heptagon v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7525 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="GII/Hu/US/2019/GII.14[P7]/NIH129.1" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="May-2019" /note="genotype: GII.14" gene 1..5094 /gene="ORF1" CDS 1..5094 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="WID03734.1" /translation="MKMASNDASAAFGSQKPVNDSNNATPSDKEEVGTFSNIKVGFKK MLGAVPKGTKAPSSDQHCPTVKIGPKTLTIPPEPPNGEDIVQFDAKSETVIGLPDLTT VQNEHENTPYTVPPLSEREHRPATEPLPGTILEMWDGEFYHYSVYVSDGKALGVHKPP AAISLATIELTPISLYWRPVYTPNYLVSPDTLKGLAGEKFPYTAFSNNCYNFCCWVLE LNDTWLNRRCISRTTGFFKPYQSWNRKPLPTVDDGKIKKVANAILCALGSLFSRPIKD LLGKLKPLNLLHLLASCDWTFAGIVETVILMAELFNVFWTPPDVSNFIASLIGDFELQ GPEDLAVELVPVVMGGIGMVLGFTAEKIGRMLSSAASTLRACKDLGNYALDILKLVMK WFFPKKEEKAEMETLRAIEDAVLDMEAIGNNHLTTLLKDKDSLTAFMKTLDLEEEKAR KLSTKSSSPDIVGTINAILARIAAARSLLHKAKEEMFSRIRPVVVMISGRPGIGKTHM ARHLAKSIANTMSGDQRVGLVPRNGVDHWDAYRGERVVLWDDYGMGSPVKDALTLQEL ADTCPVTLNCDRIENKGKMFDSDVIIITTNLVNPAPLDYVNFEACSRRVDFLVYAESP EIEKVKRDFPGQPDMWKDHFKSDFSHIKLTLAPQGGFDKNGNTPHGKGTMRSLTQGSL TARVAGLVHERKDEFQLQGNDLQTYNFDTNRVSAFRKLAADNKYGIMETMRVGAALKS VKTLEDLKFALRDVKFNECEIIYKSSKYRVSSNGKGSVSVDKVEDNASQTVNEVHSAL LRLRQARARYYISCFQDLVYTLIQVAGASFVVSRISRRFCWERWVKPTETQEVNEPEK EVAQGRWEIEPKDTEPEGKKGKNKKGRGKKHTAFSSKGLSDEEYDEFKRIREERNGKY SIEEYLQDRDRYYEEVAVARATEEDFCEEEEAKIRQRIFRPTRKQRKEERGVLGLVTG SDIRKRRPDDFQPKGNPWADDTRSVDYNERLDFEAPPSVWSRIVPLGTGWGFWVSSNL LITTTHVLPKGIKELFGVEIKQIQIHKSGEFCRFRFPRPIRPDVTGLVLEEGAPEGTV CSVLVKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDCG CPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGNEDLGTYCGAPILGPG KAPKLSTKTKFWRSSPDALPPGTYEPAYLGGKDPRVEKGPSLHQVMRDQLRPFTEPRG KPPRPAILEEAKKTVMNVLEQTIDPAKPWSYSQACASLDKTTSSGSPHHVKKNDHWNG ESFTGPLADQASKANLMYEQAKHVQPVYTAALKDELVKTDKIYKKIKKRLLWGSDLGT MIRCARAFGGLMDSMKASCIALPCRVGMNMNEDGPIIFDKHSKYRYHYDADYSRWDST QQRSILSAAMEVMVRFSAEPELAQVVAEDLLAPSQLDVGDFVISVQEGLPSGVPCTSQ WNSIAHWILTLSAMAEVSGLSPDVIQAHSCFSFYGDDEIVSTDINLDPMKLTQKLREY GLVPTRPDKTEGPLVITEDLTGLTFLRRSIARDPAGWFGKLDQDSILRQLYWTRGPNH ENPYESMVPHSQRATQLMALLGEASLHGPQFYKKVSKMVINEIKSGGLEFYVPRQEAM FRWMRFSDLSTWEGDRNLAPEGVNEDGVE" mat_peptide 1..1002 /gene="ORF1" /product="p48" mat_peptide 1003..2100 /gene="ORF1" /product="NTPase" mat_peptide 2101..2619 /gene="ORF1" /product="p22" mat_peptide 2620..3018 /gene="ORF1" /product="VPg" mat_peptide 3019..3561 /gene="ORF1" /product="Pro" mat_peptide 3562..5091 /gene="ORF1" /product="RdRp" gene 5075..6685 /gene="ORF2" CDS 5075..6685 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WID03735.1" /translation="MKMASNDATPSDDGAAGLVPEINNEVMALEPVAGASIAAPVVGQ QNIIDPWIRNNFVQAPAGEFTVSPRNSPGEVLLDLELGPELNPYLAHLARMYNGHAGG MEVQIVLAGNAFTAGKILFAAIPPSFPYENLSPAQLTMCPHVIVDVRQLEPVHLPMPD IRNVFYHYNQDNSSRLRLVAMLYTPLRANNSGDDVFTVSCRVLTRPSPDFQFTFLVPP TVESKTKNFTLPVLRVSEMTNSRFPVVLDQMYTSRNENIIVQPQNGRCTTDGELLGTT ILQSVSICNFKGTMQAKLNEEPQYQLQLTNLDGSPIDLTDDMPAPLGTPDFQAMLYGV ASQRSSRDNATRAHDAQIDTAGDTFAPKIGQVRFKSSSSDFDLHDPTKFTPIGVNVDD QHPFRQWSLPSYGGHLALNNHLAPAVTPLFPGEQILFFRSYIPSAGGHTDGAMDCLLP QEWIEHFYQEAAPSQSDIALVRFINPDTGRVLFEAKLHKQGFLTIAASGDHPIVMPTN GYFRFEAWVNPFYTLAPVGTGSGRRRIQ" gene 6685..7479 /gene="ORF3" CDS 6685..7479 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WID03736.1" /translation="MAGAFLAGLAGDVVTSSLGSLVGAGANAINQKVEYDFNRQLQEA SFRHDKDMLKAQVAATTGLQQAMIDIKREALTAGGFSPADAARGAVKAPMTQILDWNG SRYWAPNSMKTTGYSGTFSSQGIRSSPHLSNQTPSVVKSTRSLPSISSSSSVYSSPST APSQSTQSTTLSAGTGSSRPDTVSTKTSTLSRTSDWVRGQNEMLDPFMSGALQTAYVT PPSSKASSQGTVSTVPKAVLDSWTPMFNTHRQPLFAHLRRRGESQI" ORIGIN 1 atgaagatgg cgtctaacga cgcttccgct gcctttggca gtcaaaaacc tgtcaatgac 61 agtaataatg ccaccccttc tgacaaagaa gaggttggta cattctccaa cattaaagtt 121 ggattcaaga aaatgctggg tgccgtaccc aaaggtacta aggcacccag cagtgatcag 181 cattgtccca cagttaagat cgggcctaaa acattaacaa tcccccctga acccccaaat 241 ggtgaagaca tcgtacaatt tgatgcaaag tcagaaactg taattgggct accggacttg 301 acaacggtgc aaaacgaaca cgaaaacact ccatacactg tccccccatt aagtgagagg 361 gagcacagac cagctaccga accacttcct ggcacaatat tggagatgtg ggatggtgag 421 ttttaccatt actccgtgta cgtcagcgat ggcaaagctc tgggggtcca taaaccaccg 481 gcggctataa gtctcgcgac gatagaactc acccccatat cactctactg gagacccgtc 541 tacactccca actacttggt ctccccagac acattaaagg gcctcgccgg tgagaagttc 601 ccctacacgg ccttcagcaa caactgttat aacttctgct gttgggtgct tgagctcaac 661 gacacatggc ttaacagaag atgcatatcc agaactaccg gcttctttaa accataccag 721 tcttggaata ggaaacctct cccaactgtt gatgatggaa agatcaaaaa ggtggccaac 781 gcgatcctct gtgcactcgg ctcgctgttt tcgagaccaa tcaaagatct attaggtaaa 841 ctcaagccat tgaatctact acacttgctt gcatcctgtg attggacgtt tgcgggtata 901 gtagagacag tcatcttaat ggcggagctt ttcaatgtat tctggacccc gccagatgtc 961 tctaatttta tagcctccct gataggtgat tttgagttgc aaggtcctga agacttggct 1021 gtggagctcg ttcccgtggt tatgggtggc atagggatgg ttctcggctt caccgctgag 1081 aagataggcc gcatgctgtc gtccgctgca tcaacactac gggcgtgcaa agacctaggg 1141 aactatgccc ttgacatact caaattggtc atgaaatggt tcttcccaaa gaaggaagag 1201 aaagctgaga tggagacctt gagggcgatt gaagatgctg tccttgatat ggaagctata 1261 gggaataacc acctcacaac ccttctcaag gacaaagata gcctcacggc ttttatgaaa 1321 accctggatc tagaagagga gaaagcgagg aaactgtcca ctaaatcatc ttcaccggac 1381 atagtcggca ccatcaacgc catattagct agaatagctg ctgctagatc cctactacat 1441 aaggccaaag aggagatgtt tagcaggatt agaccagtag ttgtcatgat ctcaggcaga 1501 cctggcattg gaaaaaccca catggcaaga cacttggcta agagcatcgc caacaccatg 1561 agtggcgatc agagggtcgg actcgtcccg cgcaatggtg tcgatcactg ggacgcctac 1621 agaggagaga gagtggtctt gtgggacgac tacggtatgg ggagccctgt caaagacgcc 1681 ctgacactgc aagaattggc cgacacatgt ccagtcacct taaattgtga taggattgag 1741 aataagggga agatgtttga cagtgatgtt atcataatca cgaccaacct agtcaacccc 1801 gcgcccctcg actatgtgaa cttcgaggcg tgttccagga gagttgactt cctggtctat 1861 gcagagtcac cagaaattga aaaggtcaag agagacttcc ccggccagcc tgacatgtgg 1921 aaagaccatt ttaagtcaga cttttcccac attaagctga ctctagcccc ccagggtgga 1981 tttgacaaga atggcaacac cccacatggc aagggcacca tgcggtccct aacccagggg 2041 tccctaaccg cgagagttgc aggccttgtc catgagagga aggacgagtt ccagctccaa 2101 ggaaatgacc ttcaaacata caattttgac accaacagag tctccgcatt tagaaaattg 2161 gccgcagaca acaagtatgg gatcatggag acgatgagag ttggcgcagc gctcaagagt 2221 gtgaagacct tggaagatct taaatttgca ttgagggatg taaaattcaa tgaatgtgaa 2281 ataatttaca agagttctaa gtaccgagtc tcttccaacg gtaaaggttc agtctctgtt 2341 gacaaagttg aagacaatgc ttcccagact gttaacgagg tgcattcagc actcctcaga 2401 ctcaggcagg cgagggctag atattacatc agctgcttcc aagacctcgt ctacactctc 2461 atacaggttg ccggggcatc atttgttgtc agtaggatct ctagaaggtt ctgctgggaa 2521 aggtgggtca aaccgaccga gacccaggag gtgaatgagc ccgaaaagga agtggcccaa 2581 ggcaggtggg aaattgagcc caaggacaca gaaccagaag gcaagaaggg taagaacaag 2641 aaaggaaggg gcaagaaaca tacagccttc tctagcaaag gtctgagtga tgaagagtat 2701 gacgagttca agaggattag ggaagaaaga aatgggaagt actccattga ggaatacctg 2761 caggaccgag accgctatta tgaagaagta gcagttgctc gggcgacaga ggaggacttc 2821 tgtgaagagg aagaggccaa gataagacaa aggattttcc gcccgacaag gaaacaaagg 2881 aaggaggaaa ggggtgtgct cggcttggtc accggctcag acatcaggaa gagaagacca 2941 gacgattttc aaccaaaagg caatccatgg gcagatgaca ccagaagtgt ggactacaat 3001 gagaggcttg attttgaagc acccccgagt gtctggtcaa gaatagtccc actaggcacc 3061 ggttgggggt tttgggtttc atccaacctt ctgattacaa caacacacgt cctacctaaa 3121 gggattaagg agctctttgg agttgaaatc aaacaaatcc aaattcataa gtctggagag 3181 ttttgcaggt ttagattccc aagacctatc aggccagatg tcacaggact cgtgctggag 3241 gagggcgccc cagagggcac tgtttgctct gtgctcgtga aaagacccac gggtgaaatg 3301 atcccccttg cagtgaggat gggtacacat gcatccatga aaatacaggg caggaccgtt 3361 ggtggccaga tgggaatgct cctcacaggg gcaaacgcaa agaacatgga tcttggcacc 3421 ggccctggtg actgcggttg cccctacatc tacaagcgtg gcaacgacat cgttgttgcg 3481 ggtgtccaca ccgcagcagc ccggggaggc aacactgtaa tatgtgctac ccaggggcaa 3541 gatggggaag cagtccttga gggaaatgag gaccttggca cctactgtgg tgccccaatt 3601 ctgggccctg gcaaggcgcc caaactcagc acgaagacca agttttggcg ctcatcacca 3661 gatgccctgc cacctggcac atatgaacct gcttacctag gaggcaaaga ccctagagtg 3721 gaaaaaggtc cctccctgca tcaagtcatg agagaccagt tgaggccctt cacagaaccc 3781 agaggcaaac cgcctagacc tgcaattttg gaggaagcca aaaagacagt aatgaatgtc 3841 ctagaacaaa ccattgaccc cgccaagcca tggtcctatt cgcaggcatg tgcctcattg 3901 gacaaaacca cttcaagtgg tagtccccac cacgttaaga aaaatgacca ctggaatggg 3961 gagtccttca ctggtcccct tgcagaccag gcttctaaag ccaacctcat gtatgagcag 4021 gctaaacatg tgcagcccgt gtatacggct gcgcttaaag atgagcttgt taagactgac 4081 aaaatctaca agaagataaa gaagaggctt ctatgggggt cagatcttgg cacaatgatc 4141 agatgcgcca gggcttttgg tggtctcatg gatagtatga aggcaagttg catagctctt 4201 ccttgcaggg tgggaatgaa catgaatgaa gatggcccca tcatatttga caaacactct 4261 aagtacaggt accactatga tgctgactat tctaggtggg actcaaccca acaaaggagc 4321 atcctctctg ccgctatgga agtgatggtg cggttttctg ctgaaccaga attggcacaa 4381 gtggttgcag aggacctttt ggcacctagt cagcttgatg ttggcgactt tgtcatctca 4441 gtccaggaag gcctgccatc aggagtccca tgcacatcgc aatggaattc aatagcacac 4501 tggatactca ctttgagcgc gatggcagaa gtatcaggcc tctcaccaga tgttatccaa 4561 gctcactctt gtttttcctt ttatggggac gatgagatcg tcagcactga catcaacctt 4621 gaccctatga aattgacaca aaaacttagg gagtatggcc tggtccccac ccggcctgac 4681 aaaactgaag gtcccctcgt cataactgaa gacctcaccg gcctgacgtt ccttcgtagg 4741 tcgattgctc gggacccagc tggctggttt ggaaaactag accaagattc aatcctcagg 4801 caactgtact ggacaagggg gcctaaccat gagaacccgt acgaaagcat ggttcctcat 4861 tctcagcggg ccacacaact catggccctt ctcggtgaag cctcattgca tggtcctcag 4921 ttttacaaga aagttagcaa aatggtcatc aatgaaatta agagtggtgg tctggaattt 4981 tacgtgccca gacaagaggc catgttcaga tggatgagat tctctgacct cagcacgtgg 5041 gagggcgatc gcaatcttgc tcccgagggt gtgaatgaag atggcgtcga atgacgctac 5101 tccatctgat gatggtgcag ccggcctcgt accagagatc aacaatgagg ttatggctct 5161 tgaacccgtt gctggggcct ccattgcagc ccccgtagtc ggtcaacaga atataattga 5221 tccctggatt agaaataatt ttgtacaagc ccctgctggt gaatttacag tttcccctag 5281 aaactctcct ggagaagtcc tacttgattt ggaattgggt cctgaactta acccctatct 5341 tgcacacttg gccagaatgt acaatgggca cgcaggagga atggaagtgc agatagtact 5401 ggctgggaat gcgttcacag ctggtaagat cctatttgcc gccatcccac ctagcttccc 5461 ttatgaaaat ttgtcacctg cccaattgac tatgtgcccc catgtgatag tggatgtgag 5521 acaattggaa ccagtacact tgccaatgcc agatataaga aatgttttct atcattataa 5581 tcaggataat agttccagac ttaggcttgt agctatgctt tatactcctc tgagggccaa 5641 taattcaggt gatgatgtgt tcacggtgtc ttgtcgcgtc ctaacgcgcc cttctccaga 5701 tttccagttc actttcctgg tcccacctac agttgagtct aaaactaaga acttcaccct 5761 ccccgttctc agagtctcag agatgacaaa ctcaaggttc cccgttgttt tggaccaaat 5821 gtacacaagc aggaatgaaa acatcattgt ccaaccccaa aacggcagat gcacaactga 5881 tggtgagctg cttggtacca ctatcttaca atctgtgtct atttgcaatt tcaaaggaac 5941 aatgcaggca aagctgaatg aagaaccaca ataccaatta caactcacca acttggatgg 6001 gtcacccata gatctaacag atgatatgcc tgcccctctt ggcacaccag acttccaggc 6061 catgttatat ggcgttgcaa gccaacgctc ttccagagat aatgccacca gggcacatga 6121 tgcacagatt gacactgcgg gtgacacatt cgccccgaaa attggccagg ttcgatttaa 6181 atcaagctcc agtgattttg atctacatga ccctacaaaa ttcacaccta ttggtgtcaa 6241 tgtggatgac cagcacccct ttagacagtg gtccctgcca agctacggtg gtcaccttgc 6301 cctgaataac catttagccc cagctgtgac accgctcttt cctggcgagc agatcttgtt 6361 ctttaggtca tacattccaa gtgccggagg ccatacagac ggtgctatgg actgtttgct 6421 gccccaagag tggatagagc acttctacca ggaggcggct ccttcccaat ctgacatcgc 6481 actggtaagg ttcatcaacc ctgacacagg aagagtgctc tttgaagcta aattgcataa 6541 acaaggtttc ctcacaattg cagcatctgg agaccacccc attgtgatgc ccactaatgg 6601 ttactttagg tttgaagctt gggttaatcc tttctatact ctcgcccccg tgggaactgg 6661 gtctgggcgc aggaggatcc aataatggct ggagctttcc ttgcaggttt ggcaggtgac 6721 gttgtgactt ccagcttggg ctcactcgtg ggtgctggag ccaacgccat caaccagaaa 6781 gtggagtatg acttcaacag gcaactccag gaggcatctt ttagacatga taaggatatg 6841 ctcaaagccc aggttgcagc aacaacaggt ctccaacagg ccatgataga catcaagcgg 6901 gaggcattga ccgcaggcgg cttttccccc gctgacgctg caagaggcgc agtcaaagca 6961 cctatgacgc agatccttga ttggaatggt tctagatatt gggctcctaa ctccatgaag 7021 acaactggct actcaggtac attttcttct caaggcatta gaagttcccc tcacctgtct 7081 aatcaaaccc caagtgttgt taaatctact agatctttac catctatatc tagttcttct 7141 agtgtgtata gttctccttc tactgccccc tcgcaatcaa cacaatcaac aacgctgtct 7201 gcagggacag ggtcctccag acctgacact gtgtccacaa agacttccac attgtcgagg 7261 acaagtgatt gggtcagagg tcaaaatgag atgcttgacc cgttcatgag tggtgcactc 7321 caaactgctt atgtcacacc accatcgagc aaggcctcat cacaagggac ggtctcgacc 7381 gtacccaagg ctgttttgga ctcctggact cccatgttta acacccacag gcagcctctc 7441 ttcgcccact tacgtaggcg aggggagtca cagatttagt gaaaaggatg attagggttt 7501 cttcctttcc ccttcttatt ctttt //