Typing tool
|
Complete norovirus genomes
OR069405 | GII.6 | ||
---|---|---|---|
GII.P7 |
ORF1: 1..5094 ORF2: 5075..6718 ORF3: 6718..7494LOCUS OR069405 7546 bp RNA linear VRL 06-OCT-2023 DEFINITION Norovirus GII isolate GII/Hu/US/2015/GII.6[P7]/NIH53.1 nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds. ACCESSION OR069405 VERSION OR069405.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7546) AUTHORS Chaimongkol,N., Dabilla,N., Tohma,K., Matsushima,Y., Behrle Yardley,A., Levenson,E.A., Johnson,J.A., Ahorrio,C., Oler,A.J., Kim,D.Y., Souza,M., Sosnovtsev,S.V., Parra,G.I. and Green,K.Y. TITLE Norovirus Evolves as One or More Distinct Clonal Populations in Immunocompromised Hosts JOURNAL Unpublished REFERENCE 2 (bases 1 to 7546) AUTHORS Chaimongkol,N., Dabilla,N., Sosnovtsev,S.V. and Green,K.Y. TITLE Direct Submission JOURNAL Submitted (26-MAY-2023) LID, NIAID, 50 South Drive/R6318, Bethesda, MD 20892, USA COMMENT ##Assembly-Data-START## Assembly Method :: HIVE Hexagon/Heptagon v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7546 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="GII/Hu/US/2015/GII.6[P7]/NIH53.1" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="Dec-2015" /note="genotype: GII.6" gene 1..5094 /gene="ORF1" CDS 1..5094 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="WID03731.1" /translation="MKMASNDASAAFGSQNSVNDSINTAPSGKEEVGTFSNIKVGFKK MLGAVPKGTKAPSGDQHCPSVKIGTKTLTVPPEPPNGEDTVQFDAKSGTVRGLPDLTT VQNEHENTPYTVPPLSEREHRPATEPLPGTILEMWDGEFYHYSVYVSGGKALGVHKPP AAISLATIELTPISLYWRPVYTPNYLVCPDTLKGLAGEKFPYTAFSNNCYNFCCWVLE LNDTWLSRRSISRTTGFFKPYQSWNRKPLPTVDDGKIKKVANAILCALGSLFSKPIKD LLGKLKPLNLLHLLASCDWTFAGIVETVILMAELFNIFWTPPDVSSFIASLIGDFELQ GPEDLAVELVPVVMGGIGMVLGFTAEKIGRMLSSAASTLRACKDLGNYALDILKLVMK WFFPKKEEKAGMETLRAIEDAVLDMEAIGNNHLTTLLKDKDSLAAFMKTLDLEEEKAR KLSTKSSSPDIVGTINAILARIAAARSLLHKAKEEMFSRIRPVVVMISGRPGIGKTHM ARHLAKSIANTMSGDQRVGLVPRNGVDHWDAYRGERVVLWDDYGMGNPVKDALTLQEL ADTCPVTLNCDRIENKGKMFDSDVIIITTNLVNPAPLDYVNFEACSRRVDFLVYAESP EIEKVKRDFPGQPDMWKDHFKPDFSHIKLTLAPQGGFDKNGNTPHGKGTMRSLTQGSL TARVAGLVYERRDEFQLQGNDLQTYNFDTNRVSAFRKLAADNKYGIMETMRVGTALKS VKTLEDLKVALRDVKFNECEIIYRNSKYRVSSNGKGSVSVDKVEDQTSQTANEVGAAL LRLRQARARYYVSCFQDLVYTLIQVAGASFVVNRISKRFCWERWVKPTETQETSESEK EVAQGRWEIEPKDTEPEGKKGKNKKGRGKKHTAFSSKGLSDEEYDEFKRIREERNGKY SIEEYLQDRDRYYEEVAVARATEEDFCEEEEAKIRQRIFRPTKKQRKEERGVLGLVTG SDIRKRRPDDFQPKGNLWADDTRSVDYNERLDFEAPPSVWSRIVPLGTGWGFWVSSNL LITTTHVLPKGIKELFGVEIKQIQIHKSGEFCRFRFPRPIRPDVTGLVLEEGAPEGTV CSILVKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDCG CPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGNENLGTYCGAPILGPG KAPKLSTKTKFWRSSPDALPPGTYEPAYLGGKDPRVEKGPSLQQVMRDQLKPFTEPRG KPPRPAVLEEAKRTVMNVLEQTIDPAKPWTYSQACASLDKTTSSGSPHHVRKNDHWNG ESFTGPLADQASKANLMYEQAKHVQPVYTAALKDELVKTDKIYKKIKKRLLWGSDLGT MIRCARAFGGLMDSMKASCIALPCRVGMNMNEDGPIIFDKHSKYRYHYDADYSRWDST QQRNILSAAMEVMVRFSAEPELAQVVAEDLLAPSQLDVGDFVISVQEGLPSGVPCTSQ WNSIAHWILTLSAMAEVSGLSPDVVQAHSCFSFYGDDEIVSTDINLDPMKLTQKLREY GLVPTRPDKTEGPLVITEDLTGLTFLRRSIARDPAGWFGKLDQDSILRQLYWTRGPNH ENPYESMVPHSQRATQLMALLGEASLHGPQFYKKVSKMVINEIKSGGLEFYVPRQEAM FRWMRFSDLSTWEGDRNLAPEGVNEDGVE" mat_peptide 1..1002 /gene="ORF1" /product="p48" mat_peptide 1003..2100 /gene="ORF1" /product="NTPase" mat_peptide 2101..2619 /gene="ORF1" /product="p22" mat_peptide 2620..3018 /gene="ORF1" /product="VPg" mat_peptide 3019..3561 /gene="ORF1" /product="Pro" mat_peptide 3562..5091 /gene="ORF1" /product="RdRp" gene 5075..6718 /gene="ORF2" CDS 5075..6718 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WID03732.1" /translation="MKMASNDAAPSNDGAANLVPEATNEVMALEPVVGASIAAPVVGQ QNIIDPWIRENFVQAPQGEFTVSPRNSPGEMLLNLELGPELNPYLSHLSRMYNGYAGG MQVQVVLAGNAFTAGKIIFAAVPPHFPVENISAAQITMCPHVIVDVRQLEPVLLPLPD IRNRFFHYNQENTPRMRLVAMLYTPLRANSGEDVFTVSCRVLTRPAPDFEFTFLVPPT VESKTKPFTLPILTLGELSNSRFPAPIDMLYTDPNEAIVVQPQNGRCTLDGTLQGTTQ LVPTQICSFRGTLISQTSRSADSTDSAPRVRNHPLHVQLKNLDGTPYDPTDEVPAVLG AIDFKGTVFGVGSQRNTTGNSIGATRAHEVHIDTTNPRYTPKLGSVLMHSESTDFDDG QPIRFTPIGMGADDWHQWELPEYSGHLTLNMNLAPAVAPAFPGERVLFFRSVVPSAGG YGSGHIDCLIPQEWVQHFYQEAAPSQSAVALIRYVNPDTGRNIFEAKLHREGFITVAN SGNNPIVVPPNGYFRFEAWVNQFYTLTPMGTGQGRRRVQ" gene 6718..7494 /gene="ORF3" CDS 6718..7494 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WID03733.1" /translation="MAGAFLAGLAGDAITSGVGSLINAGANAINQKVEYDFNKQLQLA SFKHDKEMLQSQVLATKQLQQEMMNIRQGVLSAGGFSPTDAARGAVNAPMTKILDWNG TRYFAPNSMKTTSYSGQFSSNPVHRQPTLPRSDPPKIKVNSDSASVYSSASTAPSQST HSTTLTAGSSSSRTTSSSAVASTLSRTSDWVRGQNERLSPFMDGALQTTFVTPPSSKA SSYGSVSTVPKAVLDSWTPMFNTHRQPLFAHVRRRGESQI" ORIGIN 1 atgaagatgg cgtctaacga cgcttccgct gcctttggca gccaaaactc tgtcaatgac 61 agtatcaaca ccgccccttc tggtaaagaa gaggttggta cattttccaa catcaaagtt 121 ggttttaaga aaatgctggg tgccgttccc aaggggacca aagcacccag tggtgatcag 181 cactgcccct cagttaagat cgggaccaag acattaacag ttccccctga gcccccaaat 241 ggtgaagaca ccgtgcagtt tgatgcaaag tcgggaactg tgcgcgggct accagacttg 301 acaacggtgc agaatgaaca cgaaaataca ccatacactg tccccccatt gagtgaaagg 361 gagcacagac cagccactga accgcttcct ggcacaatac tggagatgtg ggatggtgaa 421 ttctaccact actctgtgta cgtcagcggt ggcaaagccc taggggttca caagcccccc 481 gcagcgataa gcctcgcgac gatagagctc acacccatat ctctttattg gaggcccgtt 541 tacaccccca attatctggt ctgtccagac acgttgaaag gccttgctgg tgagaagttc 601 ccttacacgg ccttcagcaa caactgttac aacttctgtt gctgggtgct tgagcttaat 661 gacacgtggc ttagcaggag aagcatctct aggactaccg gcttcttcaa accatatcag 721 tcttggaata ggaaacccct tccaaccgtt gatgatggaa agatcaaaaa ggtggccaac 781 gcaattctct gcgcgcttgg atcactgttt tcaaaaccaa tcaaggacct attgggcaaa 841 cttaaaccat tgaacttact gcacttgctc gcatcgtgtg actggacgtt tgcgggtata 901 gtagagacag tcatcctaat ggcggagctt tttaacatct tttggacacc gccagatgtt 961 tccagtttca tagcctccct gataggtgat tttgaattgc aagggcctga ggacctggct 1021 gtggagcttg tccccgtggt catgggtggc ataggaatgg ttctcggctt tactgctgag 1081 aagataggcc gcatgctgtc gtccgctgca tcaacactac gggcatgcaa agacttgggg 1141 aattatgccc ttgacatatt gaaactggtt atgaagtggt tcttcccaaa gaaagaagag 1201 aaagccggga tggaaaccct gagggcaatt gaggatgccg tcctagacat ggaagctata 1261 ggaaacaatc acctcacaac ccttctcaag gacaaggaca gcctcgcggc tttcatgaag 1321 actctagatc tagaagagga gaaggcaagg aagctgtcca ccaaatcatc ctcgccagac 1381 attgtcggca ctatcaatgc catactggcc agaatagctg ctgccaggtc cctcttgcac 1441 aaggctaaag aagaaatgtt tagcaggatc agacccgtgg ttgtcatgat ctcgggtaga 1501 cctggcattg gcaaaaccca catggcaaga cacttggcta agagcatcgc caacaccatg 1561 agcggcgacc agagagtggg gctcgtccca cgcaacgggg tcgatcattg ggacgcctac 1621 aggggagaga gggtggtctt atgggatgat tatggcatgg gaaaccctgt caaagatgcc 1681 ctaacactgc aagagttggc cgacacctgc ccagtaactc taaactgtga taggatcgag 1741 aataagggga agatgtttga cagtgatgtc atcataatca cgaccaattt agtcaaccca 1801 gcgcccctcg attatgtaaa cttcgaggcg tgctccagaa gggtcgactt tctggtctat 1861 gcagagtcac cagaaattga gaaggttaag agagattttc ccggccaacc tgacatgtgg 1921 aaggaccact ttaagccaga cttttcccac ataaaactga ctctagcccc acagggtgga 1981 tttgataaga atggcaacac cccccatggc aagggcacca tgcgatccct aacccagggg 2041 tccctgactg caagggttgc aggtcttgtt tatgagagga gagacgagtt tcagctccag 2101 gggaacgatc ttcaaacata caactttgac accaacaggg tctctgcatt taggaagctg 2161 gccgcagaca acaagtatgg gatcatggag actatgagag tgggcacagc gcttaagagt 2221 gtcaagacct tggaagatct caaagttgca ctgagggacg tgaagttcaa tgagtgtgag 2281 ataatttaca gaaactccaa ataccgagtc tcctccaatg gtaagggttc tgtctctgtt 2341 gacaaagttg aggaccaaac atcccagact gcaaacgaag tgggtgcagc actccttaga 2401 ctcaggcagg caagagccag atactatgtc agctgcttcc aagatctcgt ttacactcta 2461 atacaggtcg ccggagcatc attcgttgtc aacaggatct ctaaaaggtt ctgctgggag 2521 agatgggtca aaccaaccga gacccaggaa acgagtgaat ccgaaaagga agtggcccaa 2581 ggcagatggg agattgagcc caaggacaca gaaccagaag gtaagaaggg taagaacaag 2641 aaaggaagag gcaagaaaca cacagctttc tctagcaaag gcctgagtga tgaagagtat 2701 gacgagttca agagaatcag agaagaaagg aacggaaaat actccattga ggaatacctg 2761 caagaccggg atcgctacta tgaggaagtg gcagttgccc gggcaacaga ggaggacttc 2821 tgtgaagagg aggaggccaa gataagacaa agaatctttc gccccacaaa gaaacagagg 2881 aaggaagaaa ggggggtgct tggtttggtc actggctcag acatcagaaa gagaagacca 2941 gacgattttc aaccgaaggg taacctgtgg gcggacgaca ccaggagtgt ggactacaac 3001 gagagacttg atttcgaggc tcccccgagc gtctggtcaa gaatagtccc actgggcact 3061 ggctggggct tctgggtctc atcaaacctc ctgatcacaa caacacacgt cctgcctaag 3121 gggattaagg aactttttgg agttgaaatc aaacagattc aaattcataa gtctggagag 3181 ttctgcaggt tcagattccc gagaccaatt agaccagatg tcacagggct cgtgttggag 3241 gaaggcgctc cagaaggcac tgtctgctcc atactcgtaa agagacctac aggtgagatg 3301 atccccctgg cagtgaggat gggcacgcat gcatccatga aaatacaggg taggaccgtc 3361 ggtggccaga tgggaatgct cctcacaggg gcaaatgcga agaacatgga tcttggcacc 3421 ggtcctggtg actgcggttg cccttacatc tacaaacgtg gcaatgatat agttgtcgcg 3481 ggtgtccaca ccgcagcagc ccggggaggc aacactgtta tatgtgccac tcaagggcaa 3541 gatggggaag cagtccttga ggggaatgag aaccttggca cttactgcgg cgccccaatc 3601 ttgggtccag gcaaggcgcc caaacttagc acgaagacca agttctggcg ctcatcacca 3661 gacgctttgc cgccaggcac ttatgagcct gcctacttgg gaggtaagga ccccagagtg 3721 gaaaaaggac catccctgca gcaagtcatg agggaccagc taaaaccttt cacagaaccc 3781 aggggcaagc cacctagacc tgcagtctta gaagaggcca agaggacagt gatgaatgtt 3841 ctggaacaaa ccattgatcc tgctaagccg tggacctact cccaagcatg tgcctcactg 3901 gacaagacca cttccagtgg tagcccccat catgtcagga agaatgatca ttggaatggg 3961 gaatccttca ctggccccct tgcagaccaa gcatccaaag ccaacctcat gtacgagcag 4021 gccaaacatg tgcagcccgt gtacacggcc gcacttaaag atgaactagt caaaactgac 4081 aagatctaca aaaagataaa gaaaaggctc ttatgggggt cggatcttgg tacaatgatc 4141 aggtgtgcca gggcctttgg cggcctcatg gatagcatga aggcaagttg catagccctc 4201 ccatgtaggg taggaatgaa catgaatgaa gatggtccca tcatatttga caaacattct 4261 aagtataggt atcattatga tgctgactat tccaggtggg actcaaccca gcaaaggaac 4321 attctctcgg ccgctatgga agtgatggtg cggttctctg ccgaaccaga gctggcacaa 4381 gtggttgcag aggacctcct ggcacccagc caactagatg ttggcgattt cgtcatctca 4441 gtccaggagg gtctgccatc aggggttcca tgcacatcac aatggaattc aatagcacac 4501 tggatcctaa ccttgagtgc aatggcagaa gtgtcgggtc tctcaccaga tgttgttcaa 4561 gcccactcct gcttctcatt ctacggtgat gatgagatcg tcagcactga catcaacctt 4621 gatcccatga agttgacaca gaaactcaga gagtatggcc tggtccccac tcggcctgac 4681 aaaactgagg gtcccctcgt aataactgaa gatctcaccg gcctaacgtt cctgcgtagg 4741 tcaattgcac gggacccagc tgggtggttt gggaaactgg accaagactc aatcctcaga 4801 cagttgtact ggacaagggg ccccaatcat gagaacccat atgagagcat ggtccctcat 4861 tcccagcggg ccacacagct tatggccctt ctcggtgagg cttcactgca tggcccccag 4921 ttttacaaga aggttagcaa gatggtcatt aatgagatca aaagcggtgg tctggaattc 4981 tatgtgccca gacaagaggc catgttcaga tggatgagat tctctgacct cagcacatgg 5041 gagggcgatc gcaatcttgc tcccgagggt gtgaatgaag atggcgtcga atgacgctgc 5101 tccatcgaat gatggcgctg ccaacctcgt accagaggcc accaatgagg ttatggcact 5161 tgagccggtg gtgggagctt caatcgctgc tcctgtcgtc ggccaacaaa atataattga 5221 cccctggatt agagaaaatt ttgttcaggc accacagggt gagtttactg tttcaccaag 5281 gaactcacct ggtgagatgc ttctaaatct tgaattaggc cctgagctca acccttatct 5341 gagtcacttg tcccgcatgt ataatggcta tgctggtggt atgcaggttc aggtggtcct 5401 agctgggaat gcgttcacag ctggtaaaat catctttgcc gccgtcccac cacatttccc 5461 tgttgagaac attagtgcag cccaaattac aatgtgcccc catgtaattg ttgatgtaag 5521 acagcttgag ccagtgcttt tgcccctccc tgacataagg aatagattct ttcattataa 5581 tcaggaaaac accccccgga tgagacttgt ggctatgttg tacacacccc ttagagcaaa 5641 ttctggtgag gatgtgttta ctgtgtcttg cagagttctg acccgccctg cccccgactt 5701 tgagtttacg tttttggtgc caccaactgt agagtcaaag actaagccct ttacactccc 5761 tattctgact cttggagagc tgtccaactc cagattcccc gctccgatag acatgttgta 5821 cactgacccc aatgaggcaa ttgtggtgca gccacaaaat ggcaggtgca ctctagatgg 5881 aacacttcaa gggaccaccc aactggtacc cacccaaatt tgctccttta gaggcacgct 5941 aattagccag acgtcgaggt ctgcagattc aacagattca gccccacggg tgaggaacca 6001 ccctctccac gttcagctga agaacctcga tgggacacca tatgacccaa cagatgaggt 6061 gccagcagtt ttgggtgcca tagatttcaa agggactgta tttggggtcg gtagtcaaag 6121 aaacaccaca gggaactcta taggagcaac ccgcgcccat gaagtgcaca tagataccac 6181 aaaccctaga tacaccccaa agcttggctc tgtgttgatg cattctgaat ctactgattt 6241 tgatgatgga caacccatcc gctttacccc cattggcatg ggagccgatg attggcacca 6301 atgggaattg cccgaatatt ctggtcacct tactctcaat atgaatttgg cccccgcagt 6361 tgcccctgct ttccctggtg aacgcgttct cttcttcaga tcagtggtgc cgtctgctgg 6421 tggctatgga tcaggccaca tagattgcct catcccacag gaatgggttc agcatttcta 6481 ccaggaggcc gctccatcac agtctgcggt ggctcttatc agatatgtca accctgacac 6541 tggaagaaac atctttgagg caaaactgca tagagaaggg ttcatcactg tggcaaattc 6601 tggaaacaac cccatagttg tgcccccaaa tggttacttc aggtttgaag cctgggtcaa 6661 tcaattctac acactcaccc ccatgggaac tggacagggg cgcagaagag ttcaataatg 6721 gcaggagctt tcttagctgg attggcaggt gacgccataa caagtggtgt cgggtcccta 6781 atcaatgctg gggccaatgc aattaaccaa aaggtggagt atgactttaa taaacagctc 6841 caattggcat ccttcaaaca tgataaagag atgttacagt cacaagtact ggccactaaa 6901 cagctccagc aagaaatgat gaacataagg caaggggtct tgtccgctgg cggcttttcc 6961 cctacagatg ccgcgagagg tgctgtgaac gcgccgatga caaaaatttt ggattggaat 7021 ggaacaaggt acttcgcccc aaatagtatg aagacaacca gttattcggg ccagttttcc 7081 agcaaccctg tacataggca acccacttta ccccgttctg accccccaaa aatcaaggtt 7141 aatagtgatt ctgctagtgt gtatagttct gcatctactg ctccttcaca atcaacccac 7201 tcaacgacct tgactgcggg gtctagttca tctaggacaa cctcctcttc tgcagtagct 7261 tccactctat ctaggactag tgattgggtg agaggacaaa atgagaggct cagcccgttc 7321 atggatggtg ctcttcaaac aacttttgtc acaccaccat cgagcaaggc ttcttcatat 7381 gggtcggtct caaccgttcc caaagctgtt ttggactcct ggactcctat gttcaacacc 7441 cataggcagc ctctcttcgc tcatgtacgc aggcgagggg agtcacagat ttagtgaaaa 7501 gattgattag gttttctttc tttcttcttt tccttttctt tctttt //