Typing tool
|
Complete norovirus genomes
OR397775 | GII.1 | ||
---|---|---|---|
GII.P1 |
ORF1: 5..5104 ORF2: 5085..6692 ORF3: 6692..7471LOCUS OR397775 7513 bp RNA linear VRL 28-NOV-2023 DEFINITION Norovirus GII isolate NHP_#4_ch2_d8, complete genome. ACCESSION OR397775 VERSION OR397775.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7513) AUTHORS Rimkute,I., Chaimongkol,N., Woods,K.D., Nagata,B.M., Darko,S., Gudbole,S., Henry,A.R., Sosnovtsev,S.V., Olia,A.S., Verardi,R., Bok,K., Todd,J.-P., Woodward,R., Kwong,P.D., Douek,D.C., Alves,D.A., Green,K.Y. and Roederer,M. TITLE A non-human primate model for human norovirus infection JOURNAL Unpublished REFERENCE 2 (bases 1 to 7513) AUTHORS Rimkute,I., Chaimongkol,N., Woods,K.D., Nagata,B.M., Darko,S., Gudbole,S., Henry,A.R., Sosnovtsev,S.V., Olia,A.S., Verardi,R., Bok,K., Todd,J.-P., Woodward,R., Kwong,P.D., Douek,D.C., Alves,D.A., Green,K.Y. and Roederer,M. TITLE Direct Submission JOURNAL Submitted (04-AUG-2023) VRC, NIH, 40 Convent Drive, Bethesda, MD 20892, USA COMMENT ##Assembly-Data-START## Assembly Method :: CLC Genomics Workbench v. v.21.0.3 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7513 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="NHP_#4_ch2_d8" /isolation_source="stool" /host="Rhesus macaque" /db_xref="taxon:122929" /country="USA" /collection_date="Aug-2022" /note="genotype: GII.1" gene 5..5104 /gene="ORF1" CDS 5..5104 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="WLE90398.1" /translation="MKMASNDASAAAAANSNNDTVKSSSDGVLSSMAVTFKRALGARP KQPPPREIPQRPPRPPTPELIKKVPPPPPNGEDEPVVSYSVKDGVSGLPDLSTVRQPP ENNTAFSVPPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVEQGLVLGVHKPPAAIS LAKVELTPLSLYWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS WLSRRMIHRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCALSSLFTRPIKDIIGK LRPLNILNILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED LAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KKEEASELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLAAYMRTLDLEEEKARKLST KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLAREL AKKIAATLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELADTC PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPDVEK AKRDFPGQPDMWKNAFSPDFSHIKLMLAPQGGFDKNGNTPHGKGVMKTLTVGSLIARA SGLLHERLDEYELQGPALTTYNFDRNKVLAFRQLAAENKYGLMDTMRVGGQLKGVRTM SELKQALKNISVKRCQIVYSGCTYTLESDGKGSVRVDRVQNTTVQTNNELAGALHHLR CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQDLWSKPQLEDTGEAVSKEGC PKPKDDEEFVVSSDDIKVEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV TGSEIRKRNPDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP SLFITSTHVIPQGTQEFFGVSIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGNEGEAILEGGDDKGTYCGAPILG PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKSGPSLQQVMRDQLKPFTEP RGKQPKPSVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHIRKNDCW NGDSFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD STQQRAVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCT SQWNSITHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK EYGLKPTRPDKTEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP NHEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVISELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide 5..994 /gene="ORF1" /product="p48" mat_peptide 995..2092 /gene="ORF1" /product="NTPase" mat_peptide 2093..2629 /gene="ORF1" /product="p22" mat_peptide 2630..3028 /gene="ORF1" /product="VPg" mat_peptide 3029..3571 /gene="ORF1" /product="Pro" mat_peptide 3572..5101 /gene="ORF1" /product="RdRp" gene 5085..6692 /gene="ORF2" CDS 5085..6692 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WLE90399.1" /translation="MKMASNDAAPSNDGAAGLVPEVNNETMALEPVAGASIAAPLTGQ NNVIDPWIRMNFVQAPNGEFTVSPRNSPGEILLNLELGPELNPFLAHLSRMYNGYAGG VEVQVLLAGNAFTAGKLVFAAIPPHFPLENLSPGQITMFPHVIIDVRTLEPVLLPLPD VRNNFFHYNQQPEPRMRLVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFNYLVPP TVESKTKPFTLPILTIGELSNSRFPVPIDELYTSPNEGVIVQPQNGRSTLDGELLGTT QLVPSNICALRGRINAQVPDDHHQWNLQVTNTNGTPFDPTEDVPAPLGTPDFLANIYG VTSQRNPNNTCRAHDGVLATWSPKFTPKLGSVILGTWEESDLDLNQPTRFTPVGLFNA DHFDQWALPSYSGRLTLNMNLAPSVSPLFPGEQLLFFRSHIPLKGGTSDGAIDCLLPQ EWIQHFYQESAPSPTDVALIRYTNPDTGRVLFEAKLHRQGFITVANSGSRPIVVPPNG YFRFDSWVNQFYSLAPMGTGNGRRRVQ" gene 6692..7471 /gene="ORF3" CDS 6692..7471 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WLE90400.1" /translation="MAGAFIAGLAGDIVTNSVGSLVNAGANAINQKVDFENNKQLQQA SFNHDKEMLQAQIQATKQLQADMIALRQGVLTAGGFSPTDAARGAVNAPMTQVLDWNG TRYWAPGATKTTAFSGGFTSSSHARTVDLPKKTAAAPATMPVSRPSSSASTASTRSTL VSGSSNLPSSARSSSSVFSQSTSPSSRTSEWVRSQNRALEPYMRGALQTAYVTPPSSR ASSNGTVSTVPKEVLDSWTSVFNTHRQPLFAHLRRRGESQV" ORIGIN 1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gctgctaaca gcaacaacga 61 caccgtaaaa tcttcaagtg acggagtact ctctagtatg gctgtcacct ttaaacgggc 121 cctcggggcg cggcctaagc agccgccccc gagggaaata ccacaaagac ccccacgacc 181 acccactcca gagttgatca aaaaggtccc ccctcccccg cccaatgggg aggatgaacc 241 agtggtctcc tatagcgtca aagatggcgt ttccgggctg cctgacctct ccaccgtcag 301 acagccacct gagaacaaca cggcgtttag cgttcctcca ctcaaccaga gggagaacag 361 ggatgccaag gagccactga ctggaacaat cctggaaatg tgggatgggg agatttacca 421 ttatggcttg tatgtggaac aaggcctggt gcttggcgtg cacaaaccgc cagcggccat 481 cagcctcgct aaagttgaat taacaccact ctctttatac tggagacctg tgtacacccc 541 ccagtatctc atctccccag acactcttag gaggctccat ggggaatcgt tcccttacac 601 agcttttgac aacaactgct atgccttctg ttgctgggtt ttagacctga atgactcatg 661 gttgagtaga agaatgatac ataggacaac tggcttcttc agaccttacc aagattggaa 721 tagaaagccc cttcccacca tggatgactc caaactgaag aaggtggcca atattttcct 781 gtgtgctttg tcttcactat tcactagacc catcaaagac ataataggga agctaagacc 841 ccttaacatc ctcaatattt tagcatcatg tgattggact tttgcaggca tagtggaatc 901 cctgatcctc ttggcagagc tctttggagt tttctggaca cccccagatg tgtctgcgat 961 gatcgccccc ttactgggtg actacgagct gcaggggcct gaggacctcg cggtagaact 1021 agtgccagta gtaatgggag ggataggttt ggtgttagga ttcaccaaag aaaagattgg 1081 aaagatgctg tcatccgctg cgtccacatt gagagcttgt aaagaccttg gtgcgtacgg 1141 gctggagatc ttaaaattag tcatgaaatg gttcttcccg aagaaagagg aggcgagcga 1201 attggccatg gtgaggtcca tcgaggacgc ggtgttggac ctcgaagcaa ttgagaacaa 1261 ccatatgacc gctttactta aagataagga cagcctggcg gcttacatga gaactctcga 1321 ccttgaggaa gagaaagcca gaaagctctc aaccaaatct gcttcacccg acatcgtggg 1381 cacaatcaac gctctcttgg cgaggatcgc agccgctcgc tccctcgtgc atcgggcaaa 1441 ggaagagctc tccagtaggc cgagacccgt tgttgtgatg atatcgggaa aaccagggat 1501 agggaagacc catcttgcca gagaactggc caaaaagatc gcagctactc tcacagggga 1561 tcagagagtg ggtctcattc cacgcaatgg cgttgaccac tgggacgcat acaaggggga 1621 gagggtcgtc ctttgggatg actacgggat gagcaacccc atccatgatg ctctcagaat 1681 acaggagctt gctgacactt gccccctaac attaaactgt gatagaattg aaaacaaagg 1741 gaaagttttt gacagtgatg ccataatcat caccaccaac ttggccaacc ctgcaccact 1801 agactatgtc aattttgagg catgctcgag gcgcattgat ttcctcgtgt atgccgacgc 1861 ccctgatgtc gaaaaggcga agcgcgactt cccaggccaa cctgacatgt ggaaaaatgc 1921 ttttagtcct gacttctcac acataaaatt gatgctagcc ccacagggtg gttttgataa 1981 gaacggcaac accccacatg ggaagggcgt catgaaaacc ctcaccgttg gttccctcat 2041 cgcccgtgca tcaggactcc tccatgagag actggatgag tacgaactac agggcccagc 2101 tctcacaacc tacaactttg accgaaacaa agtgcttgct ttcaggcagc ttgctgctga 2161 aaacaagtac ggtttaatgg acacaatgag agtcggaggg cagctcaagg gtgtcagaac 2221 catgtcagag ctcaaacaag cactcaaaaa catctcagtt aaaaggtgcc agatagtgta 2281 cagtggttgc acttacacac ttgaatctga tggtaagggc agtgtgaggg ttgacagagt 2341 tcagaacacc actgtgcaga ccaacaacga gttagccggc gccctgcacc atctcaggtg 2401 cgccaggatt aggtactatg tcaagtgtgt tcaagaggcc ctgtattcca tcatccaaat 2461 tgcaggggct gcgtttgtca ccacgcgcat tgccaaacgc atgaacatac aggacctttg 2521 gtccaagcca cagctggagg acacaggaga agctgtcagc aaagaagggt gcccaaaacc 2581 caaggatgat gaggagttcg ttgtttcatc tgacgacatc aaggtcgagg gcaagaaagg 2641 gaaaaacaag actggtcgcg gcaagaaaca cacagccttc tctagtaaag gtctcagtga 2701 tgaggagtac gacgagtaca aaagaatcag ggaggaaaga aacggcaagt actctataga 2761 agagtatctc caggacagag acaagtatta cgaggaggtt gccattgcta gggcgactga 2821 ggaggacttc tgtgaagaag aagaagccaa aattcgacag aggatcttca gaccaacaag 2881 gaaacaacgt aaagaggaga gggcttctct cggtctagtc acaggctctg agattaggaa 2941 gagaaaccca gatgacttca aacccaaggg aaaactgtgg gccgatgacg acagaagtgt 3001 tgattacaat gagaagctca gttttgaagc cccaccaagc atctggtcaa gaatagtcaa 3061 ctttggttcg ggctggggct tttgggtctc tcctagtctg ttcataacat caactcatgt 3121 tattccccag ggcacacagg agttctttgg agtgtctatc aaacaaatcc aaatacacaa 3181 atcaggtgag ttttgccgct tgaggttccc aaagccaatc aggactgacg tgacaggtat 3241 gatcttagag gaaggtgccc ccgaagggac cgtggccaca ctgctcatca aaagaccaac 3301 tggtgagctc atgcctttag cagccaggat gggaactcac gcaaccatga agatccaagg 3361 tcgcactgtt gggggccaaa tgggtatgct cctaacagga tccaatgcta aaagtatgga 3421 cctgggcact acacccggtg actgtggttg tccctacatc tataagagag ggaatgacta 3481 tgtggtcatc ggggttcaca cagctgctgc ccgtgggggg aacactgtca tatgtgccac 3541 ccagggtaat gaaggtgagg ccatacttga gggcggagat gacaaaggca cctactgtgg 3601 tgccccaatc ctgggtccag gaagtgcccc aaagctcagc accaagacca aattttggag 3661 gtcatccaca acgccactcc cgcctggcac ctacgaacca gcctacctcg gtggcaagga 3721 ccccagagtc aagagtggcc cctcattaca acaagtcatg agagaccagt tgaaaccatt 3781 cacagaacca agaggtaaac aaccaaaacc aagtgtgttg gaggctgcca agaaaaccat 3841 catcaatgtt cttgaacaaa caatagaccc gcctcaaaaa tggtcatttg cgcaagcttg 3901 tgcgtctctt gacaagacca cctccagtgg ccacccacac cacatacgga agaatgactg 3961 ctggaatggg gactctttca caggcaagtt ggcagatcag gcctcgaagg ccaacttgat 4021 gtttgaggag gggaagaaca tgaccccagt ctacacaggt gccctcaagg atgagttagt 4081 caaaactgac aagatatatg gtaagatcaa gaagaggtta ctctggggct cggacctggc 4141 aaccatgatc cggtgtgcac gagcgttcgg aggtctgatg gacgaactta aagcccactg 4201 tgtcacactc cctgtcagag ttggtatgaa catgaatgag gatggcccca ttatctttga 4261 gaaacactcc aggtataaat atcattatga tgcagattat tctcgatggg actcaacaca 4321 gcagagagcc gtactagctg cagccctaga gatcatggtc aaattctccc cagagccaca 4381 cttggcccag gtagttgcag aagaccttct ttcccccagt gtgatggatg tgggtgactt 4441 caagatatca atcaacgagg gtcttccctc tggggtgccc tgcacctcgc aatggaactc 4501 catcacccac tggctcctca ctctttgtgc actctctgaa gtcacggacc tgtcccctga 4561 catcattcaa gccaattcct tattctcttt ctatggtgat gatgaaattg tgagtacaga 4621 cataaaattg gacccagaaa aactgacagc aaaactgaaa gagtatgggc tgaagccaac 4681 ccgccctgac aagactgagg gacctctgat catttctgag gatctggatg gtttaacctt 4741 tctgcggaga actgtaaccc gtgatccagc tggttggttt ggtaaattag aacagagctc 4801 aatacttagg cagatgtact ggactagggg ccccaaccat gaggatccat ctgaaacaat 4861 gataccacat tcccaaaggc ccatacagtt gatgtctctg ctaggtgaag ctgcattgca 4921 cggtccagca ttctacagca aaatcagtaa actagtcatt tcagagttga aggaaggtgg 4981 catggacttt tacgtgccca ggcaagagcc gatgttcaga tggatgagat tctcagacct 5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga 5101 atgacgccgc cccatctaat gatggtgcag ccggtctcgt accagaggtc aacaacgaga 5161 cgatggccct cgaaccggtg gctggggctt ctatagccgc ccctctaacc ggtcaaaata 5221 atgtgataga cccctggatt agaatgaact ttgtccaagc cccaaatgga gaattcacag 5281 tgtctccccg caattctcct ggtgaaatct tgctaaattt ggaattaggc cctgaattaa 5341 atccattctt agcacacctt tcaagaatgt ataatggtta tgccggcggg gttgaagtgc 5401 aggtactact cgctgggaac gcgttcacag cgggaaaact ggtgtttgca gcaatccccc 5461 cgcacttccc tcttgagaat ctgagtcctg gacaaattac aatgttccct catgtgatta 5521 ttgatgttag aacattagaa cctgtgcttt tgccccttcc agatgttaga aataatttct 5581 ttcattacaa tcagcagccc gagccccgta tgagacttgt agctatgttg tatactcctc 5641 ttagatctaa tggttctggt gatgatgtgt tcacagtttc ttgcagggtt ctcacccgcc 5701 cttctccaga ttttgatttt aattatttgg ttcccccaac tgtggagtct aaaactaaac 5761 cattcaccct gccaatccta actatcggag aattgtcaaa ttctagattc ccagttccaa 5821 tagatgaatt gtacaccagc cccaatgaag gagtgatcgt gcagccccaa aatggcagat 5881 caacacttga tggtgaattg ttgggcacca cgcaactcgt gccctcaaac atctgcgcgc 5941 tacgagggcg cattaacgcc caggtgccag atgatcacca tcaatggaac ctacaggtaa 6001 caaacacaaa tgggactcct ttcgacccca ccgaagacgt ccctgcacca ctgggcacac 6061 cggatttcct ggcgaatatc tatggagtca ccagccagag aaaccccaac aacacttgcc 6121 gtgcccatga tggggttttg gcaacttgga gccccaaatt tacacccaag ttaggatctg 6181 tgattttggg cacttgggaa gaaagtgatc ttgatctcaa tcagcccaca aggttcacac 6241 ctgttggtct gtttaacgct gaccactttg atcagtgggc cttgcctagt tattctggaa 6301 gattaaccct aaacatgaat ttggcaccct ctgtttcccc cctctttcca ggtgaacagc 6361 tacttttctt caggtcccat ataccactca aaggaggtac ctctgatggt gccattgatt 6421 gtctactccc ccaggaatgg attcagcatt tttatcagga gtcagcccca tcgcccacgg 6481 acgtggctct aattagatac accaatcctg acacaggccg cgttttgttt gaagctaaac 6541 tgcacaggca aggattcatc acagtggcaa actctggttc taggcctatt gttgtccctc 6601 cgaatggcta ttttaggttt gattcttggg ttaatcaatt ctattctctc gcccccatgg 6661 gaactgggaa cgggcgcaga agagtgcagt aatggctgga gcttttatag cagggcttgc 6721 tggtgacata gtcaccaata gtgttggctc acttgtgaac gctggagcta atgcaataaa 6781 ccaaaaagtg gactttgaaa acaacaagca actacagcag gcttctttca atcatgacaa 6841 agagatgctg caagctcaaa tccaagccac caaacagcta caggctgaca tgattgctct 6901 cagacaaggg gtgttgaccg caggcggctt ctcccctact gatgcagcaa gaggggcagt 6961 taatgcgcct atgacgcagg tcttggactg gaatgggact aggtattggg cccccggagc 7021 cacaaaaacc accgctttct ccggtgggtt cactagctct tcccatgcca ggactgttga 7081 tctgcccaag aaaacagcag ctgcaccagc cacgatgcct gtttctagac ccagttcttc 7141 tgcttctaca gcctctactc gctcaacatt ggttagcggg tcttctaacc ttccttcttc 7201 agctaggagt tcttctagtg tcttttccca atccacttcc ccttcctccc ggactagtga 7261 gtgggtgcgt agtcaaaata gggcactgga gccttacatg aggggggcgc tacaaacagc 7321 ttacgtgacg cctccttcta gtagagcatc tagtaatggc acagtctcaa ccgtgccgaa 7381 agaagttttg gactcctgga catctgtgtt taacactcac agacagccgc tcttcgctca 7441 tctccgtcgg agaggggagt cacaagttta gtgaaaagat gatctcttct ttctttgaaa 7501 atttctgtct ttt //