Typing tool
|
Complete norovirus genomes
MW305705 | GII.1 | ||
---|---|---|---|
GII.P33 |
ORF1: 1..5075 ORF2: 5056..6663 ORF3: 6663..7436LOCUS MW305705 7463 bp RNA linear VRL 06-JUL-2021 DEFINITION Norovirus GII isolate CMH-N016-12 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MW305705 VERSION MW305705.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7463) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Genome-wide analyses of human noroviruses reveal coexistence of viral populations evolving under recombination constraints JOURNAL Unpublished REFERENCE 2 (bases 1 to 7463) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Direct Submission JOURNAL Submitted (23-NOV-2020) CBER/OVRR/DVP/LHV, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA COMMENT ##Assembly-Data-START## Assembly Method :: High-performance Integrated Virtual Environment (HIVE) v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7463 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="CMH-N016-12" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="Thailand" /collection_date="27-Jan-2012" /note="genotype: GII.1[GII.P33]" gene <1..5075 /gene="ORF1" CDS <1..5075 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QPJ59451.1" /translation="AAATSNNDIAKSSSDGVLSSMAVTFKRALGARPKQPPPRETTQK QKPPRPPTPELIKKIPPPPPNGEDDIVVSYSAKDGVSGLPELSTVRQPGETNTAFSVP PLNQRENRDAKEPLPGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELTPL SLYWRPVYTPQYLISPETLKKLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQR TTGFFRPYQDWNRKPLPTMDDSKLKKVANIVLCALSSLFTRPIKDIIGKLRPLNILNI LASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVV MGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKDETNELA MVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDLEEEKARRLSTKSASPDIVG TINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLARDLAKKVAATLT GDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRI ENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPDVEKAKRDFPGQP DMWKDAFRSDFSHIKLMLAPQGGFDKNGNTPHGKGVMKTLTSGSLVARASGLLHERLD EYELQGPTPTTFNFDQNKVFAFRQLAAENKYGLMDTMRVGSQLKNVKTVSELKQALKN ISIRRCQIVYGGLTYSLESDGKGNVKVEKVQSPAVQTNNELTGALHHLRCARIRYYVK CVQESLYSIIQIAGAAFVTTRIAKRMNIQNLWSRPQVEDEEETTSKDGCPKPKDEEEF VISSEDIKAEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQ DRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERVSLGLVTGSEIRKRN PDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHV IPQGAQEFFGVSIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKR PTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKR GNDYVVIGVHTAAARGGNTVICATQGSEGEAVLEGGDNKGTYCGAPILGPGGAPKLST KTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPKPSV LEAAKKTIINVLEQTIDPPQKWTFAQACASLDKTTSSGYPHHVRKNEHWNGESFTGKL ADQASKANLMFEEGKHMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLATMIRCARA FGGLMDELKTHCVTLPIRVGMNMNEDGPIIFEKHSRYTYHYDADYSRWDSTQQRAVLA AALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFKISITEGLPSGVPCTSQWNSIAHW LLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPERLTVKLKEYGLKPTRP DKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPSETM IPHSQRPIQLMSLLGEAALHGPSFYSKISKLVISELKEGGMDFYVPRQEPMFRWMRFS DLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..965 /gene="ORF1" /product="p48" mat_peptide 966..2063 /gene="ORF1" /product="NTPase" mat_peptide 2064..2600 /gene="ORF1" /product="p22" mat_peptide 2601..2999 /gene="ORF1" /product="VPg" mat_peptide 3000..3542 /gene="ORF1" /product="Pro" mat_peptide 3543..5072 /gene="ORF1" /product="RdRp" gene 5056..6663 /gene="ORF2" CDS 5056..6663 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QPJ59452.1" /translation="MKMASNDAAPSNDGAAGLVPEVNNETMALEPVAGASIAAPLTGQ NNVIDPWIRMNFVQAPNGEFTVSPRNSPGEVLLNLELGPELNPFLAHLSRMYNGYAGG VEVQVLLAGNAFTAGKLVFAAIPPHFPLENLSPGQITMFPHVIIDVRTLEPVLLPLPD VRNNFFHYNQQPEPRMRLVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFNYLVPP TVESKTKPFTLPILTIGELSNSRFPVPIDELYTSPNEGVVVQPQNGRSTLDGELLGTT QLVPSNICALRGRINAQVPDDHHQWNLQVTNANGTSFDPTEDVPAPLGTPDFLANIYG VTSQRNPDNTCRAHDGVLATWSPKFTPKLGSVVLGTWEESDLDLNQPTRFTPVGLYDT NHFDQWILPNYSGRLTLNMNLAPSVAPLFPGEQILFFRSHIPLKGGTSNGAIDCLLPQ EWIQHFYQESAPSPTDVALIRYTNPDTGRVLFEAKLHRQGFITVANSGSRPIVVPPNG YFRFDSWVNQFYSLAPMGTGNGRRRVQ" gene 6663..7436 /gene="ORF3" CDS 6663..7436 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QPJ59453.1" /translation="MAGAFVAGLAGDIVTNGIGSLVNAGANAINQKVDFENNKQLQQA SFNHDKEMLQAQIQATKQLQADMIALRQGVLTAGGFSPTDAARGAVNAPMTQVLDWNG TRYWAPGATKTTAFSGGFTSSPNTRTIDLPRKTSNTPAPTPVSRPSSSASTVSTRSTI VSGTSSPSSSTRSSFNSQPTSSSSRTSEWVRSQNRALEPYMRGALQTAYVTPPSSRAS SNGTVSTVPKEVLDSWTSVFNTHRQPLFAHLRRRGESQA" ORIGIN 1 ccgctgctgc taccagcaac aacgacatcg caaagtcttc aagtgatgga gtactaagta 61 gtatggctgt cacttttaaa cgagccctcg gggcccggcc taaacagccg cccccgaggg 121 aaacaacaca aaaacaaaaa cctccacgac cgcccacccc ggagttaatt aaaaagattc 181 cacctccccc gcccaacggg gaggatgaca tagtggtgtc ctacagtgcc aaagatggtg 241 tctccggtct gcctgagcta tccaccgtca ggcagccggg cgagacaaac acagcattta 301 gtgtcccccc actcaaccag agggaaaaca gggatgctaa agaaccgctg cctggcacca 361 tcttggagat gtgggacgga gagatttacc actacggctt gtacgtggag cggggcttgg 421 tgcttggggt gcacaaacca ccagcagcta ttagccttgc gaaggttgaa ctaacgccac 481 tgtctctgta ctggagacca gtgtacaccc ctcagtacct catatccccg gaaactctca 541 aaaagctaca tggagagaca ttcccttaca cggcctttga caacaactgt tatgccttct 601 gttgttgggt cttggatcta aatgactctt ggctcagcag gaggatgata cagagaacaa 661 ctggcttctt cagaccctac caggactgga acaggaaacc tctccccact atggatgact 721 ccaaattgaa aaaggtggct aatatagtcc tgtgtgcttt gtcatcacta ttcaccaggc 781 ccattaagga cataatagga aaactaaggc ccttaaacat actcaatata ctagcttcct 841 gcgactggac tttcgcagga atagtagaat ccctaattct ccttgcagaa ctcttcggag 901 ttttctggac acccccagat gtgtctgcga tgatcgcccc cttgctaggt gactacgaac 961 tacagggacc tgaagatctc gctgtagaac tcgtaccagt agtaatgggg gggataggtt 1021 tggttctagg tttcaccaag gagaagattg gcaagatgct atcatctgcc gcgtctaccc 1081 tcagagcttg taaggacctt ggagcatatg gactggaaat tctgaagttg gttatgaagt 1141 ggttcttccc gaagaaagac gagacaaatg aacttgcaat ggtgagatcc attgaggacg 1201 cagtgctaga cctcgaagca attgaaaata accacatgac taccctactc aaagacaaag 1261 acagcttagc aacctacatg aagactctgg atcttgagga ggagaaagct aggagactct 1321 ccaccaaatc cgcttctcct gacattgtgg gcacaatcaa ttcacttctg gcacggattg 1381 cagctgcccg ttccctggtg catcgggcaa aagaagagct ttccagtagg ccaagacccg 1441 ttgttgtcat gatatcagga aagccaggga tagggaagac ccaccttgcc agagatcttg 1501 caaagaaagt tgcggccacc ctcacaggag accagagggt gggcctcatc ccacggaatg 1561 gtgtcgacca ctgggatgca tacaaaggtg agagagtcgt cctttgggat gactatggaa 1621 tgagcaaccc cattcatgac gcccttaggc tacaagaact tgctgacact tgccccctaa 1681 cattgaattg tgatagaatt gagaacaaag gaaaggtctt tgacagtgat gccataatta 1741 tcacaactaa tctggctaac ccggcaccac tggactatgt taattttgaa gcatgttcga 1801 ggcgcatcga cttcctagtg tatgctgatg cccccgacgt tgaaaaggca aaacgcgact 1861 tcccagggca acctgacatg tggaaggatg cttttagatc tgacttctca cacataaagt 1921 tgatgttggc tcctcagggt ggttttgata agaacggcaa caccccacat ggaaaaggtg 1981 tcatgaaaac cctcacatct ggctccctcg ttgcccgtgc ttcgggactc ctccacgaaa 2041 gactggacga gtacgagtta caagggccaa cacccacaac cttcaatttc gaccaaaaca 2101 aggtgttcgc ctttagacag ctcgccgctg aaaataaata cgggctgatg gacaccatga 2161 gagtgggaag ccagctcaag aatgtcaaaa ccgtgtcaga gctcaaacag gcacttaaga 2221 acatttcaat tagaaggtgc cagatagtct acggtggtct cacatattca cttgaatctg 2281 atggcaaagg taatgtgaag gttgagaagg tgcaaagtcc agctgtgcaa accaacaatg 2341 agctaactgg cgcattacac catctgagat gcgctaggat tagatattat gttaagtgtg 2401 tccaggaatc tttgtactcc atcatccaaa ttgctggagc cgcgtttgtc accacgcgca 2461 ttgcaaagcg catgaacata caaaacctct ggtccagacc acaggtggag gatgaggaag 2521 agaccaccag caaagacggg tgcccaaaac caaaggatga ggaggagttt gtcatctcgt 2581 ctgaagacat caaagccgaa ggcaagaagg gaaagaacaa gtctggccgt ggcaagaaac 2641 acacagcctt ctcgagtaag ggtcttagtg atgaggagta cgatgagtac aagagaatca 2701 gagaagaaag aaatggcaag tactccatag aagaatacct ccaagacaga gacaagtact 2761 atgaggaggt tgccatagcc agggctactg aagaggactt ctgtgaggaa gaagaagcca 2821 agattcggca gaggattttc agaccaacaa ggaaacaacg caaggaggag agggtttcac 2881 tcggactggt tacaggctca gaaatcagga aaaggaaccc ggatgacttc aaacccaaag 2941 gaaagttgtg ggctgatgac gacaggagtg tcgattacaa tgaaaagctc agttttgaag 3001 cccccccaag catctggtcg agaatagtca actttgggtc aggttggggc ttctgggtct 3061 caccaagtct ctttataaca tccacccatg tcatacctca aggggcacaa gagttcttcg 3121 gtgtttccat caaacagatc caaattcata agtcaggtga gttctgccga ctaagattcc 3181 cgaagccaat tagaactgat gtgacaggta tgatcctgga ggagggggcc cctgaaggca 3241 ctgtggccac actgctcata aagaggccaa ctggagagct gatgccactg gcagctagaa 3301 tgggcaccca tgcaactatg aagatccaag gtcgaactgt cgggggacag atggggatgc 3361 tcctgacagg atccaacgcc aagagcatgg atttgggtac tacaccaggt gactgtggat 3421 gcccatatat ctacaaaaga ggaaatgact acgtggtcat cggagtccac acagctgccg 3481 cccgcggagg gaacactgtc atctgtgcaa ctcaaggaag tgagggcgag gccgtacttg 3541 aaggtggtga caacaaaggc acctattgtg gcgccccaat cctaggtcca ggaggtgcac 3601 ccaaactcag tactaaaacc aaattctgga ggtcctcaac agcaccactc ccacctggca 3661 catatgaacc agcttacctt ggaggcaaag accccagggt aaaaggtggc ccctcattac 3721 agcaagttat gagagaccaa ctgaaaccgt ttacagaacc cagaggtaag ccaccaaagc 3781 ccagtgtatt ggaagcagcc aaaaagacaa tcattaatgt ccttgagcaa acaatagacc 3841 caccccaaaa atggacgttc gcacaggcgt gcgcatccct tgataaaaca acctctagtg 3901 gttacccgca ccacgtgcgg aagaatgaac attggaatgg ggaatctttc acaggaaaat 3961 tggctgatca ggcttcaaaa gctaacttga tgtttgagga aggaaaacac atgacaccag 4021 tctacacagg agctctcaaa gatgagttag tcaaaactga caaaatatat ggtaagatca 4081 agaaaaggtt gttatggggt tcagatctgg caaccatgat ccgttgcgca cgagcgttcg 4141 gagggctgat ggatgaactc aagacccact gtgttacact ccctatcagg gtggggatga 4201 acatgaatga ggatggtccc atcatttttg aaaaacactc taggtacacc taccattatg 4261 atgcagacta ctctcggtgg gactcaacac agcagagagc tgttttagct gcagccctgg 4321 agataatggt aaaattttcc ccagaaccac acctagctca gattgtcgca gaagacctcc 4381 tatctcccag tgtgatggac gtgggcgatt ttaaaatatc aatcactgaa ggactcccct 4441 ctggggtgcc ttgcacttca caatggaact ccattgctca ttggctcctc acgctctgtg 4501 cactatctga ggtaacagat ttatctcctg acatcatcca agcaaactca ctcttctctt 4561 tctatggtga tgatgaaatt gtgagcacag acattaaact agatccagaa aggctaacag 4621 tcaaactaaa agagtatggt ctcaagccaa cccgtcctga caagactgaa ggacctctgg 4681 tcatttctga ggacttagat ggtctgacct tcttgcggag aactgtgacc cgtgacccag 4741 caggttggtt tggaaaattg gaacaaagct caatacttag acagatgtat tggaccaggg 4801 gccccaatca tgaggacccc tctgaaacaa tgataccaca ctcccaacga cccatacagt 4861 tgatgtccct actaggtgaa gctgcattac atggcccatc attctacagc aaaatcagca 4921 aattggtcat atcagagttg aaagaaggtg gcatggactt ttacgtgcca aggcaagagc 4981 caatgttccg atggatgagg ttctcagact tgagcacgtg ggagggcgat cgcaatctgg 5041 ctcccagttt tgtgaatgaa gatggcgtcg aatgacgccg ctccatctaa tgatggtgca 5101 gccggtctcg taccagaggt caacaacgag acgatggcac ttgagccggt ggctggggct 5161 tccatagctg cccctctaac cggccaaaat aatgtgatag acccctggat tagaatgaat 5221 tttgtccaag ctccaaatgg tgaatttaca gtgtcccccc gtaattcccc aggtgaagtg 5281 ttgttaaatt tggaattagg ccctgaatta aatccattcc tggcacacct ttcaagaatg 5341 tacaatggtt atgctggtgg ggttgaagtg caggtgctac ttgctgggaa tgcgttcaca 5401 gcaggaaaac tagtgtttgc agcaatcccc ccacacttcc ctcttgagaa tctgagtcct 5461 ggtcaaatta caatgttccc tcatgtaatt attgatgtta gaactttgga acctgtcctc 5521 ttgcctttgc cagatgttag aaataatttc tttcattaca atcagcagcc tgagccccgt 5581 atgagacttg tggctatgtt gtatactcct cttagatcta atggttctgg tgatgatgtg 5641 ttcacagtct cttgtagggt cctcactcgc ccctctccag actttgattt taattatttg 5701 gtccccccaa ctgtggaatc taaaaccaaa ccattcaccc tgccaatatt gaccatcgga 5761 gaactgtcaa attctaggtt cccagtacca atagatgaat tgtacaccag ccctaatgaa 5821 ggagtggttg tgcaacctca aaatggcaga tcaacactcg atggtgaact gcttggtacc 5881 acacaacttg taccatcaaa catctgtgcc ttgcgagggc gcatcaacgc ccaagtgcca 5941 gacgatcacc atcaatggaa cttgcaggtt acaaatgcaa atgggacttc ctttgacccc 6001 actgaagatg tccctgcacc attgggcacg ccggatttcc tggcaaacat ctacggagtc 6061 actagtcaga ggaatcctga taacacttgc cgtgcccatg atggggtttt ggcaacctgg 6121 agccctaaat ttacacccaa gttaggatct gtggttttgg gcacttggga agaaagtgat 6181 cttgatctta atcagcctac cagatttaca cctgttggtt tgtatgatac aaaccacttt 6241 gatcagtgga ttctacccaa ctattctgga agattgacct taaacatgaa tttggcgccc 6301 tccgttgctc ccctcttccc aggtgaacaa atactcttct ttaggtccca cataccactt 6361 aaaggaggca cttccaacgg tgccattgac tgtctactcc ctcaggagtg gatccagcat 6421 ttctatcagg agtcagcccc gtcacccaca gatgttgctc tgattaggta caccaaccct 6481 gacacgggcc gtgtcctttt tgaggccaaa ctacatagac agggattcat cacagtagca 6541 aattctggtt ctagacctat tgttgtccct cctaatggtt attttaggtt tgattcttgg 6601 gttaatcagt tttattctct cgcccccatg ggaactggga atgggcgtag aagggtgcag 6661 taatggctgg agcttttgta gcagggcttg ccggtgacat agtcaccaat ggcattggtt 6721 cacttgtgaa cgctggagcc aatgcaataa accaaaaagt agattttgaa aacaacaaac 6781 aactacagca ggcctctttc aaccatgata aggaaatgtt gcaagctcaa attcaggcca 6841 caaaacaatt acaggctgac atgattgcgc tcagacaagg ggtattgacc gcaggcggct 6901 tttcccctac tgatgcagca agaggggcgg ttaatgcgcc catgacgcaa gtcttggact 6961 ggaatgggac taggtactgg gcccccggag ccacgaaaac aactgctttc tccggcgggt 7021 tcacaagctc tcccaatacc aggactattg atttgcctag gaaaacatca aacacaccag 7081 cccccacgcc tgtgtctaga cctagttctt ctgcttctac tgtttctacc cgctcaacaa 7141 ttgttagcgg gacttctagc ccttcttctt cgactaggag ctcttttaat tcccagccca 7201 cctcctcttc ttctcggact agcgaatggg tgcgtagtca aaacagggcg ctggaaccct 7261 acatgagggg agcgttacaa actgcttacg tgacgcctcc ctctagtagg gcctccagca 7321 atggtacagt ctcaaccgtg ccaaaagaag ttttggactc ctggacatct gtgttcaaca 7381 cccacagaca accgcttttc gctcatctcc gtcggagagg ggagtcacag gcttagtgaa 7441 aagatgattt ttcttctatt ctt //