Typing tool
|
Complete norovirus genomes
MW305572 | GII.4 | ||
---|---|---|---|
GII.P39 |
ORF1: 1..5069 ORF2: 5050..6669 ORF3: 6669..7475LOCUS MW305572 7520 bp RNA linear VRL 06-JUL-2021 DEFINITION Norovirus GII isolate C110 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MW305572 VERSION MW305572.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7520) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Genome-wide analyses of human noroviruses reveal coexistence of viral populations evolving under recombination constraints JOURNAL Unpublished REFERENCE 2 (bases 1 to 7520) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Direct Submission JOURNAL Submitted (23-NOV-2020) CBER/OVRR/DVP/LHV, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA COMMENT ##Assembly-Data-START## Assembly Method :: High-performance Integrated Virtual Environment (HIVE) v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7520 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="C110" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="French Guiana" /collection_date="04-Apr-1978" /note="genotype: GII.4[GII.P39]" gene <1..5069 /gene="ORF1" CDS <1..5069 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QPJ59052.1" /translation="AVANSNNDTAKSSSDGVLSSMAVTFKRALGARPKQPPPRETPQR PPRPPTPELIKKVPPPPPNGEDEPVISYNAKSGVSGLPELSTVRQPDENNTAFSVPPL NQRENRDAKEPLTGTILEMWDGEIYHYGLYVDRGLVLGVHKPPAAISLAKVELTPLSL FWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTT GFFRPYQDWNRKPLPTMDDSKLKKVANIFLCALSSLFTRPIKDIIGKLRPLNILNILA SCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVVMG GIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEASELAMV RSIEDAVLDLEAIENNHLTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTI NALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLARELAKKIATTLTGD QRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIEN KGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPDVEKAKRDFPGQPDM WKNAFSPDFSHIKLMLAPQGGFDKNGNTPHGKGVMKTLTIGSLIARASGLLHERLDEY ELQGPALTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGGQLKGVRTMPELKQALKNIS IKRCQIVYGGSTYTLESDGKGSVRVDKVQSATVQTNNELAGALHHLRCARIRYYVKCV QEALYSIIQIAGAAFVTTRIVKRMNIQDLWSKPQLEDTEEATGKDGCPKPKDDEEFVV SSDDIKVEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDR DKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPD DFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIP QGTQEFFGVSIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRPT GELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGN DYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILGPGSAPKLSTKT KFWRSSTAPLPPGTYEPAYLGGKDPRIKGGPSLQQVMRDQLKPFTEPRGKQPKPSVLE AAKRTIINVLEQTIDPPQKWTFAQACASLDKTTSSGHPHHIRKNECWNGDSFTGKLAD QASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLATMIRCARAFG GLMDELKAHCVTLPVRVGMNMNEDGPIIFERHSRYRYHYDADYSRWDSTQQRAILAAA LEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFKISINEGLPSGVPCTSQWNSIAHWLL TLCALSEVTDLSPDIVQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDK TEGPLIISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPSETMIP HSQRPTQLMSLLGEAALHGPAFYSKISKLVITELKEGGMDFYVPRQEPMFRWMRFSDL STWEGDRNLAPSFVNEDGVE" mat_peptide <1..959 /gene="ORF1" /product="p48" mat_peptide 960..2057 /gene="ORF1" /product="NTPase" mat_peptide 2058..2594 /gene="ORF1" /product="p22" mat_peptide 2595..2993 /gene="ORF1" /product="VPg" mat_peptide 2994..3536 /gene="ORF1" /product="Pro" mat_peptide 3537..5066 /gene="ORF1" /product="RdRp" gene 5050..6669 /gene="ORF2" CDS 5050..6669 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QPJ59053.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNIIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLSRMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD VRNNFYHYNQAHDSTLKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFTVPILTVEEMSNSRFPIPLEKLYTGPSSAFVVQPQNGRCTTDGVLLGTT QLSAVNICNFRGDVTRVGTSHDYTMNLVSQNWNNYDPTEEIPAPLGTPDFVGKIQGLL TQTTRADGSTRAHKATVSTGSVHFTPKLGSVQFTTDTDNDFLTGQNTKFTPVGVIQDG DHHQNEPQQWVLPNYSGTSGHNVHLAPAVAPAFPGEQLLFFRSTMPGCSGYPNMNLDC LLPQEWVLHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYITVAHTGPYDLVI PPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6669..7475 /gene="ORF3" CDS 6669..7475 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QPJ59054.1" /translation="MAGPFFAGLASDVLGSGLGSLISAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATQKLQQELMKVKQAVLLEGGFSTTDAARGAIN TPMTKALDWSGTRYWAPDARITTYNAGRFSTPQPLGALPGRTNPRVPASARSSPSSLS NAPTATSVYSNQTASTRLGSSAGSGAGVSSLPSTARTRSWVEDQNKNLSPFMKGALNT SFVTPPSSRSSNQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV" ORIGIN 1 ccgctgttgc taacagcaac aacgacaccg caaaatcttc aagtgacgga gtactttcta 61 gtatggctgt cacttttaaa cgagccctcg gggcgcggcc taagcagccg cccccgaggg 121 aaacaccaca aagaccccca cgaccgccca ctccggagtt gatcaagaag gtccctcccc 181 ccccgcccaa cggggaggac gaaccagtga tttcttacaa cgccaagagt ggcgtttctg 241 gactgcctga gctttccact gtcaggcaac cagatgagaa caacacggca tttagtgtcc 301 ccccacttaa ccaaagggag aacagggatg ccaaggagcc acttactgga acaatcctgg 361 agatgtggga tggggagatc taccattacg gcttgtatgt ggatcgaggc ctggtgctcg 421 gcgtgcacaa accgccagca gctattagcc tcgctaaggt tgaactgaca ccactctctt 481 tgttttggag acctgtctac accccccagt acctcatctc cccagacacc cttaggagat 541 tacatggtga atcgttccct tacacagcct ttgacaacaa ctgctatgcc ttctgttgct 601 gggttttgga cctgaacgac tcatggttga gtaggaggat gatacagagg acaactggct 661 tcttcagacc ttaccaggat tggaacagga agcctctccc caccatggat gattctaaat 721 taaagaaagt ggccaacata ttcctgtgtg ctttgtcttc actattcacc agacctatca 781 aggacataat aggaaagcta agacctctta acattctcaa tattttagct tcatgtgact 841 ggacctttgc aggcatagtg gagtctctaa tcctcttggc agaactcttt ggagttttct 901 ggacaccccc agatgtgtct gcgatgattg cccccttact gggtgactat gaactgcagg 961 ggcctgagga cctcgcagta gagctagtgc cagtagtaat ggggggaata ggtttggtgc 1021 taggatttac caaagaaaag attgggaaga tgttgtcatc tgccgcatcc acactgaggg 1081 cctgtaaaga ccttggtgca tatgggctgg agatcttgaa actagtcatg aagtggttct 1141 tcccgaagaa agaggaggcg agtgagttgg ccatggtgag atccatcgag gacgcggtgt 1201 tggacctcga agcaattgaa aacaatcatt tgaccactct gcttaaggac aaggacagcc 1261 tggcaactta catgagaact cttgaccttg aggaagagaa agccaggaag ctctcaacca 1321 agtctgcctc acctgacatc gtgggtacaa tcaatgctct cctggcgagg atcgcggctg 1381 cccgttccct ggtgcaccga gcaaaagaag aactttccag caggccaagg cctgttgtcg 1441 tgatgatatc aggaaaacca ggaataggga agacccacct tgccagagag ctggctaaga 1501 aaatcgcaac cacccttacg ggagaccaga gggtgggcct catcccacgc aatggtgttg 1561 accactggga tgcgtacaag ggagaaaggg tcgtcctttg ggatgactac ggtatgagta 1621 accccatcca cgatgccctc aggttgcagg agcttgctga cacttgcccc ctaacactaa 1681 attgtgacag gattgagaac aaaggtaaag tttttgacag tgatgctata attatcacca 1741 ctaacttggc caacccagca ccactggact atgtcaactt tgaggcatgc tcgaggcgca 1801 ttgacttcct cgtgtacgcc gaagctcctg atgttgagaa agcgaagcgc gacttcccag 1861 gccaacctga catgtggaag aatgctttca gccctgactt ctcccacata aagctgatgc 1921 tggccccgca gggtggcttt gacaagaatg gcaatacccc acatgggaag ggtgttatga 1981 agaccctcac catcggttct ctcatcgccc gtgcttcagg actcctccat gaaaggctgg 2041 atgagtacga gttgcaaggc ccagccctca caaccttcaa ctttgaccga aacaaagtgc 2101 tcgctttcag gcagcttgct gctgaaaaca agtacgggtt gatggatacg atgagagttg 2161 gagggcaact caagggcgtc agaaccatgc cagagctcaa gcaagcactc aagaacatct 2221 caattaagag gtgccagata gtgtatggtg gcagcaccta cacacttgaa tctgatggca 2281 agggtagtgt gagggttgac aaggtccaga gtgccactgt gcaaaccaac aacgaactag 2341 ccggcgccct gcatcatctc aggtgcgcca ggatcagata ctatgtcaag tgtgtccagg 2401 aagctctgta ctccatcatt caaattgcag gggctgcgtt tgtcaccacg cgcattgtca 2461 agcgcatgaa catacaggat ctatggtcca agccacaact ggaggatacg gaagaagcca 2521 ctggtaaaga cgggtgccca aaacccaaag atgatgagga gtttgttgtc tcatccgacg 2581 acatcaaggt tgagggcaag aaagggaaga acaagactgg tcgcggcaag aagcacacag 2641 ccttctcgag taaaggtctc agtgatgagg agtacgacga gtacaaaaga atcagggagg 2701 aaagaaacgg taaatactct atagaggagt acctccaaga cagagacaag tattatgagg 2761 aggttgccat cgccagggcg actgaagagg atttctgtga agaagaagag gccaagattc 2821 gacagagaat ctttaggcca acaaggaaac aacgcaagga ggagagggct tctctcggtc 2881 tagtcacagg ttctgagatc aggaaaagaa acccagatga cttcaaaccc aagggaaagt 2941 tgtgggccga tgacgacaga agtgtcgatt acaacgagaa gctcagtttt gaagccccac 3001 caagcatctg gtcaaggata gtcaactttg gttcgggttg gggtttctgg gtttccccca 3061 gtctgttcat aacatcaact catgttatac cccagggcac acaggagttc ttcggagtgt 3121 ccatcaaaca aatccagata cacaaatcag gcgaattctg ccgcctgagg ttcccaaaac 3181 caatcaggac tgatgtgaca ggcatgatct tagaggaggg tgcccccgaa ggcaccgtgg 3241 ccacattact catcaagaga ccaactggtg agctcatgcc tttagcagcc aggatgggca 3301 ctcatgcaac catgaagatc caaggtcgta ctgttggagg tcaaatgggt atgctcctga 3361 cagggtccaa tgccaaaagt atggacctgg gcaccacacc tggcgactgt ggttgtccct 3421 acatttacaa gagaggaaac gactacgtgg tcatcggggt tcacacggct gctgcccgag 3481 gagggaacac tgtcatatgt gccacccagg gaagcgaagg tgaggccaca cttgagggcg 3541 gtgacaacaa gggtacttac tgtggcgccc cgatcctagg tccaggaagt gccccaaagc 3601 tcagcaccaa aaccaaattc tggagatcat ccacagcacc actcccacct ggcacctatg 3661 aaccagccta cctcggtggt aaggacccca gaattaaggg tggcccctca ctacaacaag 3721 ttatgaggga ccagttgaaa ccattcacgg agccaagagg caaacaacca aagccaagtg 3781 tgttggaagc tgccaagaga accattatta acgtccttga acaaacaata gacccacctc 3841 agaaatggac gtttgcgcaa gcttgcgcgt ctcttgacaa gaccacctcc agtggtcatc 3901 cacaccacat acggaagaat gaatgctgga atggagattc cttcacaggc aagttggcag 3961 atcaggcctc aaaggccaac ttgatgtttg aggaggggaa gaacatgact ccagtttaca 4021 caggtgccct caaggatgag ttggttaaaa cagacaagat atatggcaag attaagaaga 4081 gactactttg gggatcggac ctggcaacca tgatccggtg tgcccgggcg ttcgggggcc 4141 taatggatga actcaaggcc cactgtgtca cactccctgt cagagttggt atgaacatga 4201 atgaggatgg ccccatcatc tttgaaaggc actccaggta taggtatcac tatgatgcag 4261 actattccag atgggactca acacaacaaa gagctatatt ggctgcggcc ctagagatca 4321 tggtcaaatt ctccccagag ccgcacttag cccagatagt tgcagaagac cttctatccc 4381 ccagcgtgat ggatgtgggt gatttcaaga tttcaatcaa cgagggcctc ccctctgggg 4441 tgccttgcac ttcacaatgg aactctattg cccactggct cctcaccctc tgtgctctct 4501 ctgaagtcac ggatctctcc cctgacatcg ttcaagccaa ttctctcttc tctttctatg 4561 gtgatgatga aattgtgagt acagatataa aattggaccc agaaaaactg acggcaaagc 4621 tgaaggaata cggactgaag ccaacccgcc ctgacaagac tgaaggaccc ctaatcattt 4681 ctgaggatct aaatggtttg accttcttgc gaaggactgt aacccgtgat ccagctggtt 4741 ggtttggtaa attggaacaa agttcaatac tcaggcagat gtactggacc aggggcccca 4801 accatgagga cccatctgaa acaatgatac cacattccca aaggcccacg cagttgatgt 4861 ctttgctggg tgaggctgca ctgcacggcc cagcattcta cagcaagatc agtaaactag 4921 tcatcacaga gttaaaggaa ggtggcatgg atttttacgt gcccagacaa gaaccgatgt 4981 tcaggtggat gagattctca gacctgagca cgtgggaggg cgatcgcaat ctggctccca 5041 gttttgtgaa tgaagatggc gtcgagtgac gccaacccat ctgatgggtc cgcagccaac 5101 ctcgtcccag aggtcaacaa tgaggttatg gctttggagc ctgttgttgg tgctgccatt 5161 gcagcacctg tagcaggcca gcaaaatata attgacccct ggattagaaa taattttgta 5221 caagcccctg gtggagaatt cacagtgtcc cctagaaacg ctccaggtga aatactgtgg 5281 agcgcgccct taggccctga tttgaacccc tatctctctc acctgtccag aatgtacaat 5341 ggttatgcag gtggttttga agtgcaagta atcctcgcgg ggaacgcgtt caccgccggg 5401 aaagtcatat ttgcagcagt tccaccaaat tttccaactg aaggcctaag ccccagtcag 5461 gttactatgt tcccccatat aattgtggat gttaggcaat tggaacctgt attgatcccc 5521 ctacctgatg ttaggaataa tttctatcat tataatcaag cacatgattc caccctcaaa 5581 ttgatagcaa tgttgtacac accactcaga gctaacaatg ccggggatga tgtctttaca 5641 gtctcttgtc gagtcctcac gagaccatcc cctgattttg attttatatt cctggtgcca 5701 cccacagttg aatcaagaac taaaccattt actgtcccaa tcttaactgt tgaggaaatg 5761 tccaattcaa gattccccat tcctttggaa aagctgtaca cgggtcctag cagtgctttt 5821 gttgtccaac cacaaaatgg cagatgcacg actgatggcg tgctcttagg cactactcag 5881 ctgtcagctg tcaacatctg caacttcagg ggggatgtca cccgtgttgg gaccagccat 5941 gattatacaa tgaatttggt gtcccaaaat tggaataatt atgacccaac agaagaaatc 6001 ccagctcccc tgggaacacc agactttgta ggaaagatcc aaggtttgct cacccaaact 6061 acaagagcgg acggctcgac ccgcgcccac aaggccacag tgagcaccgg gagtgtccac 6121 ttcactccaa agctgggtag tgttcaattc accaccgata cggacaatga tttcctaact 6181 ggccaaaaca caaaatttac cccagttggt gtcatccaag acggtgatca ccatcagaat 6241 gaaccccaac aatgggtact cccaaattac tcaggcacat ctggtcacaa tgtgcatctg 6301 gcccctgccg ttgcccccgc cttcccgggt gagcagcttc ttttctttag gtccactatg 6361 cccgggtgta gcgggtaccc caacatgaat ttggattgcc tactccccca ggagtgggtg 6421 ctgcactttt accaggaggc agccccagca caatccgatg tggccctgct gaggtttgtg 6481 aacccagaca caggtagggt tctgtttgag tgcaaactcc ataagtcagg ctatatcaca 6541 gtggctcata ctggcccata tgatttggtt atccccccca atggttactt cagatttgat 6601 tcttgggtca accagttcta cacactcgcc cccatgggaa atggaacggg gcgcaggcgt 6661 gcattataat ggctggacct ttctttgctg gattggcatc tgatgtcctc ggttctggac 6721 ttggttctct aatcagtgct ggagctggag ccatcaatca aaaagttgaa tttgaaaaca 6781 atagaaaatt gcaacaagct tcttttcaat tcagtagtaa tctgcaacag gcttccttcc 6841 agcatgataa agagatgctc caagcacaaa ttgaggctac tcaaaaattg cagcaagaat 6901 tgatgaaagt taaacaggca gtgctcctag agggtgggtt ttccacaaca gatgcggccc 6961 gtggggcaat caacaccccc atgacaaagg ctctggactg gagcggaaca aggtattggg 7021 ctcctgatgc caggatcaca acatacaatg caggccgctt ctccacccct cagcctttgg 7081 gagcattgcc tgggagaacc aatcctaggg ttcctgcatc tgctcggagc tcccccagtt 7141 cactttccaa tgctcctact gctacttctg tgtattccaa tcaaactgct tcaacgagac 7201 taggttcttc agctggttct ggtgccggtg tctcaagtct cccgtcaact gcaaggacta 7261 ggagctgggt tgaggaccaa aacaaaaatt tgtcgccttt catgaagggg gctctcaaca 7321 catcatttgt gacccctcca tctagtagat cctccaacca aggcacagtc tcaaccgtgc 7381 ctaaagagat tttggactcc tggactggcg cttttaatac gcgcaggcag cctctcttcg 7441 cacacattcg taagcgaggg gagtcacggg tgtaatgtga aaagacaaat tgattatctt 7501 tcttttcttt agtgtctttt //