Complete norovirus genomes

Accession | Genotype | P-type |
---|---|---|
MW305695 | GII.5 | GII.P22 |
ORF1: 1..5105
ORF2: 5086..6708
ORF3: 6708..7484

LOCUS MW305695 7519 bp RNA linear VRL 06-JUL-2021
DEFINITION Norovirus GII isolate C22 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION MW305695
VERSION MW305695.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE 1 (bases 1 to 7519)
AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I.
TITLE Genome-wide analyses of human noroviruses reveal coexistence of viral populations evolving under recombination constraints
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7519)
AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I.
TITLE Direct Submission
JOURNAL Submitted (23-NOV-2020) CBER/OVRR/DVP/LHV, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: High-performance Integrated Virtual Environment (HIVE) v.
2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7519 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="C22" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="French Guiana" /collection_date="01-Feb-1978" /note="genotype: GII.5[GII.P22]" gene <1..5105 /gene="ORF1" CDS <1..5105 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QPJ59421.1" /translation="TAGKTSTNSEIHNQDANPTSNLFANMTVGLKRALGARPKQSAPR VSSPNREGKLDTTPRAPSPNKGGKINIPPIPPPPPNGEDIVIKYTKQDDVVEGVPTVA TVDVPLTHNTAYSVPPLDQREKVDAKEPLTGSILEMWDGEIYHYGLYVEKGLVLGVHK PPAAISFARIELTPLSLYWRVVYTPPYLIAPETLRKLHGESFPYTAFDNNCYSFCCWV LDLNDSWLNRKFVSRTTGFYRPYQEWNRKPLPTMDEGKLKKVANVLLCGLSGLFTRPI KDLIGKLRPLNILNIITNCDWTFPGVVEALILLAELFGVFWTPPDISTFIASLLGDYE MQGPEDIAVEVVPVILGGIGMVLGFTKERIGRLLSSAASSLRACREIGNYGIEVVKLV MKWFFPTKDETNEMDMVRAIEDAVLDFEAIQNNHMTALLKDKNSLATFLRELDMEEEK ARKLSTKSASPDIIGTINALLARIAAARSLVHKAKEEMSSRMRPVVIMISGRPGIGKT HLARDLAKKMAATLSGDQRVGLVPRNGVDHWDAYKGERIVLWDDYGMSNPVTDALRLQ ELADTCPLTLNCDRIENKGKVFDSDTIIITTNLANPAPLDYVNFEACSRRIDFLVYAD SPSVDKAKRDFPGQPDMWKDAFKSDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTH SLIARATGLLHERLDEFDLQGPELPVYNFDHNRVAAFRRLAADNKYGLMETMKVGNKL KGVKNLEQLKDALKDISIRPCRIVYNSMAYDLDSNGSGRVNVKKAVDHKVQTNNELNS ALNNLRGARVRYYVKCVQEMVYSLLQIAGAAFVTSRMTRRLNISSIWARPEKQMEAAE EPPAPPTEDWTIIPANAAQEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERQ GKYSIEEYLQDRDRYYEELAIAKATEENFCEEEEIKIRQRVFRPTRKQRKEERATLGL VTGSEIRKRNPDDFKPKGKLWADDDRSVNYNEKIDFEAPPSLWSRIVNFGSGWGFWVS SNLFITSTHVIPPNMTEAFGVPIGQIQVHRSGEFCKMRFPKAIRPDVSGMILEEGAPE GTVVTILIKRTTGELMPLAARMGTHATMKIQGKMLGGQMGMLLTGSNAKNMDLGTIPG DCGCPYVYKRGNDWVVIGVHTAAARGGNTVICATQGPDGEATLEGGDNHGTYCAAPIL GPGSAPKLSTKTKFWRSSNAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTA PRGKPPRPAILEAAKETICNVLEQIIDPPLKWSYAQACASLDKTTSSGHPHHMKKNDN WNGESFTGVLADQASKANLMYEQGKHMQPVYTAALKDELVKTSKIYESIKKRLLWGSD LGTMVRCARAFGGLMDEMKANCVNLPIRVGMNMNEDGPIIFEKHSKYKYHYDADYSRW DSTQQRHILGAALEIMVRFSAEPELAQIVSEDLLAPSVVDVGDFKIAINEGLPSGVPC 
TSQWNSISHWLITLCAISEVSGLSPDVVQTNSCFSFYGDDEIVSTDIKLDPMKLTNKL KEYGLIPTRPDKTEGPLIIKENLEGLTFLRRDVCRDPAGWYGKLDQSSIMRQLFWTKG PNHEDPNETMIPHSQRPIQLMSLLGEAALHGPTFYKKVSKLVITELKEGGMDFYVPRQ EPMFRWMRFSDLSTWEGDRNLAPEGVNEDGVE" mat_peptide <1..1010 /gene="ORF1" /product="p48" mat_peptide 1011..2108 /gene="ORF1" /product="NTPase" mat_peptide 2109..2645 /gene="ORF1" /product="p22" mat_peptide 2646..3029 /gene="ORF1" /product="VPg" mat_peptide 3030..3572 /gene="ORF1" /product="Pro" mat_peptide 3573..5102 /gene="ORF1" /product="RdRp" gene 5086..6708 /gene="ORF2" CDS 5086..6708 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QPJ59422.1" /translation="MKMASNDAAPSNDGAAGLVPESSNEAMALEPVVGASLAAPVTGQ TNIIDPWIRTNFVQAPNGEFTVSPRNSPGEVLVNLELGPELNPYLGHLARMYNGYAGG MEVQVLLAGNAFTAGKIIFAAVPPYFPVENLSPSQITMFPHVIIDVRTLEPVLLPMPD VRSTLFHFNQKDEPKMRLVAMLYTPLRSNGSGDDVFTVSCRILTRPSPEFDFTYLVPP TVESKTKPFTLPVLTLGELSNSRFPLSIDEMVTSPNESIVVQPQNGRVTLDGELLGTT QLQACNICSIRGKVTGQVPNEQHMWNLEITNLNGTQFDPTDDVPAPLGVPDFAGEVFG VLSQRNRGESNPANRAHDAVVATYSDKYTPKLGLVQIGTWNTNDVENQPTKFTPIGLN EVANGHRFEQWTLPRYSGALTLNMNLAPAVAPLFPGERLLFFRSYVPLKGGFGNPAID CLVPQEWVQHFYQESAPSLGDVALVRYVNPDTGRVLFEAKLHKGGFLTVSSTSTGPVV VPANGYFRFDSWVNQFYSLAPMGTGNGRRRFQ" gene 6708..7484 /gene="ORF3" CDS 6708..7484 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QPJ59423.1" /translation="MAGAFIAGLAGDVLSNGLGSLINAGANAINQRAEFDFNRQLQQN SFNHDKEMLSAQIQATKQLQADMIAIKQGVLAAGGFSPTDAARGAVNAPMTQALDWNG TRYWAPGSVKTTSYSGKFVSMNPVRQVEFPQPKKSAAIPSSASSVSSGRTNLTNSTQS TVVSASSVPPSRGPSAPSTLSRATARTSNWVEEQNRNLEPYMKGALQTAFVTPPSSRA SSNGTVSTVPKGVLDSWTPAFNTRRQPLFAYLRKKGESQA" ORIGIN 1 cgaccgctgg caaaaccagc acaaacagtg aaattcacaa tcaagatgca aaccctacat 61 caaacctctt tgccaacatg actgttggct tgaagagggc cctgggggcg aggccaaaac 121 agtccgctcc cagggtctcc tcccccaaca gggagggaaa gctcgatacc actcccagag 181 ccccctctcc caacaaggga ggaaaaatca acatcccccc aataccaccc ccaccgccca 241 acggtgagga catagtcatc aaatacacca 
aacaagatga tgtagttgaa ggggtaccca 301 ccgtcgccac ggtggacgtt cctcttacac acaacaccgc ctacagtgtt ccccccttgg 361 accagaggga aaaggttgat gcgaaggaac ccctgacagg ttcgatactt gaaatgtggg 421 atggggaaat ctaccattat gggctatatg tggagaaggg cttggtcctt ggtgtgcata 481 aacccccagc cgccataagc tttgcacgca tagaactcac tcccctttct ctatactgga 541 gggtagtcta taccccacct tacctaatag ccccagaaac cctcaggaaa ctccatgggg 601 agtcgtttcc atacacagcc tttgacaata actgctattc cttctgctgt tgggtcctgg 661 atctgaacga ttcttggctc aataggaagt ttgtgtcaag gaccacgggt ttctacaggc 721 cctaccaaga gtggaacaga aaacccctac caactatgga tgagggaaag cttaagaagg 781 tggccaatgt gctcctatgt ggcctctctg ggctctttac taggcccatc aaggacctca 841 taggaaaatt gaggccgctg aacatactca acatcatcac caattgtgat tggaccttcc 901 ccggagttgt ggaggctctc atactacttg cagaactctt tggagtcttt tggacaccac 961 cagacatttc cacctttata gcatcactgc tgggtgatta tgaaatgcaa ggccctgagg 1021 atatcgcggt tgaggtcgtg ccggtgatcc tcggtggcat aggcatggtg cttggattca 1081 ccaaagaaag gatcggccga ctgttgagct ctgccgcgtc ttcgctcagg gcatgcaggg 1141 agattggtaa ctacgggata gaagtggtga aactggtgat gaagtggttt ttccccacca 1201 aggatgagac aaacgagatg gatatggtga gggcgataga ggatgctgtg ctcgattttg 1261 aagccataca gaacaatcat atgacagcat tactcaaaga caaaaacagt ctggcaacct 1321 tcctcaggga gttagatatg gaagaggaaa aggcgcggaa gctgtcaacg aaatctgcgt 1381 ccccagatat tattggcacc ataaatgccc ttctagccag gattgccgcc gctaggtccc 1441 tagtccacaa ggccaaggaa gaaatgtcat ctaggatgcg tcctgttgtc atcatgatct 1501 ccggtcgacc cgggatagga aaaacacact tggctcgtga cctggccaag aagatggcag 1561 cgaccctctc aggtgatcag agggtggggc tcgtaccgag gaatggtgtc gaccattggg 1621 atgcatataa aggagagagg attgtgcttt gggacgacta cggcatgagc aaccctgtga 1681 ccgacgcact gcggctccag gagttggccg acacatgccc cttgacactc aattgtgata 1741 gaatagaaaa caaggggaaa gtgtttgaca gtgacaccat aatcatcacc accaatctag 1801 ccaaccctgc tcctctagac tatgttaatt ttgaagcttg ctcccgccgt atcgatttcc 1861 ttgtctacgc agactcccct tcggttgaca aagcgaagag agacttccct ggtcaaccag 1921 acatgtggaa agatgccttc aaatcagact tttcacatat aaagcttcag 
ttggcgccac 1981 aagggggctt tgataagaac ggaaacaccc cccacgggaa aggtgtcatg aaaacgctga 2041 ccacacactc ccttattgcc cgcgccacgg gcctcctgca cgagagactg gatgagttcg 2101 acctgcaggg acctgagttg cctgtgtaca actttgacca caacagggtg gctgctttca 2161 ggagacttgc agctgacaac aagtatggcc ttatggaaac catgaaagtg ggcaacaaac 2221 tgaagggtgt caagaatctg gaacaactca aggatgctct aaaggacata agtatcagac 2281 cttgcaggat agtctacaac tcgatggctt acgacctcga ctcgaacggg tcagggagag 2341 tcaacgtcaa aaaggctgta gaccacaagg tccaaaccaa caacgaactc aactccgcgc 2401 tcaacaatct gcgtggcgcc agagtgagat actatgtcaa gtgtgttcaa gagatggtgt 2461 actctctcct ccagatcgct ggcgcggcct tcgtcacttc aaggatgaca cggagactga 2521 acatcagcag tatttgggcc aggccagaaa aacaaatgga ggccgctgag gaacccccag 2581 cccccccaac tgaggactgg actataatac cagcaaatgc agcccaggaa ggtaagaagg 2641 gtaagaacaa gagtggtaga ggcaagaaac acaccgcctt ctccagcaaa ggcctcagtg 2701 atgaagagta tgatgagtac aaaagaatca gagaagagag acagggaaaa tactctatag 2761 aagagtacct tcaggacagg gaccggtact atgaagaact agccatagcc aaagcaacag 2821 aagagaactt ctgtgaggag gaagaaataa aaattagaca gagggtcttc cgcccgacca 2881 ggaaacagcg taaggaagag agagcgactc tgggcctggt tacgggcagt gaaataagga 2941 agaggaaccc agatgacttc aagccaaagg gaaaactgtg ggctgatgat gaccgcagtg 3001 taaactacaa tgagaaaata gactttgaag cgccccccag cctctggtcc cgaatagtga 3061 actttggatc aggttggggc ttctgggtgt cttccaattt gttcatcaca tcaacacatg 3121 tcatcccccc aaacatgacc gaggcttttg gtgtaccaat aggtcaaata caagtgcaca 3181 ggagtgggga gttctgcaaa atgaggttcc ctaaagctat aagacctgat gtctcaggca 3241 tgatccttga ggaaggagcc cccgagggaa cagtggtgac cattctgatc aagagaacaa 3301 ctggtgaact catgccactg gcagcccgga tgggcactca cgccaccatg aaaatacagg 3361 ggaagatgct tggaggccaa atgggcatgc ttctcacagg gtccaatgcc aagaacatgg 3421 atcttggcac catccccggg gactgtggct gtccttatgt gtacaagaga ggcaatgatt 3481 gggtggtcat aggggtgcac accgcagctg ccagaggcgg caacacggtg atttgcgcca 3541 cccagggacc agatggtgag gccacgcttg aggggggaga caatcacggg acttactgtg 3601 ctgcaccaat acttggacca gggagtgcac caaagctcag caccaagacg aagttctgga 
3661 ggtcatccaa cgccccactc ccacctggta cttatgaacc cgcctaccta ggcggaaagg 3721 acccaagggt caagggaggg ccatcactgc agcaagtcat gagggaccaa ctcaagccct 3781 tcaccgcccc ccgagggaag ccaccacgac cggcgatcct ggaggctgct aaagagacca 3841 tctgcaatgt gcttgaacaa atcattgacc cacccctaaa atggtcttac gcccaagcct 3901 gtgcctcatt ggataagaca acatcaagtg gccaccctca tcatatgaaa aagaatgaca 3961 actggaacgg ggagtccttt actggcgtgc tagctgacca ggcctccaag gcaaacctca 4021 tgtatgaaca ggggaaacac atgcagcctg tgtacactgc tgccctcaag gatgagctcg 4081 ttaagaccag caaaatttat gagagcatta agaaaaggct cctatggggc tcagatctgg 4141 gcacaatggt tcgctgtgct agagcttttg gtggtcttat ggatgagatg aaagcaaatt 4201 gtgtgaactt accaattaga gttggcatga acatgaacga ggatgggccc attatttttg 4261 aaaaacattc aaagtataag taccattatg atgctgacta ctctaggtgg gactcaaccc 4321 aacagaggca catcttgggg gctgcccttg aaataatggt tagattctca gctgaacctg 4381 aattggcaca aattgtgtct gaggacctct tagcccccag cgtggtggac gttggtgact 4441 ttaaaattgc cataaatgag ggtttgccct ctggggtgcc gtgcacatca caatggaact 4501 caatctctca ttggctaata acattatgtg ccatatcaga agtctcaggg ttgtcgccag 4561 atgtggtgca aacaaactcc tgtttctctt tctacgggga tgatgagata gtcagcactg 4621 acatcaagtt agatcccatg aaactaacca acaagttgaa ggagtacggg ttgattccaa 4681 caagaccaga caagacagag gggccactca taattaaaga aaaccttgag ggtttgactt 4741 tcctgaggag ggacgtgtgc cgggacccag ctggttggta cggaaaactt gatcaatcct 4801 caatcatgag acaattgttc tggaccaagg gacccaatca tgaagacccc aatgaaacaa 4861 tgattccaca ctctcagagg cccatacaac taatgtcact gctaggggag gcagcactgc 4921 atggaccaac tttctacaag aaagtcagca aattggttat cactgagctc aaagagggtg 4981 ggatggattt ctacgtgcca agacaggaac ctatgttcag gtggatgaga ttctctgatc 5041 tcagcacttg ggagggcgat cgcaatcttg ctcccgaagg tgtgaatgaa gatggcgtcg 5101 aatgacgccg ctccatcaaa tgatggtgcc gccggcctcg tgccagaaag tagtaatgag 5161 gcaatggctc tggaacccgt ggtgggggca tctttagccg cccctgtcac tggccaaact 5221 aatataatag acccctggat tagaactaat tttgtccaag cccctaatgg tgaattcaca 5281 gtttcccctc gaaactcccc tggagaggta ttggtcaatt tggagctagg tccagaactg 5341 
aacccttatc tgggacactt agctaggatg tacaatggtt atgcgggtgg tatggaagtg 5401 caagtgttgc tcgcagggaa cgcgttcact gctggcaaga tcatctttgc tgccgtgcca 5461 ccttactttc cagtggaaaa tcttagccct tctcaaataa caatgttccc acatgtgatt 5521 attgatgtta gaaccttgga acctgtatta ctcccaatgc ctgatgttag aagtaccttg 5581 ttccatttta atcaaaagga tgagcctaag atgagacttg ttgctatgct ttatacccct 5641 cttcgttcta atggttctgg tgacgatgtt ttcaccgtct cgtgtaggat ccttactagg 5701 ccctcccctg aatttgattt cacatattta gtaccaccaa cagtagaatc aaaaaccaaa 5761 ccattcacac tacctgtact aacactggga gaattgtcta actctagatt tcccctctct 5821 attgatgaaa tggttaccag ccccaatgag tccatagttg tccagccaca gaatggtagg 5881 gttacgctgg atggggagct gttgggcaca actcaattac aagcatgcaa catttgctct 5941 ataaggggga aggtaacagg gcaggttcct aatgaacaac atatgtggaa cctagagatc 6001 acaaacctaa atgggacaca atttgacccc acagatgatg tcccagcccc ccttggtgtg 6061 cccgactttg caggtgaggt ttttggtgta ctcagccaga gaaatagagg tgagagcaat 6121 ccagcaaaca gggctcatga tgctgtcgtg gctacctaca gtgacaagta cacccccaaa 6181 ctgggcttag tgcaaattgg aacttggaac accaatgatg ttgaaaatca gccaacaaaa 6241 ttcaccccaa ttggtttgaa tgaggttgcc aatggccatc gatttgaaca gtggactttg 6301 cctaggtatt ctggtgccct aacactaaac atgaatttag cccctgctgt ggctccgctc 6361 tttcctggag agcgtctcct tttcttccgc tcttatgttc cattaaaagg tggatttggt 6421 aaccctgcta tagattgtct ggtgcctcaa gagtgggttc aacacttcta tcaggagtct 6481 gccccctctc tgggggatgt ggccttggtc aggtacgtca acccagacac cgggcgcgtc 6541 ctttttgagg ccaaactcca caagggtgga ttcctgactg tgtcaagcac tagcacaggg 6601 cctgttgtgg tgccagccaa tggctatttc agatttgatt cttgggttaa tcaattttac 6661 tctcttgccc ccatgggaac tgggaatggg cgcagaagat ttcagtaatg gcaggagctt 6721 ttatagcagg tttagcaggt gatgtgctta gtaatggcct cggttcgctg attaatgccg 6781 gagccaatgc tatcaatcaa agagcagaat ttgattttaa taggcaattg caacaaaatt 6841 ctttcaatca tgataaagaa atgttaagtg cccaaattca agcaacaaaa cagttgcagg 6901 ctgatatgat agcaatcaaa cagggagtcc tggctgctgg aggtttctcc ccaactgatg 6961 cagccagagg agctgttaac gcacccatga cccaagcact agattggaat ggcactaggt 7021 actgggcccc 
tggctcagtg aagaccacca gctactctgg aaagtttgtg tccatgaatc 7081 ctgttaggca ggttgaattc ccgcagccca aaaagagtgc ggccatacca tcaagtgcca 7141 gctccgtgtc ttctggtaga actaatttga caaattccac tcaatcaact gttgttagtg 7201 cttcctcagt gccaccttct aggggcccct ctgctccttc tactctgtca agggccacag 7261 ccagaacttc caattgggtt gaggaacaaa acaggaattt ggaaccctac atgaagggtg 7321 cccttcaaac agcatttgtc acccccccct ctagccgggc atctagtaat ggaacagtct 7381 caactgtgcc aaaaggtgtt ttggactcct ggacacctgc gttcaacacc cgcaggcagc 7441 cgcttttcgc ttaccttcgt aagaaggggg agtcacaagc ttagtgaaaa gataattaat 7501 gattaataat tgatctttt //
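
The ORF coordinates listed for this record (ORF1: 1..5105, ORF2: 5086..6708, ORF3: 6708..7484) and the `/note="genotype: GII.5[GII.P22]"` qualifier can be checked with a little arithmetic. The following is a minimal Python sketch, not part of the typing tool itself: the helper names `span_length`, `overlap_length` and `parse_dual_type` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, as is usual for GenBank feature locations.

```python
import re

# Coordinates copied from the ORF summary of record MW305695.
# Assumption: 1-based, inclusive positions (GenBank feature-location style).
ORFS = {
    "ORF1": (1, 5105),
    "ORF2": (5086, 6708),
    "ORF3": (6708, 7484),
}


def span_length(span):
    """Number of nucleotides covered by an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def overlap_length(a, b):
    """Number of nucleotides shared by two inclusive spans (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


def parse_dual_type(note):
    """Split a note like 'genotype: GII.5[GII.P22]' into (genotype, P-type).

    Hypothetical helper: returns None when the note does not match.
    """
    match = re.search(r"genotype:\s*([^\[\s]+)\[([^\]]+)\]", note)
    if match is None:
        return None
    return match.group(1), match.group(2)


if __name__ == "__main__":
    for name, span in ORFS.items():
        print(name, span, span_length(span), "nt")
    print("ORF1/ORF2 overlap:", overlap_length(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap_length(ORFS["ORF2"], ORFS["ORF3"]), "nt")
    print(parse_dual_type("genotype: GII.5[GII.P22]"))
```

Under these assumptions, ORF2 spans 1623 nt and ORF3 spans 777 nt, both multiples of three, as expected for complete coding sequences. ORF1 and ORF2 share 20 nt (5086..5105), and ORF2 and ORF3 share the single position 6708.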