![]() |
![]() |
![]() |
![]() |
|
Typing tool
|
Complete norovirus genomes
NC_029646 | GII.12 | ![]() |
|
---|---|---|---|
GII.P12 |
ORF1: 5..5104 ORF2: 5085..6692 ORF3: 6692..7471LOCUS NC_029646 7518 bp RNA linear VRL 23-NOV-2018 DEFINITION Norovirus GII nonstructural polyprotein (ORF1), VP1 (ORF2), and VP2 (ORF3) genes, complete cds. ACCESSION NC_029646 VERSION NC_029646.1 DBLINK BioProject: PRJNA485481 KEYWORDS RefSeq. SOURCE Norwalk-like virus ORGANISM Norwalk-like virus Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 AUTHORS Hansman,G.S., Doan,L.T., Kguyen,T.A., Okitsu,S., Katayama,K., Ogawa,S., Natori,K., Takeda,N., Kato,Y., Nishio,O., Noda,M. and Ushijima,H. TITLE Detection of norovirus and sapovirus infection among children with gastroenteritis in Ho Chi Minh City, Vietnam JOURNAL Arch. Virol. 149 (9), 1673-1688 (2004) PUBMED 15593412 REFERENCE 2 (bases 1 to 7518) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (11-MAR-2016) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 7518) AUTHORS Abe,K., Noda,M., Ikeda,Y., Kamimura,M., Fujii,A., Yamaoka,K. and Ogino,T. TITLE Direct Submission JOURNAL Submitted (06-JUN-2000) Katsuhiko Abe, Hiroshima city Institute of public health, Division of Biological Science; Nishi-ku Shoko-center 4-1-2, Hiroshima city, Hiroshima 733-8650, Japan (E-mail:ei-seibutsu@city.hiroshima.jp, Tel:81-82-277-6998, Fax:81-82-277-0410) COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence is identical to AB044366. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..7518 /organism="Norwalk-like virus" /mol_type="genomic RNA" /strain="Hu/Norovirus/hiroshima/1999/JP(9912-02F)" /db_xref="taxon:95340" 5'UTR 1..4 gene 5..5104 /gene="ORF1" /locus_tag="NoVGII_gp1" /db_xref="GeneID:27042434" CDS 5..5104 /gene="ORF1" /locus_tag="NoVGII_gp1" /note="N-terminal, helicase, VPg, 3Cpro, RdRp, orf1, RNA polymerase" /codon_start=1 /product="nonstructural polyprotein" /protein_id="YP_009237897.1" /db_xref="GeneID:27042434" /translation="MKMASNDASAAAAANSNNDTAKSSSDGMLSSMAVTFKRALGARP KQPPPREIPQRPPRPPTPELVKKIPPPPPNGEDEPVVSYSVKDGVSGLPELTTVRQPG ETNTAFSVPPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAIS LAKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS WLCRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCALSSLFTRPIKDIIGK LRPLNILNILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KREEANELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDLEEEKARKLST KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLAREL AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPEVEK AKRDFPGQPDMWKNAFSPDFSHIKLMLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA SGLLHERLDEYELQGPTPTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGKQLKDVRTM PELRQALKNISIKSCQIVYGGCTYMLESDGKGDVKVDRVQNATVQTNNELAGALHHLR CARIRYYVKCIQEALYSIIQIAGAAFVTTRIVKRMNIQDLWSKPQVEDTEETASKDGC PKPKDDDEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV TGSEIRKRNPDDFKPKGKLWADDDRSVDYNERLNFEAPPSIWSRIVNFGSGWGFWVSP SLFITSTHVIPQGAQEFFGVSIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG TVVTLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD CGCPYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILG PGSAPKLSTKTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP RGKPPKPSVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL ATMIRCARAFGGLMEELKAHCVTLPVRVGMNMNEDGPIIFERHSRYKYHYDADYSRWD STQQRAVLAAALEIMVKFSPEPNLAQKVAEDLLSPSVMDVGDFKISINEGLPSGVPCT SQWNSIAHWLLTLCALSEVTNLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP NHEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide 5..994 /gene="ORF1" /locus_tag="NoVGII_gp1" /product="p48" /note="N-terminal leader p48; 2A2-2B-like membrane protein; distant homolog of H-rev107-like parechovirus 2a2, a putative regulator of cell proliferation; the c-terminal membrane-binding portion contributes to the Golgi disassembly and, therefore, functionally similar to the picornavirus 2B protein" /protein_id="YP_009238492.1" mat_peptide 995..2092 /gene="ORF1" /locus_tag="NoVGII_gp1" /product="NTPase" /note="2C-L; probable ortholog of the 2C protein of picornaviruses; the calicivirus NTPase was found in membranous replication complexes" /protein_id="YP_009238487.1" mat_peptide 2093..2629 /gene="ORF1" /locus_tag="NoVGII_gp1" /product="p22" /note="3A-L; located in the polyprotein between NTPase and VPg; second most variable region of the calicivirus polyprotein; the FCV ortholog was detected in membranous replication complexes" /protein_id="YP_009238488.1" mat_peptide 2630..3028 /gene="ORF1" /locus_tag="NoVGII_gp1" /product="VPg" /note="For Southampton calicivirus, both N-terminal and C-terminal cleavage sites have been confirmed by direct sequencing. In caliciviruses, VPg can also exist in the form of precursor with the upstream protein which is thought to function as a membrane anchor" /protein_id="YP_009238489.1" mat_peptide 3029..3571 /gene="ORF1" /locus_tag="NoVGII_gp1" /product="Pro" /note="Chymotrypsin-like cysteine proteinase; For Southampton calicivirus, both N-terminal and C-terminal cleavage sites have been confirmed by direct sequencing. The FCV proteinase (Pro) can exist in the form of a stable Pro-Pol precursor" /protein_id="YP_009238490.1" mat_peptide 3572..5101 /gene="ORF1" /locus_tag="NoVGII_gp1" /product="RdRp" /note="The FCV polymerase (Pol) can exist in the form of a stable Pro-Pol precursor; RNA-dependent RNA polymerase" /protein_id="YP_009238491.1" gene 5085..6692 /gene="ORF2" /locus_tag="NoVGII_gp2" /db_xref="GeneID:27042432" CDS 5085..6692 /gene="ORF2" /locus_tag="NoVGII_gp2" /note="ORF2; major capsid protein" /codon_start=1 /product="VP1" /protein_id="YP_009237898.1" /db_xref="GeneID:27042432" /translation="MKMASSDAAPSNDGAAGLVPEANNETMALEPVAGASIAAPLTGQ NNIIDPWIRLNFVQAPNGEFTVSPRNSPGEVLLNLELGPELNPYLAHLSRMYNGYAGG VEVQVLLAGNAFTAGKLVFAAVPPHFPLENISPGQITMFPHVIIDVRTLEPVLLPLPD VRNNFFHYNQQNEPRMRLVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFNYLVPP TVESKTKPFTLPILTIGELTNSRFPVPIDELYTSPNESLVVQPQNGRCALDGELQGTT QLLPTAICSFRGRINQKVSGENHVWNMQVTNINGTPFDPTEDVPAPLGTPDFSGKLFG VLSQRDHDNACRSHDAVIATNSAKFTPKLGAIQIGTWEEDDVHINQPTKFTPVGLFED GGFNQWTLPNYSGALTLNMGLAPPVAPTFPGEQILFFRSHIPLKGGVADPVIDCLLPQ EWIQHLYQESAPSQSDVALIRFTNPDTGRVLFEAKLHRSGYITVANTGSRPIVVPANG YFRFDSWVNQFYSLAPMGTGNGRRRVQ" gene 6692..7471 /gene="ORF3" /locus_tag="NoVGII_gp3" /db_xref="GeneID:27042433" CDS 6692..7471 /gene="ORF3" /locus_tag="NoVGII_gp3" /note="ORF3; minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="YP_009237899.1" /db_xref="GeneID:27042433" /translation="MAGAFIAGLAGDIVTNGIGSLVNAGANAINQKVDFENNKQLQQA SFNHDKEMLQAQVQATKQLQADMIAIRQGVLTAGGFSPTDAARGAVNAPMTQVLDWNG TRYWAPGATKTTTFSGGFTNVSHARTVDLTKKTSATPAPAPVSRPSSVASTVSTRSTL ISGSSNPSSLARSSSSVSSQPTSSSSRTSEWVRSQNRALEPYMRGALRTAYVTPPSSR ASSNGTVSTVPKEVLDSWTSAFNTHRQPLFAHLRRRGESQV" 3'UTR 7472..7518 ORIGIN 1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gctgctaaca gcaacaacga 61 caccgcaaaa tcttcaagtg acggaatgct ttctagtatg gctgtcacat ttaaacgagc 121 cctcggggca cggcctaaac agcctccccc gagggaaata ccacaaagac ccccacgacc 181 acccacccca gaactggtca aaaagatccc tcctcctcca cccaacgggg aggatgaacc 241 agtggtttct tacagcgtca aagatggcgt ttccggcttg cctgagctta ccactgtcag 301 gcagccgggt gaaaccaaca cggcgttcag tgttccccca ctcaatcaaa gggagaatag 361 ggacgccaag gagccactaa ctggaacaat cctggaaatg tgggacgggg agatctacca 421 ttacggcctg tatgtggaac gaggtcttgt acttggtgtg cacaaaccac cggctgccat 481 cagcctcgcc aaggttgaac taacaccact ctctctgttt tggagaccag tgtatacacc 541 acagtatctc atctctccgg acactctcag gagactgcac ggagagtcgt ttccctacac 601 agcctttgat aacaactgct atgccttctg ttgttgggtc ctggacctaa acgactcgtg 661 gttgtgcagg agaatgatcc agaggacaac tggtttcttc aggccctacc aagactggaa 721 taggaaaccc cttcccacca tggatgactc caagttgaag aaggtagcta acatattctt 781 gtgcgcgcta tcttcgctat tcactaggcc catcaaagac ataataggaa agttgaggcc 841 tctcaacatc cttaacatct tggcttcatg tgattggact tttgcaggca tagtggaatc 901 tttgattctc ttggcagagc tctttggagt tttctggaca cccccagatg tgtctgcgat 961 gatcgcccct ttactaggtg actacgagct gcaggggccc gaggaccttg cagtggaact 1021 cgttccaata gtgatggggg ggattggttt ggtgctaggg ttcaccaaag agaagatcgg 1081 gaaaatgttg tcatctgctg catccacctt aagagcttgt aaagaccttg gtgcatacgg 1141 actggaaatc ttaaaattgg tcatgaagtg gttcttccca aagagagagg aagcaaatga 1201 gctggctatg gtgaggtcca tcgaggatgc ggtgttggac ctcgaggcaa ttgaaaacaa 1261 ccacatgact gccctcctca aagacaaaga cagcctggca acctatatga gaactcttga 1321 cctcgaggag gagaaagcca gaaagctttc gaccaagtct gcttcacctg atatcgtggg 1381 cacaatcaac gctctcctgg cgagaatcgc cgctgcacgc tccctggtgc atcgggcgaa 1441 agaggagctc tccagcagac caagacctgt tgttgtgatg atatcaggta agccagggat 1501 agggaaaacc caccttgcca gggaattggc caagaaaatc gcagcttctc tcacagggga 1561 ccagcgtgtg ggtcttatcc cgcgcaatgg tgttgatcac tgggacgcat ataagggaga 1621 aagagtcgtt ctatgggacg actatggaat gagtaacccc atccacgacg ccctcaggtt 1681 acaagaactt gctgacacct gccccctcac gctaaattgt gataggattg agaataaagg 1741 aaaggtcttt gacagtgatg ccataatcat caccactaac ctggccaacc cagcaccact 1801 ggactatgtc aattttgaag catgctcgag gcgtatcgac ttcctcgtgt atgcagatgc 1861 ccctgaagtc gagaaggcga aacgtgattt cccaggtcaa cctgacatgt ggaagaacgc 1921 tttcagtcct gacttctcgc acataaaact gatgctggct ccgcagggtg gcttcgacaa 1981 gaacggaaac accccacatg ggaaaggcgt catgaaaacc ctcactactg gttccctcat 2041 cgctcgagca tcagggctac tccatgagag gttagatgag tacgagttac agggcccaac 2101 ccccactacc ttcaactttg accgcaacaa ggtgcttgcg ttcagacagc ttgctgctga 2161 aaacaagtac gggttgatgg acacaatgag agtcggaaaa cagctcaagg atgtcaggac 2221 catgccagag ctcagacaag cactcaagaa catctcaatc aagagttgcc agatagtgta 2281 tggtggctgc acctatatgc ttgagtctga tggcaagggt gatgtgaaag ttgacagagt 2341 tcagaacgcc actgtgcaga ccaacaatga actggccggc gccctacacc atcttaggtg 2401 tgccaggatt agatattatg tcaagtgcat tcaggaggcc ctgtattcca tcatccaaat 2461 tgctggagct gcatttgtca ccacgcgcat tgtcaagcgc atgaacatac aagacctttg 2521 gtccaagcca caggtggaag atacagagga gactgctagc aaggatgggt gcccaaaacc 2581 caaggatgat gacgagttcg ttgtttcatc cgacgacatc aaaaccgagg gcaagaaagg 2641 aaagaacaag tctggccgtg gtaagaagca cacagcattc tcaagcaaag gcctcagtga 2701 tgaggagtac gatgagtaca aaagaatcag agaagaaaga aacggcaagt actctataga 2761 ggaatacctt caggacagag ataagtatta tgaggaggtg gccatcgcca gggcgaccga 2821 agaggacttc tgtgaagaag aagaggccaa gatccgacaa aggattttta ggccaacaag 2881 gaagcaacgc aaagaggaga gggcctctct cggcttggtc acaggttctg aaatcaggaa 2941 gaggaaccca gacgacttca agcctaaagg aaagctgtgg gctgatgacg acaggagtgt 3001 tgactacaat gagagactca attttgaagc cccaccaagc atttggtcga ggatagtcaa 3061 ctttggttca ggttggggtt tttgggtttc ccccagcctg ttcataacat caactcatgt 3121 cataccccag ggcgcacagg agttctttgg ggtttccatc aagcaaattc agatacacaa 3181 atcgggtgaa ttctgtcgct tgaggtttcc aaaaccaatc agaactgatg tgacaggcat 3241 gatcctagaa gaaggtgcgc ccgaagggac cgtggtcaca ttactcatca agagaccaac 3301 tggagaactc atgcccttgg cagccagaat gggaacccat gcaaccatga agatacaagg 3361 gcgcactgtt gggggtcaaa tgggcatgct cctaacagga tctaacgcca agagtatgga 3421 cctgggcacc acaccaggtg actgtggctg tccctacatt tacaagagag ggaatgacta 3481 catagtcatt ggagtccaca cggctgctgc ccgtggagga aacactgtca tatgtgccac 3541 ccaggggagc gagggggaag ccacacttga aggcggtgac aacaagggaa cctactgcgg 3601 tgcaccaatc ttaggtccag ggagtgcccc aaagctcagc accaagacta agttttggag 3661 atcatccaca gcaccactcc cacctggtac ctatgaacca gcctatcttg gcggcaagga 3721 ccccagagtc aagggtggcc cctcattgca acaagttatg agggaccagc tgaaaccatt 3781 cactgagccc aggggcaaac caccaaaacc aagtgtgtta gaggctgcca agaaaaccat 3841 catcaatgtt cttgaacaaa caattgaccc acctcaaaaa tggtcattcg cgcaggcatg 3901 cgcatccctc gacaagacca cttccagcgg ccacccgcac cacatgcgga agaacgactg 3961 ctggaacggg gagtccttca caggcaaatt ggcagaccag gcttccaagg ctaacctgat 4021 gttcgaagag ggaaagaaca tgaccccagt ctacacaggt gcgcttaagg acgagctggt 4081 caagactgac aaaatttatg gcaagatcaa aaagaggctt ctctggggct cggatctggc 4141 aaccatgatc cggtgtgctc gagcgtttgg aggcctgatg gaggaactca aagcacattg 4201 tgtcacacta cccgtcagag taggtatgaa tatgaatgag gatggcccta tcatctttga 4261 gaggcactcc agatataagt atcattatga tgctgattac tcccggtggg actcaacaca 4321 acaaagagcc gtgttagcag cagccttaga aatcatggtt aagttctccc cagaaccgaa 4381 tctggcccaa aaggttgcag aagaccttct ctctcccagc gtgatggacg taggtgactt 4441 caaaatatca atcaatgagg gcctcccctc cggggtgccc tgcacctccc aatggaattc 4501 catcgcccac tggctcctca ccctctgtgc gctttctgag gttacaaacc tgtcccctga 4561 cattatccag gctaattccc tcttttcctt ctacggtgat gatgaaattg tgagcacaga 4621 cataaaattg gacccagaga agttgacagc aaaacttaag gaatacgggt tgaaaccgac 4681 ccgccctgac aagactgagg gaccccttgt tatctctgag gacctggatg gcctaacctt 4741 cctgcggagg actgtgaccc gcgacccagc tggctggttt ggaaagctgg aacagagctc 4801 aatacttagg caaatgtatt ggactagggg ccctaaccat gaagacccat ctgaaacaat 4861 gataccacac tcccaaagac ccatacaatt gatgtctttg ctgggcgagg ctgcactcca 4921 cggcccagca ttctacagca aaatcagcaa gctggtcatt gcagagctga aggaaggtgg 4981 catggatttt tacgtgccca gacaagagcc aatgttcaga tggatgaggt tttcagatct 5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga 5101 gtgacgccgc tccatctaat gatggtgcag ccggtcttgt accagaggct aacaatgaga 5161 ccatggcact tgaaccggtg gctggggctt caatagccgc cccactcacc ggccaaaaca 5221 atattataga cccctggatt agattaaatt ttgtgcaggc tcccaatgga gagttcacgg 5281 tttcaccccg caactcgccc ggggaagtcc tattaaattt ggaattaggc cccgaactaa 5341 atccatacct agcacacctc tctagaatgt ataatggtta tgcgggtggg gttgaggtgc 5401 aagtactact ggctgggaat gcgttcacag ctggaaaact ggtgtttgcc gcagttcccc 5461 ctcattttcc attagaaaat ataagccctg gccagataac tatgttccct catgtaatta 5521 ttgatgttag gactttagaa ccagttttgt tgccccttcc tgatgttagg aataatttct 5581 ttcattataa tcagcagaat gaaccgagga tgagacttgt agcaatgctt tatactcctc 5641 ttagatctaa tggttctggt gatgatgtat tcactgtctc ctgcagggtg cttacccgac 5701 cttcccctga ttttgatttt aattatttgg tcccccctac tgttgaatct aaaaccaaac 5761 ccttcacact ccctatcttg actatagggg agttaaccaa ctccaggttc cctgtaccta 5821 tagatgagct ctacaccagt cccaatgaga gtctggtggt gcaaccccag aacgggagat 5881 gcgcactaga tggggagctg cagggcacga ctcagctcct ccccacggcg atctgctcgt 5941 tcaggggccg gatcaatcag aaggtgagtg gggaaaacca tgtttggaat atgcaggtca 6001 ccaacatcaa cgggacccct tttgatccaa cagaggatgt cccggctcct ctaggcactc 6061 cagatttctc tggcaagctc tttggtgtac taagccagag agaccatgat aatgcctgca 6121 ggagtcatga tgcagtaatt gcaaccaact ctgccaaatt taccccaaaa ttgggcgcta 6181 tacaaattgg cacatgggaa gaagacgatg tgcacatcaa ccaacctact aagtttactc 6241 cagttggctt gtttgaagat ggaggtttca accagtggac actccccaat tattctggag 6301 ccttaacact taatatggga ttggcccctc ctgtggcccc caccttccct ggtgaacaaa 6361 ttcttttctt tagatcccac attcctctta aaggaggtgt ggcggaccca gttattgatt 6421 gtctcttgcc tcaagaatgg atccaacatc tttaccaaga gtcggccccc tcacaatcag 6481 atgtagcact gattaggttt acaaatccag acacaggacg tgttctattt gaagcaaaat 6541 tacacaggag tggttacatc acagtggcca atactggtag cagaccgatt gtggtaccag 6601 ctaatggtta cttcaggttt gactcttggg ttaatcaatt ctattctctt gcccccatgg 6661 gaactggaaa tgggcgcaga agggtgcagt aatggctgga gcttttatag cggggcttgc 6721 tggtgacata gtcaccaatg gcattggctc acttgtgaac gctggggcta atgcaataaa 6781 tcaaaaagta gactttgaaa acaacaagca actacagcag gcttctttca accatgataa 6841 agagatgctg caagctcaag tccaggccac caaacagctg caggctgata tgattgcaat 6901 cagacaaggg gtgttgaccg cgggcggctt ctcccccact gatgcagcaa gaggggcagt 6961 taatgcacct atgactcagg ttttagactg gaatgggacc agatattggg cccccggagc 7021 cacgaaaacc actactttct ccggtggatt caccaatgtt tctcatgcca gaactgtcga 7081 cctgaccaag aagacatcag ccacaccagc tcctgcgcct gtttccagac ctagctctgt 7141 cgcctctaca gtctccaccc gctcaacctt gattagcggg tcttccaatc cttcttcttt 7201 agctaggagt tcttctagtg tttcttctca acccacctcc tcctcttctc ggaccagtga 7261 gtgggtgcgc agccaaaaca gggcactgga gccttacatg aggggagcgc tacgcacagc 7321 ctatgtgacg cctccctcta gtagagcttc tagtaatggc acagtctcaa ccgtgccaaa 7381 agaggttttg gactcctgga catctgcatt taacacccac agacaaccgc tattcgctca 7441 tctccgtcgg agaggggagt cacaagttta gtgaaaagat aatctttatt ttctttcctt 7501 tgaagatttt tgtctttt //