Typing tool
|
Complete norovirus genomes
MH638228 | GI.1 | ||
---|---|---|---|
GI.P1 |
ORF1: 1..5364 ORF2: 5348..6940 ORF3: 6940..7578LOCUS MH638228 7598 bp ss-RNA linear VRL 19-JUL-2019 DEFINITION Norovirus GI isolate ESP20695 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MH638228 VERSION MH638228.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7598) AUTHORS Pan,R.W., Koster,B.L., Vo,S., Balansay,M.S., Hayes,M.E., Graf,P.C. and Myers,C.A. TITLE Acute Gasteoenteritis Surveillance Program JOURNAL Unpublished REFERENCE 2 (bases 1 to 7598) AUTHORS Pan,R.W., Koster,B.L., Vo,S., Balansay,M.S., Hayes,M.E., Graf,P.C. and Myers,C.A. TITLE Direct Submission JOURNAL Submitted (18-JUL-2018) Operational Infectious Diseases, Naval Health Research Center, 140 Sylvester Road, San Diego, CA 92106, USA COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.11.1 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7598 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="ESP20695" /host="Homo sapiens" /db_xref="taxon:122928" /country="USA" /collection_date="07-Nov-2017" /note="genotype: GI.P1" gene <1..5364 /gene="ORF1" CDS <1..5364 /gene="ORF1" /inference="ab initio prediction:Prodigal:2.60" /inference="similar to AA sequence:UniProtKB:Q83883" /codon_start=1 /product="nonstructural polyprotein" /protein_id="AXQ39990.1" /translation="MASKDVVPAAASSENANNNSSIKSRLLARLKGVSGTTTPPNSIQ ITNQSMALGLIGQAPAPKATAVETPKQQRDRPPRTAAEVQQNLCWTEKPLDQNVKAWD ELDHTTKQQILDEHAEWFDAGGLGPSTLPSNHERHSNDDSGGHQVKWSAKEGVNLGIG GLTTVPGPEWNMCPLPPADQRSTTPAVEPIIGDMIEFYEGHIYHYAIYIGQGKTVGVH SPQAAFSITRITIQPISAWWRICYVPQPKQRLTYDQLKELENEPWPYAAVTNNCFEFC CQVMCLDDTWLQRKLISSGRFHHPTQDWSRDTPEFQQDSKLEMVRDAVLAAINGLVSR PFKDLLGKLKPLNVLNLLSNCDWTFMGVVEMVVLLLELFGVFWNPPDVSNFIASLLPD FHLQGPEDLARDLVPVVLGGIGLAIGFTRDKVGKMMKNAVDGLRAATQLGQYGLEIFS LLKKYFFGGDQTEKTLKDIESAVIDMEVLSSTSVTQLVRDKQSARAYMAILDNEEEKA RKLSVRNADPHVVSSTNALISRISMARAALAKAQAEMTSRMRPVVIMMCGPPGIGKTK AAEHLAKRLANEIRPGGKVGLVPREAVDHWDGYHGEEVMLWDDYGMTKIQEDCNKLQA IADSAPLTLNCDRIENKGMQFVSDAIVITTNAPGPAPVDFVNLGPVCRRVDFLVYCTA PEVEHTRKVSPGDTNALKDCFKPDFSHMKMELAPQGGFDNQGNTPFGKGTMKPTTINR LLIQAVALTMERQDEFQLQGPTYDFDTDRVSAFTKMARANGIGLISMASLGKKLRGVS TIEGLKNALLGYKIAKCSIQWQSRVYIIESDGGNVHIKEDKQALTPSQQAINTASLAI TRLKAARAVAYASCFQSAVTTILQMAGSALVINRAVKRMFGTRTAAMALEGPEKEHNC RVHKAKEAGKGPIGHDDMIEKFGLCETEEEESEDQIQITPNDAIPEGKNKGKTKKGRG RKNNYNAFSRRGLSDEEYEEYKKIREEKNGNYSIQEYLEDRQRYEEELAEVQAGGDGG IGETEMEIRHRVFYKSKSRKHQQEQRRQLGLVTGSDIRKRKPIDWTPPKNEWADDDRE VDYNEKINFEAPPTMWSRVTKFGSGWGFWVSPTVFITTTHVVPTGVKEFFGEPLTNIA IHQAGEFTQFRFSKKIRPDLTGMVLEEGCPEGTVCSVLIKRDSGELLPLAVRMGAIAS MRIQGRLVHGQSGMLLTGANAKGMDLGTIPGDCGAPYVHKRGNDWVVCGVHAAATKSG NTVVCAVQASEGETALEGGDKGHYAGHEIVRHGNGPALSTKTKFWRSSPEPLPPGVYE PAYLGGKDPRVQNGPSLQQVLRDQLKPFAEPRGRMPEPGLLEAAVETVTSMLEQTMDT PSPWSYADACQSLDKTTSSGFPYHKRKNDDWNGTTFVRELGDQAAHANNMYENGKHMK PIYTAALKDELVKPEKVYQKIKKRLLWGADLGTVIRAARAFGPFCDAIKPHVIKLPIK VGMNIIEDGPLIYAEHAKYKNHFDADYTAWDSTQNRQIMTESFSIMSRLTASPELAEV VAQDLLAPSEMDVGDYVIRVKEGLPSGFPCTSQVNSINHWIITLCALSEATGLSPDVV QSMSYFSFYGDDEIVSTDIDFDPARLTQILKEYGLRPTRPDKTEGPIQVRKNVDGLVF LRRTISRDAAGFQGRLDRASIERQIFWTRGPNHSDPSETLVPHTQRKVQLISLLGEAS LHGEKFYRKISSKVIHEIKTGGLEMYVPGWQAMFRWMRFHDLGLWTGDRNLLPEFVND DGV" mat_peptide <1..1188 /gene="ORF1" /product="p48" mat_peptide 1189..2277 /gene="ORF1" /product="NTPase" mat_peptide 2278..2880 /gene="ORF1" /product="p22" mat_peptide 2881..3294 /gene="ORF1" /product="VPg" mat_peptide 3295..3837 /gene="ORF1" /product="Pro" mat_peptide 3838..5361 /gene="ORF1" /product="RdRp" gene 5348..6940 /gene="ORF2" CDS 5348..6940 /gene="ORF2" /inference="ab initio prediction:Prodigal:2.60" /inference="similar to AA sequence:UniProtKB:Q83884" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="AXQ39991.1" /translation="MMMASKDATSNVDGASGAGQLVPEANTSDPLAMDPVAGSSTAVA TAGQVNPIDPWIINNFVQAPQGEFTISPNNTPGDVLFDLSLGPHLNPFLLHLSQMYNG WVGNMRVRIMLAGNAFTAGKIIVSCIPPGFGSHNLTIAQSTLFPHVIADVRTLDPIEV PLEDVRNVLFHNNDRNQQTMRLVCMLYTPLRTGGGTGDSFVVAGRVMTCPSPDFNFLF LVPPTVEQKTRPFTLPNLPLSSLSNSRAPLPIGSMGISPDNVQSVQFQNGRCTLDGRL VGTTPVSLSQVAKIRGTSNGTVINLTELDGTPFHPFEGPAPIGFPDLGGCDWHVNMTQ FGHSSQTQFDVDTTPETFVPHLGSIQANGIGSGNYIGVLSWISPPSHPSGSQVDLWKI PNYGSSVTEATHLAPSVFPPGFGEVLVFFMSKMPGPGAYNLPCLLPQEYISHFASEQA PTVGEAALLHYVDPDTGRNLGEFKAYPDGFLTCVPNGASSGPQQLPINGVFVFVSWVS RFYQLKPVGTASSARGRLGLRR" gene 6940..7588 /gene="ORF3" CDS 6940..7578 /gene="ORF3" /inference="ab initio prediction:Prodigal:2.60" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="AXQ39992.1" /translation="MAQAIIGAIAASTAGSALGAGIQVGGEAALQSQRYQQNLQLQEN SFKHDREMIGYQVEASNQLLAKNLATRYSLLRAGGLSSADAARSIAGAPVTRIVDWNG VRVSAPESSVTTLRSGGFMSVPIPYTSKQKQIQPSGISNPNYSPSSISRTTSWVESQN SLRFGNLSPYHTEALNTVWLTPPGSTASSTLSSVPRGYFNTDRLPLFANNRR" ORIGIN 1 atggcgtcga aagacgtcgt tcctgctgct gctagcagtg aaaatgctaa caacaacagt 61 agtatcaagt ctcgcctatt ggcgagactc aaaggtgtga gtggaacaac gactccacct 121 aattctatac aaataactaa ccaaagtatg gctctggggc taattggcca ggccccagct 181 ccaaaggcca cagccgtgga gacccctaaa caacagaggg ataggcctcc acggactgcc 241 gccgaagtcc aacaaaattt atgttggacc gagaagccgc tagaccaaaa tgtcaaggcg 301 tgggacgaat tggaccacac aacgaaacaa cagatacttg atgagcatgc tgaatggttt 361 gatgctggtg gcttaggccc aagcacactt cctagcaacc atgaacgtca ctcaaatgat 421 gatagtggag gccatcaagt gaaatggtcg gctaaagagg gtgtgaatct cgggatagga 481 gggcttacaa ccgttcccgg gcccgagtgg aacatgtgcc cgctaccccc agcagatcaa 541 aggagtacga cacctgcagt cgagcccata ataggtgata tgatcgaatt ttatgaaggt 601 catatctacc attatgccat ttatatcggc caaggcaaga cagtgggtgt ccactctcct 661 caagcagcat tttcaataac aagaatcacc atacaaccta tatcagcatg gtggcggata 721 tgttatgtcc cacagccaaa acagagactt acatatgacc aactcaaaga attggagaac 781 gaaccatggc catacgctgc agttaccaat aactgctttg aattctgctg tcaggtcatg 841 tgtctagatg acacttggct acaaagaaag ctcatctcct ctggacggtt ccaccatcca 901 acccaggatt ggtctcggga cactccagag tttcaacaag acagcaaact tgagatggtc 961 agagatgcag tgctggccgc gataaatggg ctagtgtcgc ggccctttaa ggatctccta 1021 ggcaaactca aacctttgaa tgtgctcaat ttactttcaa attgtgactg gacatttatg 1081 ggggtcgtgg aaatggtggt cctccttttg gagctttttg gggtgttctg gaacccacct 1141 gatgtgtcta acttcatagc ttcactccta ccagatttcc atctacaggg ccctgaagat 1201 cttgctagag atctcgtgcc agtggtcctg ggtggtattg gtctggctat aggatttact 1261 agggacaagg tgggtaaaat gatgaaaaat gctgttgacg gactgcgggc tgcgactcag 1321 ctcggacagt acggcctgga aatattctca ctattaaaga agtacttctt tggtggagac 1381 cagacagaga agaccctgaa agatatagag tcagctgtta ttgacatgga agtgctgtcg 1441 tccacctcag tgacccaact cgtaagggat aagcaatctg cgcgggctta catggctatc 1501 ttggataatg aagaagaaaa agccagaaag ctgtccgtta ggaatgctga cccacatgtg 1561 gtatcctcca ccaatgctct catatctcgg atatcaatgg ccagagccgc cctagccaag 1621 gcccaggctg agatgactag taggatgcgt cctgtggtta ttatgatgtg tgggccccct 1681 ggcataggaa aaaccaaagc agcggagcat ctggccaaac gattggccaa cgagatacgg 1741 cctggcggca aagttgggct agttccacgt gaggcagtgg accattggga tggctatcat 1801 ggtgaagagg tgatgttatg ggatgactat gggatgacca aaatacagga ggattgcaac 1861 aaactacagg ccatagctga ctcggccccc ttgacgctca actgtgaccg aatagagaac 1921 aaagggatgc aattcgtatc tgatgctata gtcataacca ccaatgcccc tggcccagcc 1981 cccgtagact ttgtcaacct cggaccggtc tgccggaggg tggacttcct tgtgtactgt 2041 acggcaccag aagtggaaca cacaaggaaa gttagtccag gcgacactaa tgcactaaag 2101 gattgtttta aacctgattt ctctcacatg aaaatggaac tggcccccca gggtggtttt 2161 gataaccaag ggaacactcc atttggcaag ggtacaatga agccaactac cataaataga 2221 ttgttaatcc aggccgtggc attgacaatg gagaggcagg acgagttcca actccaaggt 2281 cctacttatg actttgacac tgatagggtc tctgcattta ctaagatggc ccgagctaat 2341 gggataggtc ttatatccat ggcctccctg ggcaagaaat tgcgtggggt ttccaccatt 2401 gaaggcttga agaatgccct tctaggatac aagatagcaa agtgcagtat acagtggcag 2461 tcaagggtgt acattataga atcagatggt ggcaatgtac acatcaagga ggacaagcag 2521 gctctaactc cttcacagca ggcaattaac acagcctccc tggccatcac acgactcaag 2581 gcggctaggg ctgtagctta cgcctcatgt tttcaatctg ccgtcaccac catactacaa 2641 atggcagggt ctgctctggt catcaatcgg gcagttaagc gcatgtttgg cacccgtacg 2701 gcagccatgg cgctagaggg accagaaaag gaacataatt gtagggttca taaagctaag 2761 gaagctggaa aggggcctat aggacatgat gacatgatag aaaagtttgg tctgtgtgag 2821 actgaagagg aagaaagtga agatcaaatc caaataaccc caaatgatgc cattccagag 2881 gggaagaaca aaggtaagac caagaagggc cgcggccgca aaaataatta caatgcattc 2941 tcccgacgtg ggttgagtga tgaggagtat gaagaataca agaagatcag ggaagagaag 3001 aatggtaatt acagcataca agagtatctg gaggatcgtc aacggtacga ggaagagtta 3061 gcagaggttc aagcaggtgg tgacggtggc ataggagaga ctgagatgga gattcgccac 3121 agagttttct acaaatccaa aagcagaaaa caccaacaag agcagcgacg ccaacttggc 3181 ctagtaactg gatcggacat caggaaacgt aagcctattg actggacccc accaaagaat 3241 gaatgggccg atgatgatag ggaagttgac tacaatgaaa agatcaattt tgaggccccc 3301 ccaacaatgt ggagccgggt cacaaagttt gggtcagggt ggggcttctg ggtcagtcca 3361 accgtgttca tcaccaccac gcatgtggtg ccaactggcg tgaaggaatt ctttggcgaa 3421 cccctcacca acatagcaat ccatcaagca ggtgagttca cacagttcag gttttccaaa 3481 aagatacgcc ctgacctgac gggcatggtg ttggaggaag ggtgccctga aggaacagtc 3541 tgttcagtcc tgatcaaacg agactcgggt gagcttctcc ctctagctgt ccgtatgggg 3601 gctattgcct ctatgaggat acagggccgc ctcgttcatg gccaatcagg aatgctgctg 3661 acaggagcga atgcaaaagg aatggacctt ggaactatac caggtgactg tggggcacca 3721 tacgtccaca agcgcggtaa tgactgggtc gtgtgtgggg tccatgctgc agccacaaaa 3781 tcaggtaaca ctgtggtctg tgctgtgcag gctagtgaag gtgagaccgc attagaaggc 3841 ggagacaagg ggcattatgc gggtcatgaa attgtaaggc acgggaatgg cccagcattg 3901 tcaactaaaa caaagttctg gagatcctcc ccagagccat taccccctgg tgtttatgag 3961 cctgcatacc taggaggaaa ggaccctcgt gtccaaaacg gcccctccct ccaacaggtt 4021 ctgcgtgacc aattgaaacc cttcgcagaa ccccgtggtc gcatgcctga gcccggtttg 4081 ctggaggcag cagttgaaac tgtaacatcc atgttagagc aaacaatgga tactccaagc 4141 ccatggtctt atgctgatgc ttgtcaatct cttgataaaa ctacaagttc aggctttccc 4201 taccacaaga ggaagaatga tgattggaac ggcaccacct ttgtcaggga gcttggcgat 4261 caggccgcac atgctaacaa catgtatgag aatggtaaac acatgaaacc catctacaca 4321 gcagctttga aggatgagct agtcaaaccc gaaaaggtct accaaaagat caaaaagcgc 4381 ctgttatggg gtgctgatct cgggactgta atcagagctg cccgagcttt tggcccattt 4441 tgtgatgcta taaagccaca tgtcattaag ctaccaataa aggttggtat gaacataata 4501 gaagatggcc ccctaattta tgctgagcat gccaagtaca aaaaccactt tgacgcggat 4561 tatacggcat gggattcaac acagaacaga caaattatga cagaatcttt ctccattatg 4621 tcacgcctta cggcctcccc cgagttggcc gaagtcgtgg ctcaggattt attggcacca 4681 tctgagatgg atgtgggtga ctatgtcata agagttaagg aaggcttacc atctgggttc 4741 ccgtgtactt cccaagtaaa cagcataaat cattggataa ttactctctg tgcactttct 4801 gaagccactg gtctgtcacc tgatgtggtg caatcaatgt catatttttc attttatggt 4861 gatgatgaga ttgtgtcaac tgacatagat tttgacccag cccgtctcac ccaaatcctt 4921 aaggaatatg gccttagacc aacaaggcca gacaaaactg aaggaccaat acaggtcagg 4981 aaaaatgtgg atggattagt tttcttgcgc cgcaccatct cccgcgacgc tgcagggttc 5041 cagggcagat tggacagggc ctcaattgaa cggcagattt tctggacccg cgggcccaat 5101 cattcagatc cttcagagac cttagtacca cacacccaaa gaaaagtgca attgatctca 5161 ctattgggag aggcttcact tcatggagaa aagttctaca gaaagatctc cagcaaagtc 5221 atacatgaaa tcaagactgg tggattggag atgtatgttc cagggtggca ggccatgttc 5281 cgctggatgc gcttccatga cctcggattg tggacaggag atcgcaatct cctgcccgaa 5341 ttcgtaaatg atgatggcgt ctaaggacgc tacgtcaaac gtggatggcg ccagcggcgc 5401 tggtcagttg gtaccggagg ctaatacttc tgaccctctt gcaatggatc ctgtagcggg 5461 ttcttcgaca gcggttgcga ctgctggaca agtaaacccc attgatcctt ggataattaa 5521 caattttgtg caagctcccc aaggggaatt tacaatctcc ccaaataata cccccggtga 5581 tgttttattt gatctgagtt taggtcccca tcttaacccc ttcttgctac atctgtcaca 5641 aatgtataat ggttgggttg gcaacatgag agttaggatt atgctggctg gtaatgcttt 5701 cactgcaggt aagatcatag tctcctgtat acctcctggt tttggatcgc ataatctcac 5761 tatagcacaa tcaactctgt ttccacatgt gattgctgat gttaggactc tagaccctat 5821 agaagtgcct ttggaagatg ttagaaatgt tcttttccat aataatgata ggaatcaaca 5881 aaccatgcgc cttgtgtgca tgttgtacac ccctctccgc actggtggcg gtacaggtga 5941 ttcctttgtt gtggcggggc gggtcatgac ctgccctagt cctgatttta atttcttgtt 6001 cctggttccc cccacagttg agcagaaaac tagacctttc acccttccaa atttgccttt 6061 gagctctttg tccaattcac gcgctcctct tccaattggc agcatgggca tctctccaga 6121 caatgtccag agtgtacagt tccaaaatgg tcgatgtact ttggacggtc gtttggttgg 6181 cactacccca gtttcactgt cccaggttgc taagataagg ggcacttcaa atggtactgt 6241 cattaacctt accgaactgg atggtacacc tttccaccct tttgagggcc ctgcccccat 6301 tggattccca gacctcggtg gttgtgattg gcacgttaat atgacacagt ttggccactc 6361 tagtcagaca caatttgatg tggataccac ccctgaaacc ttcgtccctc atttgggatc 6421 aatccaggca aatggtattg gtagtggcaa ttatattggt gttctcagtt ggatctcccc 6481 tccatcacat ccctctggtt cccaggtaga tctttggaag atccccaact atgggtcgag 6541 tgttactgag gcaacacatc tggccccatc agttttccca cccggcttcg gggaagtgct 6601 ggttttcttc atgtcaaaga tgccagggcc tggcgcctac aatctgccct gtttgctgcc 6661 acaagagtac atctcacatt ttgcaagtga acaagccccc actgtgggtg aggctgctct 6721 actccattat gttgatcctg atacagggcg gaaccttggg gagttcaaag cataccccga 6781 tggattcctc acttgtgtcc ccaatggagc cagctcgggt ccacaacaat taccaatcaa 6841 tggggttttt gtttttgtct cttgggtgtc taggttctat caattgaagc ctgtgggaac 6901 tgccagctcg gcaagaggta ggcttggact gcgccgataa tggcccaagc tataattggt 6961 gcaattgctg cctccacagc gggcagtgcc cttggggcag gcatacaggt tggtggtgag 7021 gcagcactcc aaagtcaaag ataccaacag aatctacaac tgcaagagaa ttcctttaaa 7081 catgatagag aaatgattgg atatcaggta gaggcttcaa atcaattgct agctaagaat 7141 ctggcaacca gatactcact cctccgtgct ggaggcctat ccagtgctga tgcggcaagg 7201 tccatagcgg gagccccagt gacccgaatc gtggactgga acggtgtgag ggtgtcagcc 7261 cctgagtctt ctgtaaccac attgaggtct ggtggcttta tgtcggtgcc aataccatac 7321 acatctaaac agaaacagat tcaaccatct ggcattagta atccaaatta ttctccttct 7381 tccatttctc gaaccactag ttgggttgaa tcacaaaatt cattaagatt tgggaattta 7441 tccccatacc acacagaggc cctcaataca gtgtggttga ccccacctgg ctcaacagca 7501 tcttccacgc tgtcttctgt gccacgtggc tatttcaata cagatagatt gccattgttc 7561 gcaaacaata ggcgataatg ttgtaatatg aaatgtgg //