Typing tool

Complete norovirus genomes

MH638228  GI.1
 GI.P1

Length: 7,598 | 3 CDS

ORF1: 1..5364
ORF2: 5348..6940
ORF3: 6940..7578
LOCUS       MH638228                7598 bp ss-RNA     linear   VRL 19-JUL-2019
DEFINITION  Norovirus GI isolate ESP20695 nonstructural polyprotein (ORF1)
            gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
            cds.
ACCESSION   MH638228
VERSION     MH638228.1
KEYWORDS    .
SOURCE      Norovirus GI
  ORGANISM  Norovirus GI
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7598)
  AUTHORS   Pan,R.W., Koster,B.L., Vo,S., Balansay,M.S., Hayes,M.E., Graf,P.C.
            and Myers,C.A.
  TITLE     Acute Gasteoenteritis Surveillance Program
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7598)
  AUTHORS   Pan,R.W., Koster,B.L., Vo,S., Balansay,M.S., Hayes,M.E., Graf,P.C.
            and Myers,C.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-JUL-2018) Operational Infectious Diseases, Naval
            Health Research Center, 140 Sylvester Road, San Diego, CA 92106,
            USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.11.1
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7598
                     /organism="Norovirus GI"
                     /mol_type="genomic RNA"
                     /isolate="ESP20695"
                     /host="Homo sapiens"
                     /db_xref="taxon:122928"
                     /country="USA"
                     /collection_date="07-Nov-2017"
                     /note="genotype: GI.P1"
     gene            <1..5364
                     /gene="ORF1"
     CDS             <1..5364
                     /gene="ORF1"
                     /inference="ab initio prediction:Prodigal:2.60"
                     /inference="similar to AA sequence:UniProtKB:Q83883"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="AXQ39990.1"
                     /translation="MASKDVVPAAASSENANNNSSIKSRLLARLKGVSGTTTPPNSIQ
                     ITNQSMALGLIGQAPAPKATAVETPKQQRDRPPRTAAEVQQNLCWTEKPLDQNVKAWD
                     ELDHTTKQQILDEHAEWFDAGGLGPSTLPSNHERHSNDDSGGHQVKWSAKEGVNLGIG
                     GLTTVPGPEWNMCPLPPADQRSTTPAVEPIIGDMIEFYEGHIYHYAIYIGQGKTVGVH
                     SPQAAFSITRITIQPISAWWRICYVPQPKQRLTYDQLKELENEPWPYAAVTNNCFEFC
                     CQVMCLDDTWLQRKLISSGRFHHPTQDWSRDTPEFQQDSKLEMVRDAVLAAINGLVSR
                     PFKDLLGKLKPLNVLNLLSNCDWTFMGVVEMVVLLLELFGVFWNPPDVSNFIASLLPD
                     FHLQGPEDLARDLVPVVLGGIGLAIGFTRDKVGKMMKNAVDGLRAATQLGQYGLEIFS
                     LLKKYFFGGDQTEKTLKDIESAVIDMEVLSSTSVTQLVRDKQSARAYMAILDNEEEKA
                     RKLSVRNADPHVVSSTNALISRISMARAALAKAQAEMTSRMRPVVIMMCGPPGIGKTK
                     AAEHLAKRLANEIRPGGKVGLVPREAVDHWDGYHGEEVMLWDDYGMTKIQEDCNKLQA
                     IADSAPLTLNCDRIENKGMQFVSDAIVITTNAPGPAPVDFVNLGPVCRRVDFLVYCTA
                     PEVEHTRKVSPGDTNALKDCFKPDFSHMKMELAPQGGFDNQGNTPFGKGTMKPTTINR
                     LLIQAVALTMERQDEFQLQGPTYDFDTDRVSAFTKMARANGIGLISMASLGKKLRGVS
                     TIEGLKNALLGYKIAKCSIQWQSRVYIIESDGGNVHIKEDKQALTPSQQAINTASLAI
                     TRLKAARAVAYASCFQSAVTTILQMAGSALVINRAVKRMFGTRTAAMALEGPEKEHNC
                     RVHKAKEAGKGPIGHDDMIEKFGLCETEEEESEDQIQITPNDAIPEGKNKGKTKKGRG
                     RKNNYNAFSRRGLSDEEYEEYKKIREEKNGNYSIQEYLEDRQRYEEELAEVQAGGDGG
                     IGETEMEIRHRVFYKSKSRKHQQEQRRQLGLVTGSDIRKRKPIDWTPPKNEWADDDRE
                     VDYNEKINFEAPPTMWSRVTKFGSGWGFWVSPTVFITTTHVVPTGVKEFFGEPLTNIA
                     IHQAGEFTQFRFSKKIRPDLTGMVLEEGCPEGTVCSVLIKRDSGELLPLAVRMGAIAS
                     MRIQGRLVHGQSGMLLTGANAKGMDLGTIPGDCGAPYVHKRGNDWVVCGVHAAATKSG
                     NTVVCAVQASEGETALEGGDKGHYAGHEIVRHGNGPALSTKTKFWRSSPEPLPPGVYE
                     PAYLGGKDPRVQNGPSLQQVLRDQLKPFAEPRGRMPEPGLLEAAVETVTSMLEQTMDT
                     PSPWSYADACQSLDKTTSSGFPYHKRKNDDWNGTTFVRELGDQAAHANNMYENGKHMK
                     PIYTAALKDELVKPEKVYQKIKKRLLWGADLGTVIRAARAFGPFCDAIKPHVIKLPIK
                     VGMNIIEDGPLIYAEHAKYKNHFDADYTAWDSTQNRQIMTESFSIMSRLTASPELAEV
                     VAQDLLAPSEMDVGDYVIRVKEGLPSGFPCTSQVNSINHWIITLCALSEATGLSPDVV
                     QSMSYFSFYGDDEIVSTDIDFDPARLTQILKEYGLRPTRPDKTEGPIQVRKNVDGLVF
                     LRRTISRDAAGFQGRLDRASIERQIFWTRGPNHSDPSETLVPHTQRKVQLISLLGEAS
                     LHGEKFYRKISSKVIHEIKTGGLEMYVPGWQAMFRWMRFHDLGLWTGDRNLLPEFVND
                     DGV"
     mat_peptide     <1..1188
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     1189..2277
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2278..2880
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2881..3294
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3295..3837
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3838..5361
                     /gene="ORF1"
                     /product="RdRp"
     gene            5348..6940
                     /gene="ORF2"
     CDS             5348..6940
                     /gene="ORF2"
                     /inference="ab initio prediction:Prodigal:2.60"
                     /inference="similar to AA sequence:UniProtKB:Q83884"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="AXQ39991.1"
                     /translation="MMMASKDATSNVDGASGAGQLVPEANTSDPLAMDPVAGSSTAVA
                     TAGQVNPIDPWIINNFVQAPQGEFTISPNNTPGDVLFDLSLGPHLNPFLLHLSQMYNG
                     WVGNMRVRIMLAGNAFTAGKIIVSCIPPGFGSHNLTIAQSTLFPHVIADVRTLDPIEV
                     PLEDVRNVLFHNNDRNQQTMRLVCMLYTPLRTGGGTGDSFVVAGRVMTCPSPDFNFLF
                     LVPPTVEQKTRPFTLPNLPLSSLSNSRAPLPIGSMGISPDNVQSVQFQNGRCTLDGRL
                     VGTTPVSLSQVAKIRGTSNGTVINLTELDGTPFHPFEGPAPIGFPDLGGCDWHVNMTQ
                     FGHSSQTQFDVDTTPETFVPHLGSIQANGIGSGNYIGVLSWISPPSHPSGSQVDLWKI
                     PNYGSSVTEATHLAPSVFPPGFGEVLVFFMSKMPGPGAYNLPCLLPQEYISHFASEQA
                     PTVGEAALLHYVDPDTGRNLGEFKAYPDGFLTCVPNGASSGPQQLPINGVFVFVSWVS
                     RFYQLKPVGTASSARGRLGLRR"
     gene            6940..7588
                     /gene="ORF3"
     CDS             6940..7578
                     /gene="ORF3"
                     /inference="ab initio prediction:Prodigal:2.60"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="AXQ39992.1"
                     /translation="MAQAIIGAIAASTAGSALGAGIQVGGEAALQSQRYQQNLQLQEN
                     SFKHDREMIGYQVEASNQLLAKNLATRYSLLRAGGLSSADAARSIAGAPVTRIVDWNG
                     VRVSAPESSVTTLRSGGFMSVPIPYTSKQKQIQPSGISNPNYSPSSISRTTSWVESQN
                     SLRFGNLSPYHTEALNTVWLTPPGSTASSTLSSVPRGYFNTDRLPLFANNRR"
ORIGIN      
        1 atggcgtcga aagacgtcgt tcctgctgct gctagcagtg aaaatgctaa caacaacagt
       61 agtatcaagt ctcgcctatt ggcgagactc aaaggtgtga gtggaacaac gactccacct
      121 aattctatac aaataactaa ccaaagtatg gctctggggc taattggcca ggccccagct
      181 ccaaaggcca cagccgtgga gacccctaaa caacagaggg ataggcctcc acggactgcc
      241 gccgaagtcc aacaaaattt atgttggacc gagaagccgc tagaccaaaa tgtcaaggcg
      301 tgggacgaat tggaccacac aacgaaacaa cagatacttg atgagcatgc tgaatggttt
      361 gatgctggtg gcttaggccc aagcacactt cctagcaacc atgaacgtca ctcaaatgat
      421 gatagtggag gccatcaagt gaaatggtcg gctaaagagg gtgtgaatct cgggatagga
      481 gggcttacaa ccgttcccgg gcccgagtgg aacatgtgcc cgctaccccc agcagatcaa
      541 aggagtacga cacctgcagt cgagcccata ataggtgata tgatcgaatt ttatgaaggt
      601 catatctacc attatgccat ttatatcggc caaggcaaga cagtgggtgt ccactctcct
      661 caagcagcat tttcaataac aagaatcacc atacaaccta tatcagcatg gtggcggata
      721 tgttatgtcc cacagccaaa acagagactt acatatgacc aactcaaaga attggagaac
      781 gaaccatggc catacgctgc agttaccaat aactgctttg aattctgctg tcaggtcatg
      841 tgtctagatg acacttggct acaaagaaag ctcatctcct ctggacggtt ccaccatcca
      901 acccaggatt ggtctcggga cactccagag tttcaacaag acagcaaact tgagatggtc
      961 agagatgcag tgctggccgc gataaatggg ctagtgtcgc ggccctttaa ggatctccta
     1021 ggcaaactca aacctttgaa tgtgctcaat ttactttcaa attgtgactg gacatttatg
     1081 ggggtcgtgg aaatggtggt cctccttttg gagctttttg gggtgttctg gaacccacct
     1141 gatgtgtcta acttcatagc ttcactccta ccagatttcc atctacaggg ccctgaagat
     1201 cttgctagag atctcgtgcc agtggtcctg ggtggtattg gtctggctat aggatttact
     1261 agggacaagg tgggtaaaat gatgaaaaat gctgttgacg gactgcgggc tgcgactcag
     1321 ctcggacagt acggcctgga aatattctca ctattaaaga agtacttctt tggtggagac
     1381 cagacagaga agaccctgaa agatatagag tcagctgtta ttgacatgga agtgctgtcg
     1441 tccacctcag tgacccaact cgtaagggat aagcaatctg cgcgggctta catggctatc
     1501 ttggataatg aagaagaaaa agccagaaag ctgtccgtta ggaatgctga cccacatgtg
     1561 gtatcctcca ccaatgctct catatctcgg atatcaatgg ccagagccgc cctagccaag
     1621 gcccaggctg agatgactag taggatgcgt cctgtggtta ttatgatgtg tgggccccct
     1681 ggcataggaa aaaccaaagc agcggagcat ctggccaaac gattggccaa cgagatacgg
     1741 cctggcggca aagttgggct agttccacgt gaggcagtgg accattggga tggctatcat
     1801 ggtgaagagg tgatgttatg ggatgactat gggatgacca aaatacagga ggattgcaac
     1861 aaactacagg ccatagctga ctcggccccc ttgacgctca actgtgaccg aatagagaac
     1921 aaagggatgc aattcgtatc tgatgctata gtcataacca ccaatgcccc tggcccagcc
     1981 cccgtagact ttgtcaacct cggaccggtc tgccggaggg tggacttcct tgtgtactgt
     2041 acggcaccag aagtggaaca cacaaggaaa gttagtccag gcgacactaa tgcactaaag
     2101 gattgtttta aacctgattt ctctcacatg aaaatggaac tggcccccca gggtggtttt
     2161 gataaccaag ggaacactcc atttggcaag ggtacaatga agccaactac cataaataga
     2221 ttgttaatcc aggccgtggc attgacaatg gagaggcagg acgagttcca actccaaggt
     2281 cctacttatg actttgacac tgatagggtc tctgcattta ctaagatggc ccgagctaat
     2341 gggataggtc ttatatccat ggcctccctg ggcaagaaat tgcgtggggt ttccaccatt
     2401 gaaggcttga agaatgccct tctaggatac aagatagcaa agtgcagtat acagtggcag
     2461 tcaagggtgt acattataga atcagatggt ggcaatgtac acatcaagga ggacaagcag
     2521 gctctaactc cttcacagca ggcaattaac acagcctccc tggccatcac acgactcaag
     2581 gcggctaggg ctgtagctta cgcctcatgt tttcaatctg ccgtcaccac catactacaa
     2641 atggcagggt ctgctctggt catcaatcgg gcagttaagc gcatgtttgg cacccgtacg
     2701 gcagccatgg cgctagaggg accagaaaag gaacataatt gtagggttca taaagctaag
     2761 gaagctggaa aggggcctat aggacatgat gacatgatag aaaagtttgg tctgtgtgag
     2821 actgaagagg aagaaagtga agatcaaatc caaataaccc caaatgatgc cattccagag
     2881 gggaagaaca aaggtaagac caagaagggc cgcggccgca aaaataatta caatgcattc
     2941 tcccgacgtg ggttgagtga tgaggagtat gaagaataca agaagatcag ggaagagaag
     3001 aatggtaatt acagcataca agagtatctg gaggatcgtc aacggtacga ggaagagtta
     3061 gcagaggttc aagcaggtgg tgacggtggc ataggagaga ctgagatgga gattcgccac
     3121 agagttttct acaaatccaa aagcagaaaa caccaacaag agcagcgacg ccaacttggc
     3181 ctagtaactg gatcggacat caggaaacgt aagcctattg actggacccc accaaagaat
     3241 gaatgggccg atgatgatag ggaagttgac tacaatgaaa agatcaattt tgaggccccc
     3301 ccaacaatgt ggagccgggt cacaaagttt gggtcagggt ggggcttctg ggtcagtcca
     3361 accgtgttca tcaccaccac gcatgtggtg ccaactggcg tgaaggaatt ctttggcgaa
     3421 cccctcacca acatagcaat ccatcaagca ggtgagttca cacagttcag gttttccaaa
     3481 aagatacgcc ctgacctgac gggcatggtg ttggaggaag ggtgccctga aggaacagtc
     3541 tgttcagtcc tgatcaaacg agactcgggt gagcttctcc ctctagctgt ccgtatgggg
     3601 gctattgcct ctatgaggat acagggccgc ctcgttcatg gccaatcagg aatgctgctg
     3661 acaggagcga atgcaaaagg aatggacctt ggaactatac caggtgactg tggggcacca
     3721 tacgtccaca agcgcggtaa tgactgggtc gtgtgtgggg tccatgctgc agccacaaaa
     3781 tcaggtaaca ctgtggtctg tgctgtgcag gctagtgaag gtgagaccgc attagaaggc
     3841 ggagacaagg ggcattatgc gggtcatgaa attgtaaggc acgggaatgg cccagcattg
     3901 tcaactaaaa caaagttctg gagatcctcc ccagagccat taccccctgg tgtttatgag
     3961 cctgcatacc taggaggaaa ggaccctcgt gtccaaaacg gcccctccct ccaacaggtt
     4021 ctgcgtgacc aattgaaacc cttcgcagaa ccccgtggtc gcatgcctga gcccggtttg
     4081 ctggaggcag cagttgaaac tgtaacatcc atgttagagc aaacaatgga tactccaagc
     4141 ccatggtctt atgctgatgc ttgtcaatct cttgataaaa ctacaagttc aggctttccc
     4201 taccacaaga ggaagaatga tgattggaac ggcaccacct ttgtcaggga gcttggcgat
     4261 caggccgcac atgctaacaa catgtatgag aatggtaaac acatgaaacc catctacaca
     4321 gcagctttga aggatgagct agtcaaaccc gaaaaggtct accaaaagat caaaaagcgc
     4381 ctgttatggg gtgctgatct cgggactgta atcagagctg cccgagcttt tggcccattt
     4441 tgtgatgcta taaagccaca tgtcattaag ctaccaataa aggttggtat gaacataata
     4501 gaagatggcc ccctaattta tgctgagcat gccaagtaca aaaaccactt tgacgcggat
     4561 tatacggcat gggattcaac acagaacaga caaattatga cagaatcttt ctccattatg
     4621 tcacgcctta cggcctcccc cgagttggcc gaagtcgtgg ctcaggattt attggcacca
     4681 tctgagatgg atgtgggtga ctatgtcata agagttaagg aaggcttacc atctgggttc
     4741 ccgtgtactt cccaagtaaa cagcataaat cattggataa ttactctctg tgcactttct
     4801 gaagccactg gtctgtcacc tgatgtggtg caatcaatgt catatttttc attttatggt
     4861 gatgatgaga ttgtgtcaac tgacatagat tttgacccag cccgtctcac ccaaatcctt
     4921 aaggaatatg gccttagacc aacaaggcca gacaaaactg aaggaccaat acaggtcagg
     4981 aaaaatgtgg atggattagt tttcttgcgc cgcaccatct cccgcgacgc tgcagggttc
     5041 cagggcagat tggacagggc ctcaattgaa cggcagattt tctggacccg cgggcccaat
     5101 cattcagatc cttcagagac cttagtacca cacacccaaa gaaaagtgca attgatctca
     5161 ctattgggag aggcttcact tcatggagaa aagttctaca gaaagatctc cagcaaagtc
     5221 atacatgaaa tcaagactgg tggattggag atgtatgttc cagggtggca ggccatgttc
     5281 cgctggatgc gcttccatga cctcggattg tggacaggag atcgcaatct cctgcccgaa
     5341 ttcgtaaatg atgatggcgt ctaaggacgc tacgtcaaac gtggatggcg ccagcggcgc
     5401 tggtcagttg gtaccggagg ctaatacttc tgaccctctt gcaatggatc ctgtagcggg
     5461 ttcttcgaca gcggttgcga ctgctggaca agtaaacccc attgatcctt ggataattaa
     5521 caattttgtg caagctcccc aaggggaatt tacaatctcc ccaaataata cccccggtga
     5581 tgttttattt gatctgagtt taggtcccca tcttaacccc ttcttgctac atctgtcaca
     5641 aatgtataat ggttgggttg gcaacatgag agttaggatt atgctggctg gtaatgcttt
     5701 cactgcaggt aagatcatag tctcctgtat acctcctggt tttggatcgc ataatctcac
     5761 tatagcacaa tcaactctgt ttccacatgt gattgctgat gttaggactc tagaccctat
     5821 agaagtgcct ttggaagatg ttagaaatgt tcttttccat aataatgata ggaatcaaca
     5881 aaccatgcgc cttgtgtgca tgttgtacac ccctctccgc actggtggcg gtacaggtga
     5941 ttcctttgtt gtggcggggc gggtcatgac ctgccctagt cctgatttta atttcttgtt
     6001 cctggttccc cccacagttg agcagaaaac tagacctttc acccttccaa atttgccttt
     6061 gagctctttg tccaattcac gcgctcctct tccaattggc agcatgggca tctctccaga
     6121 caatgtccag agtgtacagt tccaaaatgg tcgatgtact ttggacggtc gtttggttgg
     6181 cactacccca gtttcactgt cccaggttgc taagataagg ggcacttcaa atggtactgt
     6241 cattaacctt accgaactgg atggtacacc tttccaccct tttgagggcc ctgcccccat
     6301 tggattccca gacctcggtg gttgtgattg gcacgttaat atgacacagt ttggccactc
     6361 tagtcagaca caatttgatg tggataccac ccctgaaacc ttcgtccctc atttgggatc
     6421 aatccaggca aatggtattg gtagtggcaa ttatattggt gttctcagtt ggatctcccc
     6481 tccatcacat ccctctggtt cccaggtaga tctttggaag atccccaact atgggtcgag
     6541 tgttactgag gcaacacatc tggccccatc agttttccca cccggcttcg gggaagtgct
     6601 ggttttcttc atgtcaaaga tgccagggcc tggcgcctac aatctgccct gtttgctgcc
     6661 acaagagtac atctcacatt ttgcaagtga acaagccccc actgtgggtg aggctgctct
     6721 actccattat gttgatcctg atacagggcg gaaccttggg gagttcaaag cataccccga
     6781 tggattcctc acttgtgtcc ccaatggagc cagctcgggt ccacaacaat taccaatcaa
     6841 tggggttttt gtttttgtct cttgggtgtc taggttctat caattgaagc ctgtgggaac
     6901 tgccagctcg gcaagaggta ggcttggact gcgccgataa tggcccaagc tataattggt
     6961 gcaattgctg cctccacagc gggcagtgcc cttggggcag gcatacaggt tggtggtgag
     7021 gcagcactcc aaagtcaaag ataccaacag aatctacaac tgcaagagaa ttcctttaaa
     7081 catgatagag aaatgattgg atatcaggta gaggcttcaa atcaattgct agctaagaat
     7141 ctggcaacca gatactcact cctccgtgct ggaggcctat ccagtgctga tgcggcaagg
     7201 tccatagcgg gagccccagt gacccgaatc gtggactgga acggtgtgag ggtgtcagcc
     7261 cctgagtctt ctgtaaccac attgaggtct ggtggcttta tgtcggtgcc aataccatac
     7321 acatctaaac agaaacagat tcaaccatct ggcattagta atccaaatta ttctccttct
     7381 tccatttctc gaaccactag ttgggttgaa tcacaaaatt cattaagatt tgggaattta
     7441 tccccatacc acacagaggc cctcaataca gtgtggttga ccccacctgg ctcaacagca
     7501 tcttccacgc tgtcttctgt gccacgtggc tatttcaata cagatagatt gccattgttc
     7561 gcaaacaata ggcgataatg ttgtaatatg aaatgtgg
//