Typing tool

Complete norovirus genomes

MH218634  GII.4 Sydney | GII.P31

Length: 7,560 nt | 3 CDS

ORF1: 5..5104
ORF2: 5085..6707
ORF3: 6707..7513
LOCUS       MH218634                7560 bp    RNA     linear   VRL 11-JUN-2018
DEFINITION  Norovirus GII isolate NORO_165_21_06_2015, complete genome.
ACCESSION   MH218634
VERSION     MH218634.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7560)
  AUTHORS   Brown,J.R., Roy,S., Shah,D., Williams,C.A., Williams,R., Dunn,H.,
            Hartley,J., Harris,K. and Breuer,J.
  TITLE     Norovirus transmission dynamics in a paediatric hospital using full
            genome sequences
  JOURNAL   Clin. Infect. Dis. (2018) In press
   PUBMED   29800111
  REMARK    Publication Status: Available-Online prior to print
REFERENCE   2  (bases 1 to 7560)
  AUTHORS   Brown,J.R., Roy,S., Shah,D., Williams,C.A., Williams,R., Dunn,H.,
            Hartley,J., Harris,K. and Breuer,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-APR-2018) Division of Infection and Immunity,
            University College London, 90 Gower St, London, London WC1E 6BT,
            United Kingdom
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics Workbench v. 10.1
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7560
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="NORO_165_21_06_2015"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="United Kingdom"
                     /collection_date="21-Jun-2015"
                     /note="genotype: GII.Pe_GII.4"
     gene            5..5104
                     /gene="ORF1"
     CDS             5..5104
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="AWR17560.1"
                     /translation="MKMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARP
                     KQPPPREIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE
                     ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
                     LAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
                     WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
                     LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
                     LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
                     KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
                     KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL
                     AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
                     PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
                     AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
                     SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM
                     SDLKQALKSIAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLR
                     CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANRDGC
                     LKPKDEEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
                     KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
                     TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
                     SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
                     TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
                     CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
                     PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
                     RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
                     NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL
                     ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
                     STQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
                     SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
                     EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
                     NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
                     PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     5..994
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     995..2092
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2093..2629
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2630..3028
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3029..3571
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3572..5101
                     /gene="ORF1"
                     /product="RdRp"
     gene            5085..6707
                     /gene="ORF2"
     CDS             5085..6707
                     /gene="ORF2"
                     /note="predicted CDS stop by homology is invalid; there
                     may be a valid stop in a different location due to
                     truncation (trc) or extension (ext) (TAG|TAA|TGA) [TAT
                     ending at position 6705 on + strand];first in-frame stop
                     codon exists 3' of stop position predicted by homology to
                     reference [homology search predicted 5085..6705 revised to
                     5085..6707 (stop shifted 2 nt)]"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="AWR17561.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
     gene            6707..7513
                     /gene="ORF3"
     CDS             6707..7513
                     /gene="ORF3"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="AWR17562.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDTVPARSSLSKSS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN      
        1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gttgccaaca gcaacaacga
       61 catcgcaaaa tcttcaagtg acggcgtgtt ttctaacatg gctgtcactt ttaagcgggc
      121 cctcggggcg cggcctaaac agccgccccc gagggaaata ccacccagac ccccgcgacc
      181 acccacacca gaattagtca aaaagatccc tcctccccca cccaacgggg aggatgaact
      241 agtggtctct tacagcgcca aagatggcgt ttccggactg cctgagctca ccactgtcag
      301 gcaaccggaa gaaaccaaca cggcgttcag tgtccctcca ctcaaccaaa gggagagcag
      361 ggacgccaag gagccactaa ctggaacaat tattgaaatg tgggatggag aaatctacca
      421 ttacggcctg tacgtggaac gaggtcttat acttggtgtg cacaagccac cggcagctat
      481 cagccttgcc aaggtcgagc tagcaccgct ctctttgttc tggagacctg tatacacccc
      541 ccagtatctc atctctccag acacccttag gagactacat ggagagtcat tcccctacac
      601 tgcatttgac aacaattgct acgccttttg ttgttgggtg ttagacctaa acgactcatg
      661 gctaagcagg agaatgattc agagaacaac aggtttcttc aggccgtacc aggattggaa
      721 caggaaaccc ctccccacta tggatgattc caaattaaag aaggtggcca acatattctt
      781 gtgcactttg tcttcactat tcaccagacc cattaaggac ataataggga agttgaaacc
      841 ccttaacatc cttaacattc tggcaacatg tgattggacc ttcgcaggca tagtggaatc
      901 cttaatactc ttggcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat
      961 gatcgccccc ttgctaggtg attatgaact gcaaggacct gaggaccttg cagtggaact
     1021 ggtcccaata gtgatggggg ggataggttt ggtgctagga tttaccaaag agaaaatcgg
     1081 aaagatgcta tcatccgctg catccacttt aagagcttgt aaagaccttg gtgcatacgg
     1141 actggaaatc ttaaaattgg ttatgaagtg gttcttccca aagaaagagg aagcaaatga
     1201 actggctatg gtgagatcca tcgaggatgc agtgctagac ctcgaggcaa ttgaaaacaa
     1261 ccacatgacc accctactca aagacaaaga cagcttggca acctacatga gaacccttga
     1321 ccttgaggag gagaaagcca gaaaactctc aaccaaatct gcttcacccg acattgtggg
     1381 cacaatcaac tctcttctag caagaattgc tgctgcacgc tccctagtgc atcgggccaa
     1441 ggaagagctc tccagcaggc cgagacctgt cgttgtgatg atatcgggaa gaccagggat
     1501 agggaaaact caccttgcca gggagctggc caagaagatc gcggcctccc tcacaggaga
     1561 ccagcgcgtg ggtcttatcc cacgcaatgg tgtcgaccac tgggacgcat acaagggcga
     1621 aagagttgtc ctgtgggacg actatggaat gagcaacccc atccatgatg ccctcaggtt
     1681 gcaggagctt gctgatactt gccccctcac gctaaattgt gacagaattg agaacaaagg
     1741 gaaagtcttt gacagtgatg ccataattat caccaccaat ctggccaacc cagcaccact
     1801 ggattatgtc aactttgaag cgtgctcgag acgcattgac tttctcgtgt acgcggaagc
     1861 ccctgaggtg gagaaagcaa agcgcgactt cccaggtcaa cctgacatgt ggaagaacgc
     1921 tttcagtcct gacttctcac acataaaact gtcattggct ccacagggtg gttttgacaa
     1981 gaacggcaac accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat
     2041 cgcccgagca tcagggttgc tccatgagag gctagatgaa tatgaactgc aaggcccagc
     2101 cctcaccact ttcaattttg accgcaacaa gatacttgct tttagacagc ttgctgctga
     2161 aaacaagtat gggctgatgg acacaatgag agttggaaaa cagctcaagg atgtcaagac
     2221 catgtcagac ctcaagcaag cactcaagag catcgcgatc aagaagtgcc agatagtgta
     2281 caatggtagc acctatacac ttgaggccga tggcaagggt agtgtgaaag ttgacaaagt
     2341 gcaaagtgct actgtgcaga ccaacaatga gctagccggt gccctacacc acctaaggtg
     2401 cgccagaatc agatactacg ttaagtgcgt ccaggaggca ctgtattcca tcatccaaat
     2461 cgctggggct gcattcgtca ccacgcgcat cgctaagcgc atgaatatac agaatctctg
     2521 gtccaagcca caggtggaag acacagaaga gatggctaac agagatggtt gcctaaagcc
     2581 caaagatgaa gaagagtttg tcgtctcatc cgacgacatc aaaactgagg gcaagaaagg
     2641 gaagaacaag tccggccgtg gcaagaagca cacagccttt tcaagcaaag ggctcagtga
     2701 tgaagagtac gatgagtaca agagaatcag agaagaaagg aatggtaagt actccataga
     2761 agagtacctt caggacagag acaggtatta cgaggaggtg gccattgcca gggcaaccga
     2821 agaggacttc tgtgaagaag aagaagctaa aatccggcag agaattttta gaccaacaag
     2881 gaaacaacgt aaagaagaga gggcctctct cggcttggtc acaggctctg aaatcaggaa
     2941 gagaaaccca gaagacttca aacccaaggg aaagctatgg gctgatgatg acagaagtgt
     3001 tgactacaat gagaaactca acttcgaggc cccaccaagc atctggtcgc gaatagtcaa
     3061 ctttggttca ggctggggct tttgggtctc ccccagtctg tttataacat caacccatgt
     3121 cataccccaa ggtgcaaaag agttcttcgg agtccccatc aagcaaatcc agatacacaa
     3181 atcaggtgaa ttctgccggt tgagattccc aaagccaatc agaactgatg tgacgggcat
     3241 gattctagaa gaaggtgcgc ccgaggggac cgtggccaca ctgctcatta agagaccaac
     3301 tggggagctc atgcctctgg cagccaggat ggggacccat gcaaccatga aaatccaggg
     3361 acgcacagtt ggggggcaaa tgggtatgct cctgacagga tccaacgcca agagtatgga
     3421 cctaggcaca acaccaggcg actgcggctg cccctatatc tacaagaggg ggaatgacta
     3481 cgtggtcata ggagtccata cggccgctgc ccgcggagga aacactgtta tatgtgccac
     3541 ccaggggagt gagggagaag ccacacttga aggaggtgac agtaaaggga catactgtgg
     3601 cgcaccaatc ttgggcccag ggagcgctcc aaagctcagc accaagacta agttctggag
     3661 atcatccaca acgccactcc cacccggcac ctacgaacca gcctacctcg gtggcaaaga
     3721 ccccagggtc aaaggtggcc cttcattgca acaagttatg agggaccagc tgaagccatt
     3781 cacagaacct agaggcaaac caccaagacc aaatgtgttg gaagctgcca agaaaaccat
     3841 catcaatgtc cttgagcaaa caattgatcc accccaaaaa tggtcatttg cgcaagcttg
     3901 cgcatccctt gacaaaacca cctccagcgg ccacccgcac cacatgcgga aaaacgattg
     3961 ttggaatggg gagtccttca caggaaaatt ggctgatcaa gcctccaagg ccaacctaat
     4021 gtttgaagag ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttagt
     4081 aaagaccgat aaagtttatg gtaaggtcaa gaagaggctt ctgtggggtt cagatctggc
     4141 gaccatgata cggtgcgccc gagcttttgg aggcctcatg gatgaactca aggcacactg
     4201 tgtcacactt cctgtcagag ttggtatgaa catgaatgag gatggcccca tcatatttga
     4261 gaagcactcc agatatagat atcactatga tgctgattat tcccggtggg actcaacaca
     4321 acaaagggat gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca
     4381 cctggcccag atagttgcag aagacctcct ttcccctagc gtaatggatg taggtgactt
     4441 tcaaatatca ataagtgagg gtctcccctc tggggtacct tgcacctccc agtggaattc
     4501 catcgcccac tggctcctca ctctttgtgc actctctgaa gtcacggacc tgtcccctga
     4561 catcattcag gccaactccc ttttctcttt ctatggtgat gatgagattg taagcacaga
     4621 cataaagttg gacccagaga agctgacagc aaaactcaag gagtacgggc tgaaaccaac
     4681 ccgccccgac aaaactgaag gaccccttgt tatctctgaa gacctggatg gcctgacatt
     4741 cctccggaga actgtgaccc gtgatccagc tggctggttt ggaaaattgg aacaaagttc
     4801 aattctcagg caaatgtact ggaccagggg tcccaatcat gaagacccat ttgaaacaat
     4861 gataccacac tcccaaagac ccatacaatt gatgtccctg ctgggcgagg ctgcactcca
     4921 cggcccggca ttctatagca aaattagcaa attagtcatt gcagagttga aggaaggtgg
     4981 catggatttt tacgtgccca gacaagaacc aatgttcaga tggatgagat tctcagacct
     5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga
     5101 gtgacgccaa cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg
     5161 ttatggccct ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa
     5221 atgtaattga cccctggatt agaaataatt ttgtacaagc ccctggtgga gagtttacag
     5281 tatcccctag aaacgctcca ggtgaaatac tatggagcgc gcccttgggc cctgatctaa
     5341 atccctacct atcccatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc
     5401 aggtaatcct cgcggggaat gcgttcaccg ccgggaagat catatttgca gcagtcccac
     5461 caaattttcc aactgaaggc ttgagcccca gccaggtcac tatgttcccc catatagtag
     5521 tggatgttag gcaattagaa cctgtgttga ttcccttacc cgatgttagg aataatttct
     5581 atcactacaa ccaatcaaat gaccccacca ttaagttgat agcaatgttg tatacaccac
     5641 ttagggctaa taacgctggg gacgatgtct tcacagtttc ttgccgagtt ctcacgagac
     5701 catcccccga tttcgacttc atatttctag tgccacccac agttgagtca aggactaaac
     5761 cattctctgt cccagtttta actgttgagg agatgaccaa ttcaagattc cccattcctt
     5821 tggaaaagtt gttcacgggt cctagtagtg ccttcgttgt ccaaccacaa aacggcaggt
     5881 gcacgactga tggcgtgctc ctaggcacca cccaactgtc tcctgtcaac atctgcacct
     5941 tcaggggaga tgtcacccat attacaggta gtcataacta cacaatgaat ttggcttctc
     6001 aaaattggaa caattatgac ccaacagaag aaatcccagc ccctctggga actccagatt
     6061 ttgtggggaa gattcaaggc gtgctcaccc aaaccacaag gacagatggc tcaacacgcg
     6121 gccacaaagc tacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc
     6181 aatttgaaac tgacacaaac catgattttg aagctaacca aaacacaaag ttcaccccag
     6241 tcggtgtcat ccaagatggt agcaccaccc accgaaatga accccaacag tgggtgctcc
     6301 caagttactc aggcagaaat actcataatg tgcatctggc ccccgctgta gcccccactt
     6361 ttccgggtga gcaacttctc ttcttcagat ccaccatgcc cggatgcagc gggtacccca
     6421 acatggattt ggactgtctg ctcccccagg aatgggtgca gtacttctac caagaggcag
     6481 ccccagcaca atctgatgtg gctctgttaa gatttgtgaa tccagacaca ggtagggttt
     6541 tgtttgagtg taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg
     6601 atttggttat ccccccaaat ggttatttta ggtttgattc ctgggtcaac cagttttaca
     6661 cgcttgcccc catgggaaat ggaacggggc gtagacgtgc agtataatgg ctggagcttt
     6721 ctttgctgga ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg
     6781 ggctggggcc atcaaccaaa aagttgagtt tgaaaataat agaaaattgc aacaagcatc
     6841 cttccaattt agcagcaatc tacaacaggc ttcctttcaa catgataaag agatgctcca
     6901 agcgcaaatt gaggctacca aaaagctaca acaggaaatg atgaaagtta aacaggcaat
     6961 gctcctagag ggtgggttct ctgagacaga tgcagcccgc ggggcaatca acgcccccat
     7021 gacaaaagct ttggactgga gcgggacaag gtactgggct cctgatgcta ggactacaac
     7081 atacaatgca ggccgctttt ccacccctca accatcgggg gcactgccag gaagagctaa
     7141 tcttagggat actgtccctg ctcggagttc cctcagtaaa tcttctaact cttctactgc
     7201 tacttctgtg tactcaaatc aaactacttc aacgagactt ggttctacag ctggttctgg
     7261 caccagtgtc tcgagcttcc cgtcaactgc aaggactagg agctgggttg aggatcaaag
     7321 caggaatttg tcacctttca tgaggggggc ccacaacata tcgtttgtca ccccaccatc
     7381 tagcagatcc tctagccaag gcacagtctc aaccgtgcct aaagagattt tggactcctg
     7441 gactggcgct ttcaacacgc gcaggcagcc actcttcgct cacattcgta agcgagggga
     7501 gtcacgggcg taatgtgaaa agacaaaatt gattatcctt ctttttcttt agtgtctttt
//
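
The ORF and mat_peptide coordinates in this record can be sanity-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the typing tool: the coordinates are copied by hand from the summary and FEATURES table above, and the helper names (span_length, frame_offset, overlap, is_contiguous) are hypothetical, chosen only for illustration. It assumes GenBank's 1-based, inclusive coordinate convention.

```python
# Coordinates are 1-based and inclusive, copied from the record's summary
# and FEATURES table. The helper names below are illustrative only.
ORFS = {
    "ORF1": (5, 5104),
    "ORF2": (5085, 6707),
    "ORF3": (6707, 7513),
}

# Mature peptides annotated within ORF1, in genome order.
MAT_PEPTIDES = [
    ("p48", 5, 994),
    ("NTPase", 995, 2092),
    ("p22", 2093, 2629),
    ("VPg", 2630, 3028),
    ("Pro", 3029, 3571),
    ("RdRp", 3572, 5101),
]


def span_length(start, end):
    """Number of nucleotides in a 1-based inclusive interval."""
    return end - start + 1


def frame_offset(start):
    """Reading-frame offset (0, 1 or 2) relative to genome position 1."""
    return (start - 1) % 3


def overlap(a, b):
    """Number of nucleotides shared by two 1-based inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def is_contiguous(segments):
    """True if each (name, start, end) segment starts one base after the previous end."""
    return all(nxt[1] == prev[2] + 1 for prev, nxt in zip(segments, segments[1:]))


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        n = span_length(start, end)
        print(f"{name}: {n} nt, frame offset {frame_offset(start)}, "
              f"length divisible by 3: {n % 3 == 0}")
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
    print("mat_peptides contiguous:", is_contiguous(MAT_PEPTIDES))
```

With the coordinates as listed, ORF1 and ORF2 overlap by 20 nt and ORF2 and ORF3 share a single nucleotide (position 6707). The six mat_peptide features tile 5..5101 without gaps, which is ORF1 minus its 3-nt stop codon.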