Complete norovirus genomes

MH702273 | GII.4 Sydney | GII.P31

Length: 7,357 nt | 3 CDS

ORF1: <1..4948 (5' partial)
ORF2: 4929..6551
ORF3: 6551..7357
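
The ORF coordinates above overlap at both junctions, which is easy to miss when reading them as plain ranges. The following is a minimal sketch, not part of the typing tool, that works out span lengths, codon counts, and overlaps from those coordinates. The names `ORFS`, `CODON_START`, `span_length`, `codon_count`, and `overlap` are hypothetical helpers introduced for illustration; the reading-frame offsets are assumed from the `/codon_start` qualifiers of the CDS features in the record.

```python
# ORF coordinates from the summary (1-based, inclusive ends).
ORFS = {
    "ORF1": (1, 4948),
    "ORF2": (4929, 6551),
    "ORF3": (6551, 7357),
}

# Assumption: offsets taken from the /codon_start qualifiers of the CDS
# features. ORF1 is 5'-partial, so its first complete codon starts at base 2.
CODON_START = {"ORF1": 2, "ORF2": 1, "ORF3": 1}


def span_length(start, end):
    """Number of nucleotides in an inclusive 1-based range."""
    return end - start + 1


def codon_count(name):
    """Complete codons in an ORF, counting the stop codon."""
    start, end = ORFS[name]
    first_codon_base = start + CODON_START[name] - 1
    coding_nt = span_length(first_codon_base, end)
    if coding_nt % 3 != 0:
        raise ValueError(f"{name}: {coding_nt} nt is not a multiple of 3")
    return coding_nt // 3


def overlap(a, b):
    """Nucleotides shared by two ORFs (0 if they do not overlap)."""
    (a_start, a_end), (b_start, b_end) = ORFS[a], ORFS[b]
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        print(name, span_length(start, end), "nt,", codon_count(name), "codons")
    print("ORF1/ORF2 overlap:", overlap("ORF1", "ORF2"), "nt")
    print("ORF2/ORF3 overlap:", overlap("ORF2", "ORF3"), "nt")
```

With these coordinates, ORF1 and ORF2 share 20 nt (4929..4948), and ORF2 and ORF3 share a single nucleotide at position 6551, which is both the last base of the ORF2 stop codon and the first base of the ORF3 start codon.
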
LOCUS       MH702273                7357 bp    RNA     linear   VRL 01-JAN-2019
DEFINITION  Norovirus GII strain Hu/BT/2012/GII.Pe-GII.4_Sydney_2012/ETR-NV-184
            nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2)
            and VP2 (ORF3) genes, complete cds.
ACCESSION   MH702273
VERSION     MH702273.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7357)
  AUTHORS   Pham,A.H., Pham,T.T.T., Swierczewski,B.E., Ladaporn,B. and Baker,S.
  TITLE     The genomics of Norovirus in Thailand
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7357)
  AUTHORS   Pham,A.H., Pham,T.T.T., Swierczewski,B.E., Ladaporn,B. and Baker,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (31-JUL-2018) Enterics, Oxford University Clinical
            Research Unit, 764 Vo Van Kiet, Ho Chi Minh, 5 70000, Vietnam
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: velvet v. 1.2.10
            Assembly Name         :: Norovirus
            Coverage              :: 326,147 X
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7357
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /strain="Hu/BT/2012/GII.Pe-GII.4_Sydney_2012/ETR-NV-184"
                     /isolation_source="stool specimen"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Bhutan"
                     /collection_date="04-Dec-2012"
                     /note="genotype: GII.Pe-GII.4_Sydney_2012"
     gene            <1..4948
                     /gene="ORF1"
     CDS             <1..4948
                     /gene="ORF1"
                     /codon_start=2
                     /product="nonstructural polyprotein"
                     /protein_id="AXQ39936.1"
                     /translation="IPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPEL
                     TTVRQPEETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVH
                     KPPAAISLAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCW
                     VLDLNDSWLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRP
                     IKDIIGKLKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDY
                     ELQGPEDLAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKL
                     VMKWFFPKKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEE
                     KARKLSTKSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGK
                     THLARELAKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRL
                     QELADTCPLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYA
                     EAPEVEKAKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTT
                     GSLIARASGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQ
                     LKDVKTMSDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELA
                     GALHHLRCARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEE
                     MANKDGCLKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKR
                     IREERNGKYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQKIFRPTRKQRKEE
                     RASLGLVTGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSG
                     WGFWVSPSLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMIL
                     EEGAPEGTVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMD
                     LGTTPGDCGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTY
                     CGAPILGPGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQ
                     LKPFTEPRGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHH
                     MRKNDCWNGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKR
                     LLWGSDLATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYD
                     ADYSRWDSTQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGL
                     PSGVPCTSQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPE
                     KLTAKLKEYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQ
                     MYWTRGPNHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMD
                     FYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..838
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     839..1936
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     1937..2473
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2474..2872
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     2873..3415
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3416..4945
                     /gene="ORF1"
                     /product="RdRp"
     gene            4929..6551
                     /gene="ORF2"
     CDS             4929..6551
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="AXQ39934.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDRDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
     gene            6551..7357
                     /gene="ORF3"
     CDS             6551..7357
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="AXQ39935.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SSVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN      
        1 aataccaccc agacccccgc gaccacccac accagaattg gtcaaaaaga tccctcctcc
       61 cccacccaac ggggaggatg aactagtggt ctcttacagc gccaaagatg gcgtttccgg
      121 actgcctgag ctcaccactg tcagacaacc ggaagaaacc aacacggcgt tcagtgtccc
      181 cccactcaac caaagggaga gcagggacgc caaggagcca ctaactggaa caatcattga
      241 aatgtgggat ggagaaatct accattacgg cctgtacgtg gaacgaggtc ttatacttgg
      301 tgtgcacaag ccaccggcag ccattagcct tgccaaggtc gagctagcac cgctctcttt
      361 gttctggaga cctgtataca ccccccagta tctcatctct ccagacactc ttaggagatt
      421 acatggagag tcattcccct acactgcatt tgacaacaat tgctacgcct tttgttgttg
      481 ggtattagac ctaaacgact catggctaag caggagaatg attcagagaa caacaggctt
      541 cttcaggccg taccaggatt ggaacaggaa acccctcccc actatggatg attccaaatt
      601 aaagaaggta gccaacatat tcttgtgcac tttgtcttca ctattcacca gacccattaa
      661 ggacataata gggaagttga aacctcttaa catccttaac attctggcta catgtgattg
      721 gaccttcgca ggcatagtgg aatccttaat actcttggca gaactctttg gagttttctg
      781 gacaccccca gatgtgtctg cgatgatcgc ccccttgcta ggtgattatg aactgcaagg
      841 acctgaggac cttgcagtgg aactggtccc aatagtgatg ggggggatag gtttggtgct
      901 aggatttacc aaagagaaaa tcggaaagat gctatcatcc gctgcatcca ctttaagagc
      961 ttgtaaagac cttggtgcat acggactgga aatcttaaaa ttggtcatga agtggttctt
     1021 cccaaagaaa gaggaagcaa atgaactggc tatggtgaga tccatcgagg atgcagtact
     1081 agacctcgag gcaattgaaa acaaccacat gaccacccta ctcaaagaca aagacagctt
     1141 ggcaacctac atgagaaccc ttgaccttga ggaggagaaa gccagaaaac tctcaaccaa
     1201 atctgcttca cccgatattg tgggcacaat caactctctt ctggcaagaa tcgctgctgc
     1261 acgctcccta gtgcatcggg cgaaagaaga gctctccagc aggccgagac ctgtcgttgt
     1321 gatgatatcg ggaagaccag ggatagggaa aactcacctt gccagggagc tggccaagaa
     1381 gatcgcggcc tccctcacag gggaccagcg tgtgggtctt atcccacgca atggtgtcga
     1441 ccactgggac gcatacaagg gcgaaagagt tgtcctatgg gacgactatg gaatgagcaa
     1501 ccccatccat gatgccctca ggttgcagga gcttgctgac acttgccccc tcacgctaaa
     1561 ttgtgacaga attgagaaca aagggaaagt ctttgacagt gatgccataa ttatcaccac
     1621 caatctggcc aacccagcac cactggatta tgtcaacttt gaagcgtgct cgagacgcat
     1681 tgatttcctc gtgtacgcag aagcccctga ggtggagaag gcaaagcgcg acttcccagg
     1741 tcaacctgac atgtggaaga acgctttcag tcctgacttc tcacacataa aattgtcatt
     1801 ggctccacag ggtggttttg acaagaacgg caacaccccg catggaaaag gggtcatgaa
     1861 gaccctcacc actggctccc tcatcgcccg agcatcaggg ttactccatg agaggctaga
     1921 tgaatatgaa ctgcaaggcc cagccctcac cactttcaac tttgaccgca acaagatact
     1981 tgcttttaga cagcttgctg ctgaaaacaa gtatgggctg atggacacaa tgagagttgg
     2041 aaaacagctc aaggatgtca agaccatgtc agacctcaaa caagcactca agaacatcgc
     2101 gatcaagaag tgccagatag tgtacaatgg tggcacctac acacttgagg ctgatggcaa
     2161 gggtagtgtg aaagttgaca aagtgcaaag tgccactgtg cagaccaaca atgaactagc
     2221 cggtgcccta caccacctaa ggtgcgctag aatcagatac tatgttaagt gcgtccagga
     2281 ggcactgtat tccatcatcc aaatcgctgg ggctgcattc gtcaccacgc gcatcgctaa
     2341 gcgcatgaat atacagaatc tctggtccaa gccacaggtg gaagacacag aagagatggc
     2401 caacaaagat ggttgcctaa aacccaaaga tgatgaagag tttgtcgtct catccgacga
     2461 catcaaaact gagggcaaga aagggaagaa caagtccggc cgtggcaaga agcacacagc
     2521 cttttcaagt aaagggctca gtgatgagga gtacgatgag tacaagagaa tcagagaaga
     2581 aaggaatggt aagtactcca tagaagagta ccttcaggac agagacaggt actacgagga
     2641 ggtggccatt gccagggcaa ccgaagagga cttctgtgaa gaagaagagg ccaaaatccg
     2701 gcagaaaatt ttcagaccaa caaggaaaca acgcaaagaa gagagggcct ctctcggctt
     2761 ggtcacaggc tctgaaatca ggaagagaaa cccagaagac ttcaaaccca agggaaagct
     2821 gtgggctgat gatgacagaa gtgttgacta caatgagaaa ctcaactttg aggccccacc
     2881 aagcatctgg tcgcggatag tcaactttgg ttcaggctgg ggcttctggg tctcccccag
     2941 tctgtttata acatcaaccc atgtcatacc ccaaggtgca aaagagttct tcggagtccc
     3001 tatcaagcaa atccagatac acaagtcagg tgaattctgc cggttgagat tcccaaagcc
     3061 aatcagaact gatgtgacgg gcatgattct agaagaaggt gcgcccgagg ggaccgtggc
     3121 cacactgctc atcaagagac caactggaga gctcatgcct ctggcagcca gaatggggac
     3181 ccatgcaacc atgaaaattc aggggcgcac agttggaggg caaatgggta tgctcctgac
     3241 aggatccaac gccaagagta tggacctagg cacaacacca ggcgactgcg gctgccccta
     3301 catctacaag agggggaatg actacgtggt cataggagtc catacggccg ctgcccgtgg
     3361 aggaaacact gtcatatgtg ccacccaggg gagtgaggga gaagccacac ttgaaggagg
     3421 tgacagtaaa gggacatact gtggcgcacc aatcttgggc ccagggagcg ctccgaagct
     3481 cagtaccaag actaagtttt ggagatcatc cacaacacca ctcccacctg gcacctacga
     3541 accagcctac ctcggtggca aagaccctag agtcaaaggt ggcccttcat tgcaacaagt
     3601 tatgagggac cagctgaagc cattcacaga acccagaggc aaaccaccaa gaccaaatgt
     3661 gttggaagct gccaagaaaa ccatcatcaa tgtccttgag caaacaattg atccacccca
     3721 aaaatggtca tttgcgcaag cttgcgcatc ccttgacaaa accacctcca gcggccaccc
     3781 gcaccacatg cggaaaaacg actgttggaa tggggagtcc ttcacaggaa aattggctga
     3841 tcaagcctcc aaggccaacc taatgtttga agagggaaag aacatgactc cagtctacac
     3901 aggtgcactt aaagatgagt tggtaaagac cgataaagtt tatggtaagg tcaagaagag
     3961 gcttctgtgg ggttcagatc tggcgaccat gatacggtgc gcccgagctt ttggaggcct
     4021 tatggatgaa ctcaaggcac actgtgtcac acttcctgtc agagttggta tgaacatgaa
     4081 tgaggatggc cccatcatct ttgagaagca ctccagatat agatatcact atgatgctga
     4141 ttattcccgg tgggactcaa cacaacaaag ggatgtgcta gcagcagcac tagaaatcat
     4201 ggttaagttc tctccagaac cacacctggc ccagatagtt gcagaagacc tcctctcccc
     4261 tagcgtgatg gatgtaggtg actttcaaat atcaataagt gagggtctcc cctctggggt
     4321 accttgtacc tcccagtgga attccatcgc ccactggctc ctcactctgt gtgcactctc
     4381 tgaagtcacg gacctgtccc ctgatatcat tcaggccaac tcccttttct ccttctatgg
     4441 tgatgatgag attgtaagca cagacataaa gttggaccca gagaagctga cagcaaaact
     4501 caaggagtac gggctgaaac caacccgccc cgacaaaact gaaggacccc ttgttatctc
     4561 tgaagacctg gatggcctga cattcctccg gagaactgtg acccgtgatc cagctggctg
     4621 gtttggaaaa ttggaacaaa gttcaattct caggcaaatg tactggacca ggggtcccaa
     4681 ccatgaagat ccatttgaaa caatgatacc acactcccaa agacccatac aattgatgtc
     4741 cttgctgggc gaggctgcac tccacggccc ggcattctat agcaaaatta gcaaattagt
     4801 tattgcagag ttgaaggaag gtggcatgga tttttacgta cccagacaag agccaatgtt
     4861 cagatggatg agattctcag atctgagcac gtgggagggc gatcgcaatc tggctcccag
     4921 ttttgtgaat gaagatggcg tcgagtgacg ccaacccatc tgatgggtcc gcagccaacc
     4981 tcgtcccaga ggtcaacaat gaggttatgg ctctggagcc cgttgttggt gccgccattg
     5041 cggcacctgt agcgggccaa caaaatgtaa ttgacccctg gattagaaat aattttgtac
     5101 aagcccctgg tggagagttt acagtatccc ctagaaacgc tccaggtgaa atactatgga
     5161 gcgcgccctt gggccctgat ctaaatccct acctatccca tttggccaga atgtacaatg
     5221 gttatgcagg tggttttgaa gtgcaggtaa ttctcgcggg gaacgcgttc accgccggga
     5281 aggtcatatt tgcagcagtc ccaccaaatt ttccaactga aggcttgagc cccagccagg
     5341 tcactatgtt cccccatata gtagtagatg ttaggcaact agaacctgtg ttgattccct
     5401 tacccgatgt taggaataat ttctatcatt acaatcaatc aaatgacccc accattaagt
     5461 tgatagcaat gttgtataca ccacttaggg ctaataatgc tggggatgat gtcttcacag
     5521 tttcttgccg agttctcacg agaccatccc ccgattttga tttcatattt ctagtgccac
     5581 ccacagttga gtcaagaact aaaccattct ctgtcccagt tttaactgtt gaggagatga
     5641 ccaattcaag attccccatc cctttggaaa agttgttcac gggtcccagc agtgcctttg
     5701 ttgtccaacc acaaaacggt aggtgcacga ctgatggcgt gctcctaggc accacccaac
     5761 tgtctcctgt caacatctgc accttcagag gagatgtcac ccatatcaca ggtagtcgta
     5821 actacacaat gaatttggct tctcaaaatt ggagcaatta tgacccaaca gaagaaatcc
     5881 cagcccctct aggaactcca gattttgtgg ggaagattca aggcgtgctc acccaaacca
     5941 caaggacaga tggctcaaca cgcggccaca aagccacagt gtacactggg agcgccgact
     6001 ttgctccaaa actgggtaga gttcaatttg aaactgacac agaccgtgat tttgaagcta
     6061 accaaaacac aaagttcacc ccagttggtg tcatccaaga cggtagcacc acccaccgaa
     6121 atgaacccca acagtgggtg cttccaagtt actcaggcag aaatactcct aatgtgcatc
     6181 tggcccccgc tgtagccccc acttttccgg gtgagcaact tctcttcttc agatccacca
     6241 tgcccggatg cagcgggtac cccaacatgg acttggactg tctgctcccc caggaatggg
     6301 tgcagtactt ctaccaagag gcagccccag cacaatctga tgtggctctg ctaagatttg
     6361 tgaatccaga cacaggtagg gttttgtttg agtgtaagct tcataaatca ggctatgtta
     6421 cagtggctca cactggccaa catgatttgg ttatcccccc caatggttat tttaggtttg
     6481 attcctgggt caaccagttt tacacgcttg cccccatggg aaatggaacg gggcgtagac
     6541 gtgcactata atggctggag ctttctttgc tggattggca tctgatgtcc ttggctctgg
     6601 acttggttcc cttatcaatg ctggggctgg ggccatcaac caaaaagttg agtttgaaaa
     6661 taacagaaaa ttgcaacaag catccttcca atttagcagc aatctacaac aggcttcctt
     6721 tcaacatgac aaagagatgc tccaagcaca aattgaggcc accaaaaggc tacaacagga
     6781 aatgatgaaa gttaagcagg caatgctcct agagggtggg ttctctgaga cagatgcagc
     6841 ccgcggggca atcaacgccc ccatgacaaa agctttggac tggagcggga caaggtactg
     6901 ggctcccgat gctaggacta caacatacaa tgcaggccgc ttttctaccc ctcaaccatc
     6961 gggggcactg ccaggaagag ctaatcttag ggatgctgtc cctgctcggg gttcctccag
     7021 taagtcttct aattcttcta ctgctacttc tgtgtactca aatcaaacta cttcaacgag
     7081 gcttggttct acagctggtt ctggcaccag tgtctcgagc ttcccgtcaa ctgcaaggac
     7141 taggagctgg gttgaggatc aaagtaggaa tttgtcacct ttcatgaggg gggcccacaa
     7201 catatcgtct gtcaccccac catctagcag atcctctagc caaggcacag tctcaaccgt
     7261 gcctaaagag attttggact cctggactgg cgctttcaac acgcgcaggc agccactctt
     7321 cgctcacatt cgtaagcgag gggagtcacg ggcgtaa
//