Typing tool | Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| MH702273 | GII.4 Sydney | GII.P31 |

ORF1: 1..4948 ORF2: 4929..6551 ORF3: 6551..7357

LOCUS MH702273 7357 bp RNA linear VRL 01-JAN-2019 DEFINITION Norovirus GII strain Hu/BT/2012/GII.Pe-GII.4_Sydney_2012/ETR-NV-184 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MH702273 VERSION MH702273.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7357) AUTHORS Pham,A.H., Pham,T.T.T., Swierczewski,B.E., Ladaporn,B. and Baker,S. TITLE The genomics of Norovirus in Thailand JOURNAL Unpublished REFERENCE 2 (bases 1 to 7357) AUTHORS Pham,A.H., Pham,T.T.T., Swierczewski,B.E., Ladaporn,B. and Baker,S. TITLE Direct Submission JOURNAL Submitted (31-JUL-2018) Enterics, Oxford University Clinical Research Unit, 764 Vo Van Kiet, Ho Chi Minh, 5 70000, Vietnam COMMENT ##Assembly-Data-START## Assembly Method :: velvet v. 1.2.10 Assembly Name :: Norovirus Coverage :: 326,147 X Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7357 /organism="Norovirus GII" /mol_type="genomic RNA" /strain="Hu/BT/2012/GII.Pe-GII.4_Sydney_2012/ETR-NV-184" /isolation_source="stool specimen" /host="Homo sapiens" /db_xref="taxon:122929" /country="Bhutan" /collection_date="04-Dec-2012" /note="genotype: GII.Pe-GII.4_Sydney_2012" gene <1..4948 /gene="ORF1" CDS <1..4948 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="AXQ39936.1" /translation="IPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPEL TTVRQPEETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVH KPPAAISLAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCW VLDLNDSWLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRP IKDIIGKLKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDY ELQGPEDLAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKL VMKWFFPKKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEE
KARKLSTKSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGK THLARELAKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRL QELADTCPLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYA EAPEVEKAKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTT GSLIARASGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQ LKDVKTMSDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELA GALHHLRCARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEE MANKDGCLKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKR IREERNGKYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQKIFRPTRKQRKEE RASLGLVTGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSG WGFWVSPSLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMIL EEGAPEGTVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMD LGTTPGDCGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTY CGAPILGPGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQ LKPFTEPRGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHH MRKNDCWNGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKR LLWGSDLATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYD ADYSRWDSTQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGL PSGVPCTSQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPE KLTAKLKEYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQ MYWTRGPNHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMD FYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..838 /gene="ORF1" /product="p48" mat_peptide 839..1936 /gene="ORF1" /product="NTPase" mat_peptide 1937..2473 /gene="ORF1" /product="p22" mat_peptide 2474..2872 /gene="ORF1" /product="VPg" mat_peptide 2873..3415 /gene="ORF1" /product="Pro" mat_peptide 3416..4945 /gene="ORF1" /product="RdRp" gene 4929..6551 /gene="ORF2" CDS 4929..6551 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="AXQ39934.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP 
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDRDFEANQNTKFTPVGVIQDG STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6551..7357 /gene="ORF3" CDS 6551..7357 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="AXQ39935.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI SSVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 aataccaccc agacccccgc gaccacccac accagaattg gtcaaaaaga tccctcctcc 61 cccacccaac ggggaggatg aactagtggt ctcttacagc gccaaagatg gcgtttccgg 121 actgcctgag ctcaccactg tcagacaacc ggaagaaacc aacacggcgt tcagtgtccc 181 cccactcaac caaagggaga gcagggacgc caaggagcca ctaactggaa caatcattga 241 aatgtgggat ggagaaatct accattacgg cctgtacgtg gaacgaggtc ttatacttgg 301 tgtgcacaag ccaccggcag ccattagcct tgccaaggtc gagctagcac cgctctcttt 361 gttctggaga cctgtataca ccccccagta tctcatctct ccagacactc ttaggagatt 421 acatggagag tcattcccct acactgcatt tgacaacaat tgctacgcct tttgttgttg 481 ggtattagac ctaaacgact catggctaag caggagaatg attcagagaa caacaggctt 541 cttcaggccg taccaggatt ggaacaggaa acccctcccc actatggatg attccaaatt 601 aaagaaggta gccaacatat tcttgtgcac tttgtcttca ctattcacca gacccattaa 661 ggacataata gggaagttga aacctcttaa catccttaac attctggcta catgtgattg 721 gaccttcgca ggcatagtgg aatccttaat actcttggca gaactctttg gagttttctg 781 gacaccccca gatgtgtctg cgatgatcgc ccccttgcta ggtgattatg aactgcaagg 841 acctgaggac cttgcagtgg aactggtccc aatagtgatg ggggggatag gtttggtgct 901 aggatttacc aaagagaaaa tcggaaagat gctatcatcc gctgcatcca ctttaagagc 961 ttgtaaagac cttggtgcat acggactgga aatcttaaaa ttggtcatga agtggttctt 1021 cccaaagaaa 
gaggaagcaa atgaactggc tatggtgaga tccatcgagg atgcagtact 1081 agacctcgag gcaattgaaa acaaccacat gaccacccta ctcaaagaca aagacagctt 1141 ggcaacctac atgagaaccc ttgaccttga ggaggagaaa gccagaaaac tctcaaccaa 1201 atctgcttca cccgatattg tgggcacaat caactctctt ctggcaagaa tcgctgctgc 1261 acgctcccta gtgcatcggg cgaaagaaga gctctccagc aggccgagac ctgtcgttgt 1321 gatgatatcg ggaagaccag ggatagggaa aactcacctt gccagggagc tggccaagaa 1381 gatcgcggcc tccctcacag gggaccagcg tgtgggtctt atcccacgca atggtgtcga 1441 ccactgggac gcatacaagg gcgaaagagt tgtcctatgg gacgactatg gaatgagcaa 1501 ccccatccat gatgccctca ggttgcagga gcttgctgac acttgccccc tcacgctaaa 1561 ttgtgacaga attgagaaca aagggaaagt ctttgacagt gatgccataa ttatcaccac 1621 caatctggcc aacccagcac cactggatta tgtcaacttt gaagcgtgct cgagacgcat 1681 tgatttcctc gtgtacgcag aagcccctga ggtggagaag gcaaagcgcg acttcccagg 1741 tcaacctgac atgtggaaga acgctttcag tcctgacttc tcacacataa aattgtcatt 1801 ggctccacag ggtggttttg acaagaacgg caacaccccg catggaaaag gggtcatgaa 1861 gaccctcacc actggctccc tcatcgcccg agcatcaggg ttactccatg agaggctaga 1921 tgaatatgaa ctgcaaggcc cagccctcac cactttcaac tttgaccgca acaagatact 1981 tgcttttaga cagcttgctg ctgaaaacaa gtatgggctg atggacacaa tgagagttgg 2041 aaaacagctc aaggatgtca agaccatgtc agacctcaaa caagcactca agaacatcgc 2101 gatcaagaag tgccagatag tgtacaatgg tggcacctac acacttgagg ctgatggcaa 2161 gggtagtgtg aaagttgaca aagtgcaaag tgccactgtg cagaccaaca atgaactagc 2221 cggtgcccta caccacctaa ggtgcgctag aatcagatac tatgttaagt gcgtccagga 2281 ggcactgtat tccatcatcc aaatcgctgg ggctgcattc gtcaccacgc gcatcgctaa 2341 gcgcatgaat atacagaatc tctggtccaa gccacaggtg gaagacacag aagagatggc 2401 caacaaagat ggttgcctaa aacccaaaga tgatgaagag tttgtcgtct catccgacga 2461 catcaaaact gagggcaaga aagggaagaa caagtccggc cgtggcaaga agcacacagc 2521 cttttcaagt aaagggctca gtgatgagga gtacgatgag tacaagagaa tcagagaaga 2581 aaggaatggt aagtactcca tagaagagta ccttcaggac agagacaggt actacgagga 2641 ggtggccatt gccagggcaa ccgaagagga cttctgtgaa gaagaagagg ccaaaatccg 2701 gcagaaaatt ttcagaccaa 
caaggaaaca acgcaaagaa gagagggcct ctctcggctt 2761 ggtcacaggc tctgaaatca ggaagagaaa cccagaagac ttcaaaccca agggaaagct 2821 gtgggctgat gatgacagaa gtgttgacta caatgagaaa ctcaactttg aggccccacc 2881 aagcatctgg tcgcggatag tcaactttgg ttcaggctgg ggcttctggg tctcccccag 2941 tctgtttata acatcaaccc atgtcatacc ccaaggtgca aaagagttct tcggagtccc 3001 tatcaagcaa atccagatac acaagtcagg tgaattctgc cggttgagat tcccaaagcc 3061 aatcagaact gatgtgacgg gcatgattct agaagaaggt gcgcccgagg ggaccgtggc 3121 cacactgctc atcaagagac caactggaga gctcatgcct ctggcagcca gaatggggac 3181 ccatgcaacc atgaaaattc aggggcgcac agttggaggg caaatgggta tgctcctgac 3241 aggatccaac gccaagagta tggacctagg cacaacacca ggcgactgcg gctgccccta 3301 catctacaag agggggaatg actacgtggt cataggagtc catacggccg ctgcccgtgg 3361 aggaaacact gtcatatgtg ccacccaggg gagtgaggga gaagccacac ttgaaggagg 3421 tgacagtaaa gggacatact gtggcgcacc aatcttgggc ccagggagcg ctccgaagct 3481 cagtaccaag actaagtttt ggagatcatc cacaacacca ctcccacctg gcacctacga 3541 accagcctac ctcggtggca aagaccctag agtcaaaggt ggcccttcat tgcaacaagt 3601 tatgagggac cagctgaagc cattcacaga acccagaggc aaaccaccaa gaccaaatgt 3661 gttggaagct gccaagaaaa ccatcatcaa tgtccttgag caaacaattg atccacccca 3721 aaaatggtca tttgcgcaag cttgcgcatc ccttgacaaa accacctcca gcggccaccc 3781 gcaccacatg cggaaaaacg actgttggaa tggggagtcc ttcacaggaa aattggctga 3841 tcaagcctcc aaggccaacc taatgtttga agagggaaag aacatgactc cagtctacac 3901 aggtgcactt aaagatgagt tggtaaagac cgataaagtt tatggtaagg tcaagaagag 3961 gcttctgtgg ggttcagatc tggcgaccat gatacggtgc gcccgagctt ttggaggcct 4021 tatggatgaa ctcaaggcac actgtgtcac acttcctgtc agagttggta tgaacatgaa 4081 tgaggatggc cccatcatct ttgagaagca ctccagatat agatatcact atgatgctga 4141 ttattcccgg tgggactcaa cacaacaaag ggatgtgcta gcagcagcac tagaaatcat 4201 ggttaagttc tctccagaac cacacctggc ccagatagtt gcagaagacc tcctctcccc 4261 tagcgtgatg gatgtaggtg actttcaaat atcaataagt gagggtctcc cctctggggt 4321 accttgtacc tcccagtgga attccatcgc ccactggctc ctcactctgt gtgcactctc 4381 tgaagtcacg gacctgtccc ctgatatcat 
tcaggccaac tcccttttct ccttctatgg 4441 tgatgatgag attgtaagca cagacataaa gttggaccca gagaagctga cagcaaaact 4501 caaggagtac gggctgaaac caacccgccc cgacaaaact gaaggacccc ttgttatctc 4561 tgaagacctg gatggcctga cattcctccg gagaactgtg acccgtgatc cagctggctg 4621 gtttggaaaa ttggaacaaa gttcaattct caggcaaatg tactggacca ggggtcccaa 4681 ccatgaagat ccatttgaaa caatgatacc acactcccaa agacccatac aattgatgtc 4741 cttgctgggc gaggctgcac tccacggccc ggcattctat agcaaaatta gcaaattagt 4801 tattgcagag ttgaaggaag gtggcatgga tttttacgta cccagacaag agccaatgtt 4861 cagatggatg agattctcag atctgagcac gtgggagggc gatcgcaatc tggctcccag 4921 ttttgtgaat gaagatggcg tcgagtgacg ccaacccatc tgatgggtcc gcagccaacc 4981 tcgtcccaga ggtcaacaat gaggttatgg ctctggagcc cgttgttggt gccgccattg 5041 cggcacctgt agcgggccaa caaaatgtaa ttgacccctg gattagaaat aattttgtac 5101 aagcccctgg tggagagttt acagtatccc ctagaaacgc tccaggtgaa atactatgga 5161 gcgcgccctt gggccctgat ctaaatccct acctatccca tttggccaga atgtacaatg 5221 gttatgcagg tggttttgaa gtgcaggtaa ttctcgcggg gaacgcgttc accgccggga 5281 aggtcatatt tgcagcagtc ccaccaaatt ttccaactga aggcttgagc cccagccagg 5341 tcactatgtt cccccatata gtagtagatg ttaggcaact agaacctgtg ttgattccct 5401 tacccgatgt taggaataat ttctatcatt acaatcaatc aaatgacccc accattaagt 5461 tgatagcaat gttgtataca ccacttaggg ctaataatgc tggggatgat gtcttcacag 5521 tttcttgccg agttctcacg agaccatccc ccgattttga tttcatattt ctagtgccac 5581 ccacagttga gtcaagaact aaaccattct ctgtcccagt tttaactgtt gaggagatga 5641 ccaattcaag attccccatc cctttggaaa agttgttcac gggtcccagc agtgcctttg 5701 ttgtccaacc acaaaacggt aggtgcacga ctgatggcgt gctcctaggc accacccaac 5761 tgtctcctgt caacatctgc accttcagag gagatgtcac ccatatcaca ggtagtcgta 5821 actacacaat gaatttggct tctcaaaatt ggagcaatta tgacccaaca gaagaaatcc 5881 cagcccctct aggaactcca gattttgtgg ggaagattca aggcgtgctc acccaaacca 5941 caaggacaga tggctcaaca cgcggccaca aagccacagt gtacactggg agcgccgact 6001 ttgctccaaa actgggtaga gttcaatttg aaactgacac agaccgtgat tttgaagcta 6061 accaaaacac aaagttcacc ccagttggtg tcatccaaga 
cggtagcacc acccaccgaa 6121 atgaacccca acagtgggtg cttccaagtt actcaggcag aaatactcct aatgtgcatc 6181 tggcccccgc tgtagccccc acttttccgg gtgagcaact tctcttcttc agatccacca 6241 tgcccggatg cagcgggtac cccaacatgg acttggactg tctgctcccc caggaatggg 6301 tgcagtactt ctaccaagag gcagccccag cacaatctga tgtggctctg ctaagatttg 6361 tgaatccaga cacaggtagg gttttgtttg agtgtaagct tcataaatca ggctatgtta 6421 cagtggctca cactggccaa catgatttgg ttatcccccc caatggttat tttaggtttg 6481 attcctgggt caaccagttt tacacgcttg cccccatggg aaatggaacg gggcgtagac 6541 gtgcactata atggctggag ctttctttgc tggattggca tctgatgtcc ttggctctgg 6601 acttggttcc cttatcaatg ctggggctgg ggccatcaac caaaaagttg agtttgaaaa 6661 taacagaaaa ttgcaacaag catccttcca atttagcagc aatctacaac aggcttcctt 6721 tcaacatgac aaagagatgc tccaagcaca aattgaggcc accaaaaggc tacaacagga 6781 aatgatgaaa gttaagcagg caatgctcct agagggtggg ttctctgaga cagatgcagc 6841 ccgcggggca atcaacgccc ccatgacaaa agctttggac tggagcggga caaggtactg 6901 ggctcccgat gctaggacta caacatacaa tgcaggccgc ttttctaccc ctcaaccatc 6961 gggggcactg ccaggaagag ctaatcttag ggatgctgtc cctgctcggg gttcctccag 7021 taagtcttct aattcttcta ctgctacttc tgtgtactca aatcaaacta cttcaacgag 7081 gcttggttct acagctggtt ctggcaccag tgtctcgagc ttcccgtcaa ctgcaaggac 7141 taggagctgg gttgaggatc aaagtaggaa tttgtcacct ttcatgaggg gggcccacaa 7201 catatcgtct gtcaccccac catctagcag atcctctagc caaggcacag tctcaaccgt 7261 gcctaaagag attttggact cctggactgg cgctttcaac acgcgcaggc agccactctt 7321 cgctcacatt cgtaagcgag gggagtcacg ggcgtaa //
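The ORF coordinates listed for MH702273 imply that adjacent reading frames overlap slightly. As a minimal sketch, assuming only the coordinates shown in the record above (the names `ORFS`, `orf_length` and `overlap` are illustrative and not part of the typing tool), the lengths and overlaps can be checked like this:

```python
# ORF coordinates (1-based, inclusive) copied from the MH702273 record.
# ORFS, orf_length and overlap are hypothetical names used for illustration.
ORFS = {
    "ORF1": (1, 4948),     # annotated as partial at the 5' end (<1..4948)
    "ORF2": (4929, 6551),
    "ORF3": (6551, 7357),
}

def orf_length(start, end):
    """Length in nucleotides of a 1-based inclusive interval."""
    return end - start + 1

def overlap(a, b):
    """Number of nucleotides shared by two 1-based inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

for name, (start, end) in ORFS.items():
    print(name, orf_length(start, end), "nt")

print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

With these coordinates, ORF2 (1623 nt) and ORF3 (807 nt) are both multiples of three, which is consistent with their annotation as complete coding sequences, while ORF1 is annotated as partial with `/codon_start=2`.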