Typing tool
|
Complete norovirus genomes
MT238668 | GII.4 Sydney | ||
---|---|---|---|
GII.P16 |
ORF1: 1..5093 ORF2: 5074..6696 ORF3: 6696..7502LOCUS MT238668 7556 bp RNA linear VRL 01-MAY-2020 DEFINITION Norovirus GII isolate 782-3 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MT238668 VERSION MT238668.1 DBLINK BioProject: PRJNA604000 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7556) AUTHORS Yang,Z., Williams-Hill,D., Wolfe,J., Silva,A.J., Hirneisen,K., Ruelle,S., Kulka,M. and Hellberg,R. TITLE Direct Submission JOURNAL Submitted (24-MAR-2020) Molecular Virology Team/Devision of Molecular Biology, OARSA/CFSAN/FDA, 8301 Muirkirk Rd., Laurel, MD 20708, USA COMMENT ##Assembly-Data-START## Assembly Method :: CLC Genomics Workbench v. 11 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7556 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="782-3" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="17-Dec-2018" /note="genotype: GII.P16-GII.4" gene <1..5093 /gene="ORF1" CDS <1..5093 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QIQ09391.1" /translation="ASNDATVAVACNNNNDKEKSSGEGLFTNMSFTLKKALGARPKQP APRDEPQKPPRPPTPELVKRIPPPPPNGEGEEEPVIRYEVKSGISGLPELTTVPQPDV ANTAFSVPPLSLRENREAKEPLTGAILEMWDGEIYHYGLYVEKGLVLGVHKPPAAISM ARVELTPLSLYWRVVYTPQYLISPETLRRLNGEAFPYTAFDNNCYAFCCWVLDLNDSW LSRRMVQRTTGFFRPYQEWNRKPLPTMDDSKIKKVANIFLCSLSTLFTRPIKDLIGKI KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK KEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDMEEEKARRLSTK SASPDIVGTINALLARIAAARSLVHRAKEELSSRIRPVVVMISGKPGIGKTHLAREVA RKVASTLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELADTCP LTLNCDRIENKGKVFDSEVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA KRDFPGQPDMWKDAFKPDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS GLLHERMDEFELQGPTITTFNFDRNRITAFRQLAAENKYGLVDTMKVGNQLKGVKTME ELKQAIRNVTIKRCRIIYGGSTYDLESDGKGKVLVEKVKNTSVQTNNELAGALHHLKH ARIRYYVKCVQEAVYSIIQIAGAAFVTTRIARRMNIQELWSKPQLDQNESETKEEAPK SEDDEFIISSKDIKEEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYS IEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLVTGS EIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFEAPPSIWSRIVSFGSGWGFWVSPSLF ITSTHVIPAGITEAFGVPIKQIQIHKSGEFCRFRFPRPIRPDVTGMILEEGAPEGTVA TVLIKRPTGELMPLAARMGTHATMKIQGRMVGGQMGMLLTGSNAKGMDLGTTPGDCGC PYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGYDKGTYCGAPILGPGG APKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEPRGK PPRPSVLEAAKQTVINVLEQTLDPPQKWTYAQACASLDKTTSSGHPHHVRKNEFWNGE TFTGKLADQASKANLMFEEGKHMTPVYTAALKDELVKTEKIYRKIKKRLLWGSDLSTM IRCARSFGGLMDEMKAHCISLPVRVGMNVNEDGPIIFEKHSRYKYHYDADYSRWDSTQ QRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPSVVDVGDFKITINEGLPSGVPCTSQW NSIAHWLLTLCALSEVTKLSPDIIQANSMFSFYGDDEIVSTDIKLDPEQLTAKLKEYG LKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHE DPNETMIPHSQRPIQLMALLGEASLHGPSFYSRISKLVITELKEGGMDFYVPRQEPMF RWMRFSDLSTWEGDRNLAPNFVNEDGVE" mat_peptide <1..989 /gene="ORF1" /product="p48" mat_peptide 990..2087 /gene="ORF1" /product="NTPase" mat_peptide 2088..2618 /gene="ORF1" /product="p22" mat_peptide 2619..3017 /gene="ORF1" /product="VPg" mat_peptide 3018..3560 /gene="ORF1" /product="Pro" mat_peptide 3561..5090 /gene="ORF1" /product="RdRp" gene 5074..6696 /gene="ORF2" CDS 5074..6696 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QIQ09392.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV" gene 6696..7502 /gene="ORF3" CDS 6696..7502 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QIQ09393.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGPSNKSS NSSTATSVYSNQTISTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV" ORIGIN 1 tggcgtctaa cgacgctacc gttgccgttg cttgcaacaa caacaacgac aaggaaaaat 61 cttcaggtga aggcttattc acaaatatgt ctttcacctt aaagaaagcc ctcggggcta 121 ggcccaaaca gcctgccccg agagacgaac cacaaaagcc cccaagacca ccaacccccg 181 agttggtcaa gaggataccc cctcctccac ctaatggcga aggagaagaa gaaccagtca 241 ttaggtatga ggttaagagt gggatctctg gcctgcccga gctcacaaca gtcccccaac 301 cggacgtggc caacacagca ttcagtgttc caccactgag cttgagagaa aacagggagg 361 ccaaggaacc gctaacaggg gcaatattag agatgtggga tggagagata taccactatg 421 gcctgtacgt ggagaaaggc ttagtgttgg gtgtgcacaa accacctgca gccataagca 481 tggcaagagt ggagctgacg ccgctgtcat tgtactggcg tgtggtgtac actccccaat 541 acctcatctc ccctgaaact ctcaggaggc tcaacggaga ggcgttccct tacaccgcct 601 tcgacaacaa ctgctacgcc ttttgctgct gggtgttaga cctcaatgac tcatggctta 661 gcaggaggat ggtgcaaaga acaacgggct tcttcagacc ttaccaagag tggaacagaa 721 agcccctgcc taccatggat gactccaaaa ttaagaaggt agcaaatata ttcctatgtt 781 cattgtccac attattcacc agacccataa aagacctcat agggaaaatt aaaccattaa 841 acatattgaa catcctggca acgtgtgact ggacgtttgc cggaatagtg gagtctctga 901 tattacttgc tgaactcttc ggagttttct ggacgccccc agatgtgtct gctatgatcg 961 ctcccttact cggggactac gagttgcaag ggccagaaga cctcgccgtt gaactcgtac 1021 ctgtggtaat gggagggatt ggtttggtgt tgggattcac caaagagaaa attggcaaaa 1081 tgttgtcctc agcagcatca acactcaggg cttgcaaaga tcttggtgcc tatggcttag 1141 agatactcaa gttggtcatg aagtggttct tcccaaagaa agaggaggcc aatgagctag 1201 ccatggtgag ggccatagag gatgccgtgc tagatcttga ggcaatagaa aataaccaca 1261 tgacaaccct gttgaaagac aaagacagct tagcaacata catgaaaaca ctggacatgg 1321 aggaggagaa agccagaagg ttgtccacaa aatctgcatc ccctgacata gttgggacaa 1381 tcaacgccct gctggctcga atagcagcgg ccaggtcatt agtccacagg gccaaggaag 1441 agctatctag caggataagg ccagtagttg ttatgatatc tggcaaacca ggaataggca 1501 aaactcatct ggccagggag gtggcaagaa aggtggcatc cactctcaca ggggaccaaa 1561 gagtcggact cataccaaga aacggtgtgg accattggga tgcatacaaa ggtgagagag 1621 tcgtgctgtg ggacgactat ggcatgagta accccatcca tgatgctctt cgcatacaag 1681 aattggctga tacgtgtccc cttaccttaa attgtgacag aattgaaaat aagggaaaag 1741 tttttgacag tgaagtcata ataattacaa caaaccttgc caatccagcc ccacttgatt 1801 atgtcaactt tgaggcctgt tccaggagaa ttgatttcct ggtgtacgct gaggcaccag 1861 aagtagaaaa ggcaaaacgg gactttcctg gtcagccaga tatgtggaag gacgccttca 1921 agccggactt ttcacacatc aagctacagc ttgcacctca gggcggcttt gacaagaatg 1981 gcaacacccc acatgggaaa ggagtgatga agaccctcac taccggttct ctgattgccc 2041 gtgcatcagg cctactacat gagaggatgg atgaatttga actccaaggt cccacaatca 2101 ccaccttcaa tttcgaccga aacagaatca cagcattcag acaattggct gcagaaaaca 2161 agtatggatt ggtggatacc atgaaagttg gcaatcaatt aaaaggagtg aaaaccatgg 2221 aagaactcaa acaagcaatc agaaatgtga ccatcaagag gtgccggatc atctacggtg 2281 gctccacgta tgaccttgaa tctgatggca agggcaaagt tttggtggaa aaggtcaaga 2341 acacctctgt acagaccaac aacgagttgg ccggggccct gcaccatctc aaacacgccc 2401 gaatcaggta ctatgtcaaa tgtgtgcaag aagcagtcta ttccatcata caaattgccg 2461 gcgctgcgtt tgtcaccacg cgcattgcac gccgcatgaa catacaagaa ctctggtcga 2521 agccacaatt agatcaaaat gaatcagaga ctaaggaaga ggcccccaaa tcagaagatg 2581 acgagttcat catatcttct aaggacatca aggaggaagg aaagaagggc aaaaacaaaa 2641 ctggccgtgg caagaaacac actgcattct ccagcaaggg cttgagcgat gaggagtatg 2701 acgagtacaa gaggataaga gaagagagaa atgggaagta ctctatagag gagtatcttc 2761 aagacagaga caggtactat gaggagctcg ccattgccaa ggccacggaa gaagacttct 2821 gtgaagagga ggagataaaa atccgtcaga gaattttccg tcccaccagg aaacaaagaa 2881 aggaagagag ggccacatta ggactggtaa caggttcaga aatcagaaaa agaaaccctg 2941 atgacttcaa acccaaaggg aagctgtggg ccgatgacaa cagaagtgtt gactataatg 3001 agaaactgga ctttgaggcc cccccaagca tatggtctag gattgtgagc tttggttctg 3061 gctggggctt ctgggtatca ccaagcctgt tcataacatc aactcatgta atccccgcag 3121 gcataacaga agcatttgga gtccccatca aacaaattca gatccacaaa tcaggtgaat 3181 tttgccgatt cagattccca agaccaatta gaccagacgt gacaggaatg atcttggaag 3241 aaggtgcgcc tgaaggcacc gtggcaactg tgctcatcaa acgccccacc ggagagctca 3301 tgcctcttgc agccagaatg ggaacacacg caaccatgaa aattcaaggc cgcatggttg 3361 gcggacagat gggtatgttg ctcactggat caaatgctaa aggaatggat ttgggaacaa 3421 ctcctggtga ctgtggctgt ccttacatct acaaaagggg caatgactat atagtcattg 3481 gggtgcacac tgcagcagcc cgtggtggaa acaccgtcat ctgtgccaca cagggaagtg 3541 agggtgaggc aactcttgag ggtggatatg acaaaggaac atactgtggg gcacccattc 3601 taggccctgg gggtgcacca aagttgagca ccaaaaccaa attttggagg tcatcgaaca 3661 cgcccctccc accagggaca tatgagcctg cctacctcgg tggccgtgat ccgcgtgtta 3721 agggtgggcc ctccttgcag caggtaatga gagaccagtt gaagccattc actgaaccca 3781 ggggcaaacc tccaagacca agtgtattgg aagcagccaa acaaaccgtt atcaatgtcc 3841 tcgaacaaac cctggatcct ccacaaaaat ggacatacgc acaggcgtgt gcctcacttg 3901 acaaaaccac ttccagcggg catcctcatc acgtccgaaa gaatgaattc tggaatggtg 3961 agaccttcac cggcaaattg gcagaccaag catcaaaagc aaacctaatg tttgaggaag 4021 ggaaacacat gacaccagtg tatacagcag cactcaagga cgagctagtc aagactgaga 4081 aaatctatag aaagatcaag aagagactgc tctggggctc tgacttgtcc accatgatcc 4141 ggtgcgctag gtcatttggt gggctcatgg acgagatgaa ggcacactgc atatcactcc 4201 cagtacgagt tggcatgaat gtgaatgaag atggcccaat aatatttgag aaacattcca 4261 gatacaaata ccactatgac gcagactact ctcgttggga ttcaacacaa cagagggcag 4321 tactagcagc agccttggaa atcatggtca gattctctgc agaaccacaa ttggcacaaa 4381 tagtcgctga ggatcttctg gcccctagcg tagtagatgt aggagacttt aaaatcacta 4441 taaatgaagg gctcccatct ggtgtgccat gcacctccca atggaactcc atcgcacact 4501 ggctgctaac tctctgtgcc ttgtctgaag tcaccaaact gtcccctgac attatacaag 4561 caaattccat gttctcattt tacggtgatg acgagattgt cagcaccgac ataaaattgg 4621 accctgaaca gttaaccgcc aagttgaagg agtatggcct gaaaccaacc cgcccagaca 4681 agaccgaggg acccctgatc atcagtgaag atttgaacgg actcactttc ctccgaagga 4741 cggtgactcg tgacccagct ggctggtttg gaaaactgga ccaaagttca attttgaggc 4801 agatgtactg gactagagga ccaaatcacg aagatcccaa tgagacaatg ataccccatt 4861 ctcaaagacc catacagctc atggcactgc ttggtgaagc ctctcttcac ggaccctctt 4921 tctacagtag aatcagtaaa ttggtcataa ctgaactcaa agaaggtggg atggactttt 4981 acgtgccaag gcaggaaccc atgttcaggt ggatgaggtt ttctgacttg agcacgtggg 5041 agggcgatcg caatctggct cccaattttg tgaatgaaga tggcgtcgag tgacgccaac 5101 ccatctgatg ggtccgcagc caacctcgta ccagaggtca acaatgaggt tatggctttg 5161 gagcccgttg tcggtgccgc tattgcggca cctgtagcgg gccaacaaaa tgtaattgac 5221 ccctggatta gaaataattt tgtacaagcc cctggtgggg agtttacagt atcccccaga 5281 aacgctccag gtgaaatact atggagcgcg cccctaggcc ctgacctaaa tccctaccta 5341 tcccatttgg ccagaatgta caatggttat gcaggtggtt ttgaagtgca ggtaattctc 5401 gcggggaacg cgttcaccgc cgggaagatc atatttgcag cagtcccacc aaattttcca 5461 actgaaggct taagtcctag ccaggtcact atgttccccc atataatagt agatgttaga 5521 cagttagaac ctgtgctgat tcctttaccc gatgttagga ataatttcta tcattacaat 5581 cagtcaaatg actccactat taagttgata gcaatgttgt acacaccact tagggctaat 5641 aatgctgggg atgatgtttt cacagtttcg tgccgagttc tcacgagacc atcccccgat 5701 tttgatttca tatttttagt gccacccaca gttgagtcaa gaactaaacc attctctgtc 5761 ccagttttaa ctgttgagga gatgaccaat tcaagattcc ccatcccttt ggaaaagctg 5821 ttcacaggcc ccagcagtgc ctttgttgtt caaccacaaa acggcaggtg cacaactgat 5881 ggcgtgctcc taggcaccac ccaactttct cctgtcaaca tctgcacctt cagaggggat 5941 gtcacccaca tcacaggtag tcgcaactac acaatgaatt tggcttctca aaattggaac 6001 aactatgacc caacagaaga aatcccagcc cctctaggaa ctccagattt tgtggggaag 6061 attcaaggca tgctcaccca aaccacaagg acagatggtt caacacgcgg ccacaaagct 6121 acagtgtaca ctgggagcgc cgactttgct ccaaaactgg gtagagttca atttgaaact 6181 gacacagacc atgattttga agctaatcaa aacacaaagt tcaccccagt cggtgtcatc 6241 caagatggta gcaccaccca tcgaaacgaa ccccaacagt gggtgctccc aagttactca 6301 ggcagaaata ctcacaatgt acatctggcc cccgctgtag cccccacctt tccgggtgag 6361 caacttctct tcttcagatc caccatgccc ggatgcagcg ggtaccccaa catggatttg 6421 gactgtctgc tcccccagga atgggtgcag tacttctatc aagaggcagc cccagcacaa 6481 tctgatgtgg ctctgctaag atttgtgaat ccagacacag gtagggtttt gtttgagtgc 6541 aagcttcaca aatcaggcta tgttacagtg gctcacactg gccaacatga tttggttatc 6601 ccccccaatg gctactttag atttgattcc tgggtcaacc agttctatac gcttgccccc 6661 atgggaaatg gaacggggcg tagacgtgta gtataatggc tggagctttc tttgctggat 6721 tggcatctga tgtccttggc tctggacttg gttccctcat caatgctggg gctggggcca 6781 tcaaccaaaa agttgagttt gaaaataaca gaaaattgca acaagcatcc ttccaattta 6841 gcagcaatct gcaacaggct tcctttcaac atgataaaga gatgctccaa gcacaaattg 6901 aggccactaa aaagctacaa caggaaatga tgaaagttaa gcaggcaatg ctcctagagg 6961 gtgggttctc tgagacagat gcagcccgcg gggcaattaa cgcccccatg acaaaagctt 7021 tggactggag tgggacaagg tactgggctc ccgatgctag gactacaaca tacaatgcag 7081 gccgcttttc cacccctcaa ccatcggggg cactgccagg aagagctaat cttagggatg 7141 ctgtccctgc tcggggaccc tccaacaaat cttctaactc ttctactgcc acctctgtgt 7201 attcaaatca aactatttca acgagacttg gttctacagc tggttctgga accagtgtct 7261 cgagcctccc gtcaactgca aggactagga gctgggttga ggatcaaaat aggaatttgt 7321 cacctttcat gaggggggcc cacaacatat catttgtcac cccaccatct agcagatcct 7381 ctagccaagg cacagtctca accgtgccta aagagatttt ggactcctgg actggcgctt 7441 tcaacacgcg caggcagcct ctcttcgctc acattcgtaa gcgaggggag tcacgggtgt 7501 aatgtgaaaa gacaaaattg attatttttc tttttcttta gtgtctttta aaaaaa //