Complete norovirus genomes
MT344180 | GII.4 Sydney | GII.P16 |
---|---|---|
ORF1: 1..5094 ORF2: 5075..6697 ORF3: 6697..7503LOCUS MT344180 7583 bp RNA linear VRL 01-MAY-2020 DEFINITION Norovirus GII isolate 782-2 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MT344180 VERSION MT344180.1 DBLINK BioProject: PRJNA604000 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7583) AUTHORS Yang,Z., Williams-Hill,D., Wolfe,J., Silva,A.J., Hirneisen,K., Ruelle,S., Kulka,M. and Hellberg,R. TITLE Direct Submission JOURNAL Submitted (13-APR-2020) Molecular Virology Team/Devision of Molecular Biology, OARSA/CFSAN/FDA, 8301 Muirkirk Rd., Laurel, MD 20708, USA COMMENT ##Assembly-Data-START## Assembly Method :: CLC GWB v. 11 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7583 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="782-2" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA" /collection_date="17-Dec-2018" /note="genotype: GII.P16-GII.4" gene <1..5094 /gene="ORF1" CDS <1..5094 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QJA16354.1" /translation="MASNDATVAVACNNNNDKEKSSGEGLFTNMSFTLKKALGARPKQ PAPRDEPQKPPRPPTPELVKRIPPPPPNGEGEEEPVIRYEVKSGISGLPELTTVPQPD VANTAFSVPPLSLRENREAKEPLTGAILEMWDGEIYHYGLYVEKGLVLGVHKPPAAIS MARVELTPLSLYWRVVYTPQYLISPETLRRLNGEAFPYTAFDNNCYAFCCWVLDLNDS WLSRRMVQRTTGFFRPYQEWNRKPLPTMDDSKIKKVANIFLCSLSTLFTRPIKDLIGK IKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED LAVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KKEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDMEEEKARRLST KSASPDIVGTINALLARIAAARSLVHRAKEELSSRIRPVVVMISGKPGIGKTHLAREV ARKVASTLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELADTC PLTLNCDRIENKGKVFDSEVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK 
AKRDFPGQPDMWKDAFKPDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA SGLLHERMDEFELQGPTITTFNFDRNRITAFRQLAAENKYGLVDTMKVGNQLKGVKTM EELKQAIRNVTIKRCRIIYGGSTYDLESDGKGKVLVEKVKNTSVQTNNELAGALHHLK HARIRYYVKCVQEAVYSIIQIAGAAFVTTRIARRMNIQELWSKPQLDQNESETKEEAP KSEDDEFIISSKDIKEEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKY SIEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLVTG SEIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFEAPPSIWSRIVSFGSGWGFWVSPSL FITSTHVIPAGITEAFGVPIKQIQIHKSGEFCRFRFPRPIRPDVTGMILEEGAPEGTV ATVLIKRPTGELMPLAARMGTHATMKIQGRMVGGQMGMLLTGSNAKGMDLGTTPGDCG CPYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGYDKGTYCGAPILGPG GAPKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEPRG KPPRPSVLEAAKQTVINVLEQTLDPPQKWTYAQACASLDKTTSSGHPHHVRKNEFWNG ETFTGKLADQASKANLMFEEGKHMTPVYTAALKDELVKTEKIYRKIKKRLLWGSDLST MIRCARSFGGLMDEMKAHCISLPVRVGMNVNEDGPIIFEKHSRYKYHYDADYSRWDST QQRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPSVVDVGDFKITINEGLPSGVPCTSQ WNSIAHWLLTLCALSEVTKLSPDIIQANSMFSFYGDDEIVSTDIKLDPEQLTAKLKEY GLKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNH EDPNETMIPHSQRPIQLMALLGEASLHGPSFYSRISKLVITELKEGGMDFYVPRQEPM FRWMRFSDLSTWEGDRNLAPNFVNEDGVE" mat_peptide <1..990 /gene="ORF1" /product="p48" mat_peptide 991..2088 /gene="ORF1" /product="NTPase" mat_peptide 2089..2619 /gene="ORF1" /product="p22" mat_peptide 2620..3018 /gene="ORF1" /product="VPg" mat_peptide 3019..3561 /gene="ORF1" /product="Pro" mat_peptide 3562..5091 /gene="ORF1" /product="RdRp" gene 5075..6697 /gene="ORF2" CDS 5075..6697 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QJA16355.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG 
STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV" gene 6697..7503 /gene="ORF3" CDS 6697..7503 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QJA16356.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGPSNKSS NSSTATSVYSNQTISTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV" ORIGIN 1 atggcgtcta acgacgctac cgttgccgtt gcttgcaaca acaacaacga caaggaaaaa 61 tcttcaggtg aaggcttatt cacaaatatg tctttcacct taaagaaagc cctcggggct 121 aggcccaaac agcctgcccc gagagacgaa ccacaaaagc ccccaagacc accaaccccc 181 gagttggtca agaggatacc ccctcctcca cctaatggcg aaggagaaga agaaccagtc 241 attaggtatg aggttaagag tgggatctct ggcctgcccg agctcacaac agtcccccaa 301 ccggacgtgg ccaacacagc attcagtgtt ccaccactga gcttgagaga aaacagggag 361 gccaaggaac cgctaacagg ggcaatatta gagatgtggg atggagagat ataccactat 421 ggcctgtacg tggagaaagg cttagtgttg ggtgtgcaca aaccacctgc agccataagc 481 atggcaagag tggagctgac gccgctgtca ttgtactggc gtgtggtgta cactccccaa 541 tacctcatct cccctgaaac tctcaggagg ctcaacggag aggcgttccc ttacaccgcc 601 ttcgacaaca actgctacgc cttttgctgc tgggtgttag acctcaatga ctcatggctt 661 agcaggagga tggtgcaaag aacaacgggc ttcttcagac cttaccaaga gtggaacaga 721 aagcccctgc ctaccatgga tgactccaaa attaagaagg tagcaaatat attcctatgt 781 tcattgtcca cattattcac cagacccata aaagacctca tagggaaaat taaaccatta 841 aacatattga acatcctggc aacgtgtgac tggacgtttg ccggaatagt ggagtctctg 901 atattacttg ctgaactctt cggagttttc tggacgcccc cagatgtgtc tgctatgatc 961 gctcccttac tcggggacta cgagttgcaa gggccagaag acctcgccgt tgaactcgta 1021 cctgtggtaa tgggagggat tggtttggtg ttgggattca ccaaagagaa aattggcaaa 1081 atgttgtcct cagcagcatc aacactcagg gcttgcaaag atcttggtgc ctatggctta 1141 gagatactca agttggtcat gaagtggttc ttcccaaaga 
aagaggaggc caatgagcta 1201 gccatggtga gggccataga ggatgccgtg ctagatcttg aggcaataga aaataaccac 1261 atgacaaccc tgttgaaaga caaagacagc ttagcaacat acatgaaaac actggacatg 1321 gaggaggaga aagccagaag gttgtccaca aaatctgcat cccctgacat agttgggaca 1381 atcaacgccc tgctggctcg aatagcagcg gccaggtcat tagtccacag ggccaaggaa 1441 gagctatcta gcaggataag gccagtagtt gttatgatat ctggcaaacc aggaataggc 1501 aaaactcatc tggccaggga ggtggcaaga aaggtggcat ccactctcac aggggaccaa 1561 agagtcggac tcataccaag aaacggtgtg gaccattggg atgcatacaa aggtgagaga 1621 gtcgtgctgt gggacgacta tggcatgagt aaccccatcc atgatgctct tcgcatacaa 1681 gaattggctg atacgtgtcc ccttacctta aattgtgaca gaattgaaaa taagggaaaa 1741 gtttttgaca gtgaagtcat aataattaca acaaaccttg ccaatccagc cccacttgat 1801 tatgtcaact ttgaggcctg ttccaggaga attgatttcc tggtgtacgc tgaggcacca 1861 gaagtagaaa aggcaaaacg ggactttcct ggtcagccag atatgtggaa ggacgccttc 1921 aagccggact tttcacacat caagctacag cttgcacctc agggcggctt tgacaagaat 1981 ggcaacaccc cacatgggaa aggagtgatg aagaccctca ctaccggttc tctgattgcc 2041 cgtgcatcag gcctactaca tgagaggatg gatgaatttg aactccaagg tcccacaatc 2101 accaccttca atttcgaccg aaacagaatc acagcattca gacaattggc tgcagaaaac 2161 aagtatggat tggtggatac catgaaagtt ggcaatcaat taaaaggagt gaaaaccatg 2221 gaagaactca aacaagcaat cagaaatgtg accatcaaga ggtgccggat catctacggt 2281 ggctccacgt atgaccttga atctgatggc aagggcaaag ttttggtgga aaaggtcaag 2341 aacacctctg tacagaccaa caacgagttg gccggggccc tgcaccatct caaacacgcc 2401 cgaatcaggt actatgtcaa atgtgtgcaa gaagcagtct attccatcat acaaattgcc 2461 ggcgctgcgt ttgtcaccac gcgcattgca cgccgcatga acatacaaga actctggtcg 2521 aagccacaat tagatcaaaa tgaatcagag actaaggaag aggcccccaa atcagaagat 2581 gacgagttca tcatatcttc taaggacatc aaggaggaag gaaagaaggg caaaaacaaa 2641 actggccgtg gcaagaaaca cactgcattc tccagcaagg gcttgagcga tgaggagtat 2701 gacgagtaca agaggataag agaagagaga aatgggaagt actctataga ggagtatctt 2761 caagacagag acaggtacta tgaggagctc gccattgcca aggccacgga agaagacttc 2821 tgtgaagagg aggagataaa aatccgtcag agaattttcc gtcccaccag 
gaaacaaaga 2881 aaggaagaga gggccacatt aggactggta acaggttcag aaatcagaaa aagaaaccct 2941 gatgacttca aacccaaagg gaagctgtgg gccgatgaca acagaagtgt tgactataat 3001 gagaaactgg actttgaggc ccccccaagc atatggtcta ggattgtgag ctttggttct 3061 ggctggggct tctgggtatc accaagcctg ttcataacat caactcatgt aatccccgca 3121 ggcataacag aagcatttgg agtccccatc aaacaaattc agatccacaa atcaggtgaa 3181 ttttgccgat tcagattccc aagaccaatt agaccagacg tgacaggaat gatcttggaa 3241 gaaggtgcgc ctgaaggcac cgtggcaact gtgctcatca aacgccccac cggagagctc 3301 atgcctcttg cagccagaat gggaacacac gcaaccatga aaattcaagg ccgcatggtt 3361 ggcggacaga tgggtatgtt gctcactgga tcaaatgcta aaggaatgga tttgggaaca 3421 actcctggtg actgtggctg tccttacatc tacaaaaggg gcaatgacta tatagtcatt 3481 ggggtgcaca ctgcagcagc ccgtggtgga aacaccgtca tctgtgccac acagggaagt 3541 gagggtgagg caactcttga gggtggatat gacaaaggaa catactgtgg ggcacccatt 3601 ctaggccctg ggggtgcacc aaagttgagc accaaaacca aattttggag gtcatcgaac 3661 acgcccctcc caccagggac atatgagcct gcctacctcg gtggccgtga tccgcgtgtt 3721 aagggtgggc cctccttgca gcaggtaatg agagaccagt tgaagccatt cactgaaccc 3781 aggggcaaac ctccaagacc aagtgtattg gaagcagcca aacaaaccgt tatcaatgtc 3841 ctcgaacaaa ccctggatcc tccacaaaaa tggacatacg cacaggcgtg tgcctcactt 3901 gacaaaacca cttccagcgg gcatcctcat cacgtccgaa agaatgaatt ctggaatggt 3961 gagaccttca ccggcaaatt ggcagaccaa gcatcaaaag caaacctaat gtttgaggaa 4021 gggaaacaca tgacaccagt gtatacagca gcactcaagg acgagctagt caagactgag 4081 aaaatctata gaaagatcaa gaagagactg ctctggggct ctgacttgtc caccatgatc 4141 cggtgcgcta ggtcatttgg tgggctcatg gacgagatga aggcacactg catatcactc 4201 ccagtacgag ttggcatgaa tgtgaatgaa gatggcccaa taatatttga gaaacattcc 4261 agatacaaat accactatga cgcagactac tctcgttggg attcaacaca acagagggca 4321 gtactagcag cagccttgga aatcatggtc agattctctg cagaaccaca attggcacaa 4381 atagtcgctg aggatcttct ggcccctagc gtagtagatg taggagactt taaaatcact 4441 ataaatgaag ggctcccatc tggtgtgcca tgcacctccc aatggaactc catcgcacac 4501 tggctgctaa ctctctgtgc cttgtctgaa gtcaccaaac tgtcccctga cattatacaa 
4561 gcaaattcca tgttctcatt ttacggtgat gacgagattg tcagcaccga cataaaattg 4621 gaccctgaac agttaaccgc caagttgaag gagtatggcc tgaaaccaac ccgcccagac 4681 aagaccgagg gacccctgat catcagtgaa gatttgaacg gactcacttt cctccgaagg 4741 acggtgactc gtgacccagc tggctggttt ggaaaactgg accaaagttc aattttgagg 4801 cagatgtact ggactagagg accaaatcac gaagatccca atgagacaat gataccccat 4861 tctcaaagac ccatacagct catggcactg cttggtgaag cctctcttca cggaccctct 4921 ttctacagta gaatcagtaa attggtcata actgaactca aagaaggtgg gatggacttt 4981 tacgtgccaa ggcaggaacc catgttcagg tggatgaggt tttctgactt gagcacgtgg 5041 gagggcgatc gcaatctggc tcccaatttt gtgaatgaag atggcgtcga gtgacgccaa 5101 cccatctgat gggtccgcag ccaacctcgt accagaggtc aacaatgagg ttatggcttt 5161 ggagcccgtt gtcggtgccg ctattgcggc acctgtagcg ggccaacaaa atgtaattga 5221 cccctggatt agaaataatt ttgtacaagc ccctggtggg gagtttacag tatcccccag 5281 aaacgctcca ggtgaaatac tatggagcgc gcccctaggc cctgacctaa atccctacct 5341 atcccatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc aggtaattct 5401 cgcggggaac gcgttcaccg ccgggaagat catatttgca gcagtcccac caaattttcc 5461 aactgaaggc ttaagtccta gccaggtcac tatgttcccc catataatag tagatgttag 5521 acagttagaa cctgtgctga ttcctttacc cgatgttagg aataatttct atcattacaa 5581 tcagtcaaat gactccacta ttaagttgat agcaatgttg tacacaccac ttagggctaa 5641 taatgctggg gatgatgttt tcacagtttc gtgccgagtt ctcacgagac catcccccga 5701 ttttgatttc atatttttag tgccacccac agttgagtca agaactaaac cattctctgt 5761 cccagtttta actgttgagg agatgaccaa ttcaagattc cccatccctt tggaaaagct 5821 gttcacaggc cccagcagtg cctttgttgt tcaaccacaa aacggcaggt gcacaactga 5881 tggcgtgctc ctaggcacca cccaactttc tcctgtcaac atctgcacct tcagagggga 5941 tgtcacccac atcacaggta gtcgcaacta cacaatgaat ttggcttctc aaaattggaa 6001 caactatgac ccaacagaag aaatcccagc ccctctagga actccagatt ttgtggggaa 6061 gattcaaggc atgctcaccc aaaccacaag gacagatggt tcaacacgcg gccacaaagc 6121 tacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc aatttgaaac 6181 tgacacagac catgattttg aagctaatca aaacacaaag ttcaccccag tcggtgtcat 6241 
ccaagatggt agcaccaccc atcgaaacga accccaacag tgggtgctcc caagttactc 6301 aggcagaaat actcacaatg tacatctggc ccccgctgta gcccccacct ttccgggtga 6361 gcaacttctc ttcttcagat ccaccatgcc cggatgcagc gggtacccca acatggattt 6421 ggactgtctg ctcccccagg aatgggtgca gtacttctat caagaggcag ccccagcaca 6481 atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggttt tgtttgagtg 6541 caagcttcac aaatcaggct atgttacagt ggctcacact ggccaacatg atttggttat 6601 cccccccaat ggctacttta gatttgattc ctgggtcaac cagttctata cgcttgcccc 6661 catgggaaat ggaacggggc gtagacgtgt agtataatgg ctggagcttt ctttgctgga 6721 ttggcatctg atgtccttgg ctctggactt ggttccctca tcaatgctgg ggctggggcc 6781 atcaaccaaa aagttgagtt tgaaaataac agaaaattgc aacaagcatc cttccaattt 6841 agcagcaatc tgcaacaggc ttcctttcaa catgataaag agatgctcca agcacaaatt 6901 gaggccacta aaaagctaca acaggaaatg atgaaagtta agcaggcaat gctcctagag 6961 ggtgggttct ctgagacaga tgcagcccgc ggggcaatta acgcccccat gacaaaagct 7021 ttggactgga gtgggacaag gtactgggct cccgatgcta ggactacaac atacaatgca 7081 ggccgctttt ccacccctca accatcgggg gcactgccag gaagagctaa tcttagggat 7141 gctgtccctg ctcggggacc ctccaacaaa tcttctaact cttctactgc cacctctgtg 7201 tattcaaatc aaactatttc aacgagactt ggttctacag ctggttctgg aaccagtgtc 7261 tcgagcctcc cgtcaactgc aaggactagg agctgggttg aggatcaaaa taggaatttg 7321 tcacctttca tgaggggggc ccacaacata tcatttgtca ccccaccatc tagcagatcc 7381 tctagccaag gcacagtctc aaccgtgcct aaagagattt tggactcctg gactggcgct 7441 ttcaacacgc gcaggcagcc tctcttcgct cacattcgta agcgagggga gtcacgggtg 7501 taatgtgaaa agacaaaatt gattattttt ctttttcttt agtgtctttt actcacaatg 7561 tacatctggc ccccgctgta gcc //
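
The ORF coordinates listed for this record (ORF1: 1..5094, ORF2: 5075..6697, ORF3: 6697..7503) can be sanity-checked with a little arithmetic. The following is a minimal sketch, not part of the typing tool or of GenBank: the function names (`orf_length`, `reading_frame`, `overlap`, `summarize`) are hypothetical helpers introduced here for illustration, and the coordinates are copied by hand from the feature table above, assuming 1-based inclusive positions as in the GenBank format.

```python
# ORF coordinates copied from the feature table of MT344180.
# Assumption: 1-based, inclusive positions (GenBank convention).
ORFS = {
    "ORF1": (1, 5094),      # 5'-partial in the record ("<1..5094")
    "ORF2": (5075, 6697),
    "ORF3": (6697, 7503),
}


def orf_length(start, end):
    """Length in nucleotides of an inclusive interval."""
    return end - start + 1


def reading_frame(start):
    """Frame (0, 1 or 2) relative to position 1 of the record."""
    return (start - 1) % 3


def overlap(a, b):
    """Number of bases shared by two inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def summarize(orfs):
    """Per-ORF length, codon count (stop codon included), remainder, frame."""
    rows = {}
    for name, (start, end) in orfs.items():
        n = orf_length(start, end)
        rows[name] = {
            "length_nt": n,
            "codons": n // 3,
            "remainder": n % 3,
            "frame": reading_frame(start),
        }
    return rows


if __name__ == "__main__":
    for name, row in summarize(ORFS).items():
        print(name, row)
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]))
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]))
```

With these coordinates, every ORF length is a multiple of three, ORF1 and ORF2 share 20 nucleotides (5075..5094), and ORF2 and ORF3 share a single nucleotide (6697). ORF2 is read in a different frame from ORF1 and ORF3.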