Typing tool | Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| MW305541 | GII.3 | GII.P21 |
ORF1: 1..5069
ORF2: 5050..6696
ORF3: 6696..7460

LOCUS       MW305541 7479 bp RNA linear VRL 06-JUL-2021
DEFINITION  Norovirus GII isolate PNV016867 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION   MW305541
VERSION     MW305541.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1 (bases 1 to 7479)
  AUTHORS   Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I.
  TITLE     Genome-wide analyses of human noroviruses reveal coexistence of viral populations evolving under recombination constraints
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 7479)
  AUTHORS   Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I.
  TITLE     Direct Submission
  JOURNAL   Submitted (23-NOV-2020) CBER/OVRR/DVP/LHV, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method :: High-performance Integrated Virtual Environment (HIVE) v.
2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7479 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="PNV016867" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="Peru" /collection_date="14-May-2009" /note="genotype: GII.3[GII.P21]" gene <1..5069 /gene="ORF1" CDS <1..5069 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QPJ58959.1" /translation="AAANSNNDNAKSSSDGVLSNMAVTFKRALGARSKQPPPRDKPPK PPRPPTPELVKAIPPPPPNGEDEPIISYNVKEGVSGLPELSTVTQLEESSTAFSVPPL SQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELTPLSL YWRPVYTPQYLISPDTLRKLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTT GFFRPYQDWNRKPLPTMDDSKVKKVANVVLCALSSLFTRPIKDIIGKLKPLNILNILA TCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVVMG GIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEKNELAMV RSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDMEEEKARKLSTKSASPDIVGTI NALLSRIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIASSLTGD QRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIEN KGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPEVEKAKRDFPGQPDM WKDTFKPDFTHIKLTLAPQGGFDKNGNTPHGKGVMKTLTASSLVARASGLLHERLDEY ELQGPTPTTFNFDRNKVLAFRQLAAENKYGLVDTMRVGSQLKNVKTMTELKQALKNIS VKKCQLVYGGGTYTLESDGKGNVRVEKVNNTSVQTNNELSGALHHLRCARIRYYVKCV QEALYSILQIAGAAFITTRIAKRMNIQNLWSKPQAEDLEETNNEEGCPKPKNDEEFVI SSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDR DKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERVSLGLVTGSEIRKRNPD DFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIP QGAQELFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRPT GELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGN DYVVIGVHTAAARGGNTVICATQGNEGEAMLEGGDNKGTYCGAPILGPGNAPKLSTKT KFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPKPSVLE AAKKTIINVLEQTIDPPQKWTYAQACASLDKTTSSGHPYHMRKNDCWNGETFTGKLAD QASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGTIKKRLLWGSDLSTMIRCARAFG GLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWDSTQQRAVLATA LEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCTSQWNSIAHWLL 
TLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLKEYGLKPTRPDK TEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHEDPFETMIP HSQRPIQLMSLLGEAALHGPSFYSRISKLVISELKEGGMDFYVPRQEPMFRWMRFSDL STWEGDRNLAPSFVNEDGVE" mat_peptide <1..959 /gene="ORF1" /product="p48" mat_peptide 960..2057 /gene="ORF1" /product="NTPase" mat_peptide 2058..2594 /gene="ORF1" /product="p22" mat_peptide 2595..2993 /gene="ORF1" /product="VPg" mat_peptide 2994..3536 /gene="ORF1" /product="Pro" mat_peptide 3537..5066 /gene="ORF1" /product="RdRp" gene 5050..6696 /gene="ORF2" CDS 5050..6696 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QPJ58960.1" /translation="MKMASNDAAPSNDGAAGLVPEINNEAMALEPVAGAAIAAPLTGQ QNIIDPWIMNNFVQAPGGEFTVSPRNSPGEVLLNLELGPEINPYLAHLARMYNGYAGG FEVQVVLAGNAFTAGKIIFAAIPPNFPIDNLSAAQITMCPHVIVDVRQLEPVNLPMPD VRNNFFHYNQGSDSRLRLIAMLYTPLRANNSGDDVFTVSCRVLTRPSPDFSFNFLVPP TVESKTKPFSLPILTISEMSNSRFPVPIDSLHTSPTENIVVQCQNGRVTLDGELMGTT QLLPSQICAFRGVLTRSTSRASDQADTATPRLFNYYWHIQLDNLNGTPYDPAEDIPGP LGTPDFRGKVFGVASQRNPDTTTRAHEAKIDTTSGRFTPKLGSLEISTESDDFDQNKP TRFTPVGIGVDHEADFQQWTLPDYAGQFTHNMNLAPAVAPNFPGEQLLFFRSQLPSSG GRSNGILDCLVPQEWVQHFYQESAPSQSQVALVRYINPDTGRVLFEAKLHKLGFITIA KNGDSPITVPPNGYFRFESWVNPFYTLAPMGTGNGRRRIQ" gene 6696..7460 /gene="ORF3" CDS 6696..7460 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QPJ58961.1" /translation="MAGAFIAGLAGDMFTNTVGSLVNAGANAINQTIDFENNKYLQNA SFNHDKEMLNAQIEATKRLQADMIAIKQGVLTAGGFSPTDAARGAINAPMTKVLDWNG TRYWAPGATSTTSMSGGFTNQTVHRSTPNFKTNQAPKPTPSSGSSVRSNSTQITSLSS HSFGSSRSSGSTVVSSIPSSNRTRDWVNQQNFNLEPHMPGSLRTAFVTPPSSTASSSG TVSTVSKNVLDSWTSAFNTRRQPLFAHLRRRGESNV" ORIGIN 1 ccgctgctgc taacagcaac aacgacaacg caaaatcttc aagtgacgga gtattatcta 61 atatggctgt cacttttaaa cgagccctcg gggcgcggtc taaacagccg cccccgaggg 121 acaaaccacc aaaaccccca agaccaccca caccagaact ggttaaagca atcccccctc 181 ccccacccaa cggggaggat gaaccaatca tttcttacaa cgtcaaagag ggtgtttctg 241 gtctgcctga actctcaact gtcactcaac tggaagagag 
ttctacagca ttcagcgttc 301 cccctcttag tcagagggag aacagagatg caaaggaacc attgactgga accatcctgg 361 agatgtggga cggggagatt taccattatg gcttatacgt ggaacgagga ttagtactag 421 gtgtacacaa accaccagca gccatcagcc ttgccaaggt tgagctgacc cccttgtctt 481 tgtattggag accagtgtac accccacagt acctcatttc ccctgacact ctcaggaagt 541 tgcatgggga gacgttccct tatacagcct ttgacaacaa ctgttatgcc ttctgctgtt 601 gggtgcttga cttgaacgat tcatggctga gcagaaggat gatacagagg actacaggct 661 ttttccgacc ctatcaggac tggaacagga aacccctccc cacgatggat gactccaaag 721 tgaaaaaggt ggccaatgtt gttctctgtg ctctctcatc attgtttaca agaccaatta 781 aggacattat tgggaagttg aaacccctaa acatcctcaa catcctagcc acatgtgatt 841 ggacttttgc aggcatagtg gaatccctga tccttttggc tgaactgttt ggagttttct 901 ggacaccccc agatgtgtct gcaatgatcg ctcccttact gggtgattat gagctgcagg 961 gacccgagga cctcgctgtg gaactcgtac ccgtagtaat gggagggata ggtttggtgc 1021 tgggattcac caaagagaaa atcggcaaaa tgctttcatc tgctgcttca accctgaggg 1081 catgtaaaga tctcggtgca tatgggttgg agatcctcaa gttggtcatg aaatggttct 1141 tcccaaagaa agaagagaaa aacgaattgg caatggtgag atccatcgag gatgcagtgc 1201 tagatcttga ggccattgag aacaaccata tgacagctct actcaaggat aaagacagcc 1261 ttgcaaccta catgaggact cttgacatgg aggaggagaa ggcgaggaaa ctttccacca 1321 agtctgcctc gccggatatt gtgggcacga taaacgccct actgtcaagg attgcagctg 1381 cccggtctct agtgcacagg gctaaggagg agctgtcaag caggccccga ccggttgttg 1441 tgatgatttc agggagacca ggtataggga aaacccatct agctagagag ttggcaaaga 1501 agatcgcctc ctcacttaca ggtgaccaga gggtgggtct tatcccacgc aacggggtcg 1561 accactggga tgcatacaaa ggtgaaagag tcgttctctg ggacgactac gggatgagca 1621 accctatcca tgacgccctc agactccagg agcttgctga cacctgtcct ctcacactca 1681 actgtgacag gattgagaac aaaggcaaag tctttgatag tgatgccata ataataacca 1741 ccaacctggc caacccagcg ccactggatt acgtcaactt tgaagcttgc tcaagacgca 1801 tagacttcct cgtctatgct gatgcccctg aggttgagaa ggctaaacgg gacttcccag 1861 gccaaccaga catgtggaaa gacaccttta aacctgattt cacacacata aaactgacat 1921 tggccccgca aggaggtttt gacaagaatg gtaacactcc tcatgggaag ggtgtcatga 
1981 aaaccctcac tgccagttcc ctcgttgccc gagcatcagg gctcctccac gagagattag 2041 acgaatatga gctgcagggc ccaactccca caacattcaa ctttgaccgc aacaaggtgc 2101 ttgcttttag gcaacttgcc gctgaaaaca agtacggtct tgttgacaca atgagggtcg 2161 gatcacagct caaaaatgtc aaaaccatga cagaactcaa acaggctctc aagaatatct 2221 cagtcaagaa atgtcagctt gtgtatggtg ggggcacgta cacacttgaa tctgatggta 2281 aaggtaatgt gcgtgtcgag aaggtgaaca acaccagtgt gcaaactaac aacgagctct 2341 ccggggcttt gcaccatctc aggtgtgcca gaatcaggta ttatgttaag tgtgtccagg 2401 aagctctcta ctccatcttg caaattgccg gggctgcatt tattaccacg cgcattgcaa 2461 agcgcatgaa catacaaaac ctctggtcca aaccacaagc agaagatctg gaggagacta 2521 acaacgagga gggttgtcca aaacctaaaa atgatgagga atttgtcatc tcctctgatg 2581 acatcaaaac cgagggcaaa aaaggaaaga acaaaactgg ccgtggcaag aagcacacag 2641 ccttttccag caaaggactc agtgatgagg agtacgatga gtacaaaaga atcagggagg 2701 agagaaatgg caagtactcc atagaggaat acctacagga cagagacaaa tactatgagg 2761 aggtggccat agccagggca actgaggaag acttctgtga agaagaggag gcaaaaatcc 2821 ggcaaaggat atttagacca acaaggaaac aacgcaaaga ggaaagagtc tctcttggtt 2881 tggttacagg ttctgagatc aggaagagaa acccagatga cttcaaaccc aaagggaaac 2941 tatgggccga tgatgacagg agtgttgact acaatgagaa actcagtttc gaggctccac 3001 caagcatttg gtcacgaata gtcaactttg gctcagggtg gggcttttgg gtctcgccca 3061 gcctcttcat aacatcaacc catgtcattc cccaaggcgc acaggagctc tttggagtgc 3121 ccatcaaaca aatacaagtt cacaagtcag gtgagttctg ccggcttagg ttcccaaaac 3181 caatcaggac agacgtcaca ggcatgatct tggaggaagg tgctccagaa ggcactgttg 3241 ccacacttct catcaagaga ccaactgggg agctcatgcc cttggcagcc agaatgggca 3301 cccacgcaac catgaaaatt cagggtcgca ctgttggtgg acagatgggc atgctactca 3361 cagggtctaa cgctaagagc atggacttgg gcacaactcc tggcgattgt ggttgtccct 3421 atatctacaa gagagggaac gactacgtgg ttattggagt tcacactgcc gccgctcgtg 3481 gaggaaacac cgtcatctgt gcaacccaag gaaacgaggg tgaggccatg ctagaaggtg 3541 gtgacaacaa gggaacttat tgtggagcac caatattagg ccctggaaat gcccccaaac 3601 tcagcaccaa aaccaaattc tggaggtctt ccaccacccc cctgccaccc ggaacctatg 3661 
agccagctta tctgggtggc aaggacccta gagtgaaagg tggcccctca ctgcaacagg 3721 ttatgaggga ccaactaaaa ccattcactg agcccagagg caaaccaccc aaaccaagtg 3781 tgctagaagc tgccaagaag actataatca atgtgcttga acaaacaata gacccacctc 3841 aaaaatggac atacgcacag gcgtgtgcat cactagacaa gaccacttcc agcggccacc 3901 cttaccacat gcggaagaac gattgctgga atggggagac tttcacagga aaactggcag 3961 accaagcatc aaaggctaac ctaatgtttg aggaaggaaa gaacatgacc ccagtgtaca 4021 caggagctct gaaagatgag ctagtcaaga ctgataagat ctatgggaca atcaagaaga 4081 gactcctgtg gggttcagac ctatcaacca tgatacggtg tgcacgagcc ttcggtgggc 4141 taatggacga actcaaggcc cattgcgtca cactaccagt tagggttggt atgaacatga 4201 atgaggatgg acccataata tttgagaaac attccagata caaataccat tatgatgcag 4261 attattcccg ctgggactca acacaacaaa gagcagtgct agctacagcc ctggaaataa 4321 tggtcaaatt ctcaccagaa ccccatctgg cccaggtggt tgcagaagac ctcttgtccc 4381 ccagtgtgat ggatgtgggt gatttcaaga tatcaatcaa cgagggatta ccctctggtg 4441 ttccctgcac ctcacaatgg aactccattg cccactggct cctcacacta tgtgcactgt 4501 ctgaagtcac agacctgtcc cctgacatca tccaggcaaa ttccctgttc tccttttatg 4561 gtgatgatga aatagtgagc acagacatca aactggaccc agagaaatta acaacaaaat 4621 tgaaggagta cgggctaaaa ccaacccgtc ctgacaaaac agaaggaccc ttaattatct 4681 ctgaagattt ggatggcctg accttcttac ggagaacggt gacccgtgat ccggccgggt 4741 ggtttggcaa actggaccaa agttcaatac tcagacagat gtactggacc aggggaccaa 4801 accatgagga ccccttcgaa acaatgatac cacactccca aagacccata caactgatgt 4861 cattattggg tgaagccgcg ttgcatggtc catcattcta cagtaggatc agcaaattgg 4921 tcatctcaga attgaaagag ggtggaatgg atttttacgt gcccagacaa gaaccaatgt 4981 tcaggtggat gagattctca gatttgagca cgtgggaggg cgatcgcaat ctggctccca 5041 gttttgtgaa tgaagatggc gtcgaatgac gccgctccat ctaatgatgg tgccgccggc 5101 ctcgtcccag agatcaacaa tgaggcaatg gcgctagagc cagtggcggg tgcagcgata 5161 gcagcacccc tcactggtca gcaaaatata attgatccct ggattatgaa taattttgtg 5221 caagcacctg gtggtgagtt tacagtatcc cctagaaatt cccctggtga agttcttctt 5281 aatttggaat tgggcccaga aataaacccc tatttggccc atcttgctag aatgtataat 5341 ggttatgcag 
gtggatttga agtgcaggtg gtcctagctg gaaatgcgtt tacagcagga 5401 aagataatct ttgcagcaat tccccccaat tttccaattg acaatctaag tgcagcacag 5461 atcacaatgt gtccacatgt gattgtggat gtcagacagc tggaaccagt caacctcccg 5521 atgcctgacg ttcgcaacaa tttctttcat tacaatcaag ggtctgattc gagattgcgc 5581 ctaattgcaa tgctatatac acctcttagg gcaaataatt ctggggatga tgtttttact 5641 gtgtcttgca gagtgctaac tagacctagc cctgacttct catttaattt ccttgtgcca 5701 cctactgtgg agtcaaagac aaaacccttt tccctcccta ttctgactat ctctgaaatg 5761 tccaattcta ggttcccagt accaattgat tctctgcaca ccagccctac tgagaatatt 5821 gttgtccagt gtcagaatgg gcgcgtcact cttgatggtg agttgatggg cactacccaa 5881 ctcttaccta gccaaatctg tgctttcagg ggcgtgctca ccagatcaac aagcagggct 5941 agtgaccagg ccgatacagc aacccctaga ttgtttaatt attattggca catacaattg 6001 gataatctaa atggaactcc ttatgaccct gcagaagata taccaggccc cctagggaca 6061 ccagatttcc ggggcaaagt ctttggcgtg gccagccaga gaaatcctga tactacaact 6121 agggcacatg aagcaaagat agacacaaca tctggccgct tcaccccaaa attaggctca 6181 ttagagattt ctactgaatc tgatgatttt gatcaaaaca aaccaacaag attcacccca 6241 gttggcattg gggttgacca tgaggcagac tttcaacaat ggactcttcc cgactatgct 6301 ggccagttca cccacaacat gaacttggcc ccagctgttg ctcccaactt ccctggtgag 6361 caactccttt tcttccgctc acagttgcca tcttctggtg ggcgatccaa cgggattcta 6421 gactgcctgg tcccccaaga atgggtacag cacttctacc aagaatcagc cccctcccaa 6481 tctcaagtgg ccctggttag gtatatcaac cctgacactg gtagagtgtt atttgaggcc 6541 aagctgcaca aattaggttt cataactata gccaagaatg gtgactctcc aataaccgtc 6601 cctccaaatg gatattttag gtttgaatct tgggtgaacc ccttttatac acttgccccc 6661 atgggaacag ggaatgggcg tagaagaatt caataatggc tggagctttt atagcaggat 6721 tggctggtga catgttcaca aatactgtag gatctttagt taatgcaggg gctaatgcca 6781 ttaatcaaac aattgatttt gaaaataata aatatttgca aaatgcttct tttaatcatg 6841 ataaggagat gttgaatgca caaattgagg caacaaagag gttacaggct gacatgattg 6901 ctatcaagca aggggttttg accgctggcg gcttctcccc tactgatgca gcccgcgggg 6961 caattaatgc ccccatgaca aaagtcctag attggaatgg aacgagatac tgggcaccag 7021 gtgccacctc cacaacctcg 
atgtcgggtg gctttacgaa tcaaactgtg cacagatcta 7081 caccaaattt taaaacgaac caggctccca aacccacacc cagcagtgga tcttcagtga 7141 ggtcaaattc aacccaaatc actagcctga gctcacactc gttcgggtcg tctcgatcca 7201 gcgggtctac agttgtcagc tcaataccat cctctaacag gactagggac tgggtcaacc 7261 aacaaaattt taatttggaa ccacacatgc ctggatccct taggacagct tttgtcactc 7321 caccatctag tacagcctct agctcaggca cagtctcaac tgtgtctaaa aatgttttgg 7381 actcctggac atctgcgttt aacacgcgca gacagccgct atttgcacac cttcgcagaa 7441 ggggggagtc aaatgtttag tgaaaagatt attttaaat //
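
The ORF coordinates listed for this record (ORF1 <1..5069 with codon_start=3, ORF2 5050..6696, ORF3 6696..7460) can be checked for internal consistency with a few lines of arithmetic. The following is a minimal sketch, not part of the typing tool: the `Orf` class and the `overlap` helper are hypothetical names introduced here for illustration, and the coordinates are copied by hand from the record above. It assumes the usual GenBank convention that a complete CDS range includes the stop codon.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    """Hypothetical container for one CDS feature (1-based, inclusive coordinates)."""
    name: str
    start: int
    end: int
    codon_start: int = 1  # GenBank /codon_start qualifier

    def first_coding_base(self) -> int:
        # codon_start=3 means the first complete codon begins at start + 2.
        return self.start + self.codon_start - 1

    def codons(self) -> int:
        length = self.end - self.first_coding_base() + 1
        if length % 3 != 0:
            raise ValueError(f"{self.name}: {length} nt is not a multiple of 3")
        return length // 3

    def frame(self) -> int:
        # Reading frame (0, 1 or 2) relative to position 1 of this record.
        return (self.first_coding_base() - 1) % 3


def overlap(a: Orf, b: Orf) -> int:
    """Number of nucleotides shared by two features (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Coordinates copied from the record above (assumed to be transcribed correctly).
ORF1 = Orf("ORF1", 1, 5069, codon_start=3)
ORF2 = Orf("ORF2", 5050, 6696)
ORF3 = Orf("ORF3", 6696, 7460)

if __name__ == "__main__":
    for orf in (ORF1, ORF2, ORF3):
        print(f"{orf.name}: {orf.codons()} codons, frame {orf.frame()}")
    print("ORF1/ORF2 overlap:", overlap(ORF1, ORF2), "nt")
    print("ORF2/ORF3 overlap:", overlap(ORF2, ORF3), "nt")
```

Under these assumptions, ORF2 and ORF3 share a single nucleotide (position 6696), and ORF1 overlaps the start of ORF2 by 20 nt. The ORF1 range ends three bases after the RdRp mat_peptide (5066), which is consistent with a stop codon at 5067..5069.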