Typing tool
|
Complete norovirus genomes
MW305600 | GII.2 | ||
---|---|---|---|
GII.P21 |
ORF1: 1..5069 ORF2: 5050..6678 ORF3: 6678..7445LOCUS MW305600 7482 bp RNA linear VRL 06-JUL-2021 DEFINITION Norovirus GII isolate Arg4220 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MW305600 VERSION MW305600.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7482) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Genome-wide analyses of human noroviruses reveal coexistence of viral populations evolving under recombination constraints JOURNAL Unpublished REFERENCE 2 (bases 1 to 7482) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Direct Submission JOURNAL Submitted (23-NOV-2020) CBER/OVRR/DVP/LHV, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA COMMENT ##Assembly-Data-START## Assembly Method :: High-performance Integrated Virtual Environment (HIVE) v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7482 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="Arg4220" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="Argentina" /collection_date="2004" /note="genotype: GII.2[GII.P21]" gene <1..5069 /gene="ORF1" CDS <1..5069 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QPJ59136.1" /translation="TVANSNNDNAKSSSDGVLSNMAVTFKRALGARPKQPPPRDKPPR PPRPPTPELVKRIPPPPPNGEDEPTISYNVKEGVSGLPELSTVTQPEESSTAFSVPPL SLRENRDAKEPLTGAILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELTPLSL YWRPVYTPQYLISPDTLRKLHGETFPYTAFDNNCYAFCCWVLNLNDSWLSRRMIQRTT GFFRPYQDWNRKPLPTMDDSKVKKVANVLLCALSSLFTRPIKDIIGKLKPLNILNILA TCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVVMG GIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKDETNELAMV RSIEDAVLDLEAIENNHMTALLKDKDSLAAYMRTLDLEEEKARKLSTKSASPDIVGTI NALLSRIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIASSLTGD QRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIEN KGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPEVEKAKRDFPGQPDM WKDTFKPDFSHIKLTLAPQGGFDKNGNTPHGKGVMKTLTASSLVARASGLLHERLDEY ELQGPTPTTFNFDRNKVLAFRQLAAENKYGLVDTMRVGSQLKNVKTMTELKQALKNIS VKKCQLVYGGSTYTLESDGKGNVHVEKVNNTSVQTNNELSGALHHLRCARIRYYVKCV QEALYSILQIAGAAFITTRIAKRMNIQNLWSKPQVEDLEETSSEEGCPKPKNDEEFVI SSDDIKAEGKKGKNKAGRGKKHTAFSSKGLSDEEYEEYKRIREERNGKYSIEEYLQDR DKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPD DFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIP QGAQEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRPT GELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGN DYVVIGVHTAAARGGNTVICATQGSEGEAMLEGGDNKGTYCGAPILGPGNAPKLSTKT KFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPKPSVLE AAKKTIINVLEQTIDPPQKWTYAQACASLDKTTSSGHPHHMRKNECWNGETFTGKLAD QASKANLMYEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLSTMIRCARAFG GLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRAVLAAA LEIMVKFSSEPHLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCTSQWNSIAHWLL TLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLREYGLKPTRPDK TEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHEDPSETMIP HSQRPIQLMSLLGEAALHGPSFYSKISKLVISELKEGGMDFYVPRQEPMFRWMRFSDL STWEGDRNLAPSFVNEDGVE" mat_peptide <1..959 /gene="ORF1" /product="p48" mat_peptide 960..2057 /gene="ORF1" /product="NTPase" mat_peptide 2058..2594 /gene="ORF1" /product="p22" mat_peptide 2595..2993 /gene="ORF1" /product="VPg" mat_peptide 2994..3536 /gene="ORF1" /product="Pro" mat_peptide 3537..5066 /gene="ORF1" /product="RdRp" gene 5050..6678 /gene="ORF2" CDS 5050..6678 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QPJ59137.1" /translation="MKMASNDAAPSTDGAAGLVPESNNEVMALEPVAGAALAAPVTGQ TNIIDPWIRTNFVQAPNGEFTVSPRNAPGEVLLNLELGPELNPYLAHLARMYNGYAGG MEVQVMLAGNAFTAGKLVFAAVPPHFPVENLSPQQITMFPHVIIDVRTLEPVLLPLPD VRNNFFHYNQKDDPKMRIVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFTYLVPP TVESKTKPFTLPILTLGEMSNSRFPVPIDQMYTSPNEVISVQCQNGRCTLDGELQGTT QLQVSGICAFKGQVTAHLQDNGHLYNVTITNLNGSPFDPSEDIPAPLGVPDFEGRVFG VISQRDNYNFGGQNQPANRSHDAVVPTYTAQYTPKLGQIQIGTWQTDDITVDQPAKFT PVGLSDTDHFNQWVVPRYAGALNLNTNLAPSVAPVFPGERLLFFRSYIPLKGGFGNPA IDCLVPQEWIQHFYQEAAPSLSEVALVRYINPDTGRALFEAKLHRAGFMTVSSNTSAP VVVPANGYFRFDSWVNQFYSLAPMGTGNGRRRIQ" gene 6678..7445 /gene="ORF3" CDS 6678..7445 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QPJ59138.1" /translation="MAGAFIAGLAGDVLSNGLGSLINAGANVINQRAEFDFNRQLQQN SFNHDKEMLQAQIQATKQLQADMMAIKQGVLTAGGFSPADAARGAVNAPMTQALDWNG TRYWAPSSMRTTSYSGRFTSTTPIRQADFQHTQTRPSSGSSVASYSTQSSRPTSITTA GSSTTSNSTRSTNLPSTVSRATSRTSEWVRDQNRNLEPYMRGALQTAFVTPPSSRASD GTVSTVPKGVLDSWTPAFNTRRQPLFAHLRKRGESQA" ORIGIN 1 ccactgttgc taacagcaac aacgacaacg caaaatcttc aagtgacgga gtattatcta 61 atatggctgt cactttcaaa cgagccctcg gggcgcggcc taaacaaccg cccccgaggg 121 acaaaccacc aagaccccca agaccaccca caccagagtt ggttaagaga atcccccctc 181 ccccacccaa cggggaggac gagccaacca tttcttacaa tgtcaaagag ggtgtttctg 241 gtttgcccga actctcgacg gtcacccaac cggaagagag ctctacggca ttcagcgtcc 301 cccctcttag tctgagggag aacagagatg caaaggaacc cttgactgga gccatcttgg 361 aaatgtggga cggggagatc taccattatg gcttgtacgt ggaacgagga ttggtgctcg 421 gtgtacacaa accaccagca gccattagcc ttgctaaggt tgagttgacc cccttgtctt 481 tgtattggag accagtgtac actccacagt acctcatttc ccctgacact ctcagaaagc 541 tgcatgggga gacgttccct tacacagcct ttgacaacaa ctgttatgcc ttctgctgtt 601 gggttcttaa cttgaacgat tcatggctga gcaggaggat gatacagagg acaacgggct 661 tctttcgacc ctaccaggat tggaacagga aacctctccc cacgatggat gactccaaag 721 tgaagaaggt agccaatgtt ctcctctgtg ctctctcatc attgtttacc agaccaatta 781 aggatattat tggaaagttg aaacccctaa acatcctcaa catcctagcc acatgtgatt 841 ggactttcgc aggcatagta gagtccctga tactcttggc tgaactgttt ggagttttct 901 ggacaccccc agacgtgtct gcaatgatcg ctcccttact gggtgattat gagctacagg 961 ggcccgagga cctcgctgtg gaactcgtac ccgtagtgat gggagggata ggtttggtgc 1021 tgggattcac caaagagaag atcggcaaaa tgctttcatc tgctgcttca accctgagag 1081 catgtaaaga tctcggtgca tatgggctgg agatcctcaa gttggtcatg aaatggtttt 1141 tcccaaagaa agacgagaca aacgaactgg cgatggtgag atccatcgag gatgcagtgc 1201 tggacctcga ggccattgag aacaaccata tgacagcact gctcaaggat aaggatagcc 1261 ttgcagccta catgaggact cttgacttgg aggaggagaa ggcgaggaag ttgtccacca 1321 agtctgcctc accggacatc gtgggcacga tcaacgccct actttcaagg attgcagccg 1381 cccggtctct agtacacagg gccaaggagg aactgtcaag taggccccga ccggtcgttg 1441 tgatgatttc aggaagacca ggtataggga aaacccatct agctagagag ttggcaaaga 1501 agatcgcctc ctcactcaca ggtgaccaga gggtgggcct catcccacgc aatggagtcg 1561 accactggga tgcatacaaa ggtgaaagag tcgttctctg ggacgactat gggatgagca 1621 accctatcca cgatgccctc agactccagg aactcgctga cacctgcccc ctgacactta 1681 actgtgacag gattgagaac aaaggcaaag tctttgacag tgacgctata ataataacca 1741 ccaatttagc caacccagcg ccactggatt atgtcaactt tgaagcttgc tcaaggcgta 1801 tagacttcct cgtctatgct gatgcccctg aagttgagaa ggccaaacga gacttcccag 1861 gccagccaga tatgtggaaa gacaccttca aacctgattt ctcacacata aaactgacac 1921 tagccccaca agggggtttt gacaagaatg gcaacacccc tcatgggaag ggtgtcatga 1981 agaccctcac tgctagttcc ctcgttgccc gagcatcagg gctccttcac gagagactag 2041 acgagtatga gctgcaaggc ccaactccca caacattcaa cttcgaccgt aacaaggtgc 2101 ttgcttttag gcaacttgct gctgagaaca agtacggtct tgttgacaca atgagggtcg 2161 gatcgcagct caagaatgtc aaaaccatga cagaactcaa acaggctctc aagaacatct 2221 cagtcaagaa atgccagctt gtgtatggcg ggagcacgta cacacttgaa tctgatggca 2281 aaggcaatgt gcatgtcgag aaggtaaaca acaccagtgt gcaaactaac aacgagctct 2341 ccggggcttt gcaccatctc aggtgtgcca ggattaggta ttatgtcaag tgtgtccagg 2401 aagctctcta ctccattttg caaattgccg gggctgcgtt cattaccacg cgcattgcaa 2461 agcgcatgaa tatacaaaac ctctggtcca aaccacaggt ggaggatttg gaggagacta 2521 gcagcgagga gggttgccca aaacctaaaa atgatgagga atttgttatt tcctccgatg 2581 acatcaaagc cgagggcaag aaaggaaaga ataaagctgg ccgtggcaag aagcacacag 2641 ccttttccag taaaggtctc agtgacgagg agtacgaaga gtacaagaga atcagagaag 2701 agagaaatgg aaagtactcc atagaggaat acctacagga cagagacaag tactatgagg 2761 aggtggccat agccagggca actgaggaag atttctgtga agaagaggag gccaaaatcc 2821 ggcaaaggat attcagacca acaagaaaac aacgcaaaga ggaaagggcc tctcttggtt 2881 tggtcacagg ttctgagatc aggaagagaa acccagatga cttcaagccc aaagggaaac 2941 tatgggctga tgatgacagg agtgttgact acaatgagaa acttagtttc gaggctccac 3001 caagcatttg gtcacgaata gtcaactttg gttcagggtg gggcttttgg gtctcaccta 3061 gcctcttcat aacatcaacc catgtcattc cccagggcgc acaggagttc tttggagtgc 3121 ccatcaaaca gatacaaatt cataaatcag gtgagttctg tcggcttagg ttcccaaaac 3181 caatcaggac agatgtcacg ggcatgatct tagaggaagg tgctccagaa ggcactgttg 3241 ccacacttct catcaagaga ccaactgggg aactcatgcc cttggcagcc agaatgggca 3301 cccacgcgac catgaaaatc cagggtcgca ctgttggtgg gcagatgggt atgctactca 3361 cagggtctaa cgccaagagc atggatttgg gcacaactcc tggcgattgt ggttgtccct 3421 acatctacaa gaggggtaac gactacgtgg tcattggagt tcacaccgct gccgctcgtg 3481 gaggaaacac tgtcatctgt gcgacccaag gaagcgaggg tgaagccatg ctagagggtg 3541 gcgacaacaa gggaacctac tgtggagcac caatactagg ccctgggaat gcccccaaac 3601 tcagcactaa aaccaaattc tggaggtcat ccaccacccc cctgccaccc ggaacctatg 3661 aaccagctta tctgggtggc aaggacccta gagtgaaagg tggtccctca ctgcaacagg 3721 ttatgaggga tcaactaaaa ccattcactg agcccagagg caaaccaccc aaaccaagtg 3781 tgttagaagc tgccaagaag actataatca atgtgcttga acagacaata gacccacctc 3841 aaaaatggac atacgcacag gcgtgtgcat cactagataa gaccacttcc agcggccacc 3901 ctcaccacat gcggaagaac gagtgctgga acggggagac tttcacaggg aaactggcag 3961 accaagcatc aaaggccaac ctaatgtatg aggaaggaaa gaacatgacc ccagtgtaca 4021 caggagcttt gaaggacgag ctagtcaaga ctgataagat ctatgggaag atcaaaaaga 4081 gactcctatg ggggtcagac ctatcaacca tgatacggtg tgcacgagcc ttcggtggac 4141 tgatggacga gctcaaggcc cattgcgtca cactaccagt cagggttggc atgaatatga 4201 atgaggatgg acccataata tttgagaaac attccagata cagataccat tatgatgcag 4261 attactcccg ctgggactcg acacaacaaa gagcagtact ggctgcagct ctggaaataa 4321 tggtcaaatt ctcatcagaa cctcatctgg cccaagtggt tgcggaagac ctcttgtccc 4381 ccagtgtgat ggatgtgggt gatttcaaga tatcaatcaa tgaggggtta ccctctggcg 4441 taccttgcac ctcacaatgg aactccattg cccattggct cctcacacta tgtgcactgt 4501 ctgaagtcac agatctgtcc cctgacatca tccaggcaaa ctccctgttc tccttttatg 4561 gtgatgatga aatagtgagc acagatatca aattggaccc agaaaaattg acaacaaaat 4621 tgagggaata cgggctaaaa ccaacccgtc ctgataaaac agaaggaccc ttaattatct 4681 ctgaagattt ggatggcctg accttcttac ggagaacggt gacccgcgat ccggctgggt 4741 ggtttggcaa actggaccaa agttcaatac tcagacagat gtactggacc aggggaccaa 4801 accatgaaga cccctctgaa acaatgatac cacactccca aagacccata caactgatgt 4861 cactactggg tgaagctgca ttacatggcc catcattcta cagtaaaatc agcaaactgg 4921 tcatctcaga attgaaagag ggtgggatgg atttttacgt gcccaggcaa gaaccaatgt 4981 ttaggtggat gagattctca gatttgagca cgtgggaggg cgatcgcaat ctggctccca 5041 gttttgtgaa tgaagatggc gtcgaatgac gccgctccat ctactgatgg tgcagccggc 5101 ctcgtgccag aaagtaacaa tgaggtcatg gctcttgagc cagtggctgg cgctgccttg 5161 gcggctccag tcactggcca aacaaatatt atagaccctt ggattagaac aaattttgtc 5221 caggccccta atggtgaatt cactgtttct ccacgcaacg cccctggtga agtgctgtta 5281 aatctagagt tgggtccaga attaaatcct tatctggcac atttagcaag aatgtacaac 5341 gggtatgccg gcggaatgga ggtgcaggtc atgttagctg gaaatgcgtt tacagctgga 5401 aagttggtct tcgccgcagt cccacctcac tttcccgttg aaaatcttag cccacagcaa 5461 atcacaatgt ttccccatgt gattattgat gttagaactt tggaacctgt cctgttacca 5521 ctccctgatg ttagaaataa tttctttcat tataatcaaa aggatgatcc taagatgaga 5581 attgtggcta tgctctatac tcccctcaga tcaaatggtt ctggtgatga tgtattcaca 5641 gtttcttgta gagtgttaac taggccctcc cctgatttcg attttacgta tttggtgcca 5701 ccaacggtgg aatccaaaac aaagccattc acactcccaa tcctcacatt aggggagatg 5761 tccaattcaa gatttccagt gcctatagat caaatgtaca ctagccctaa tgaagttatt 5821 tctgtgcaat gtcagaatgg caggtgcaca ctggacgggg agctccaagg gacaacccag 5881 ctccaggtca gtggtatttg cgcattcaag ggacaagtga ccgcccacct acaggacaat 5941 ggacatcttt acaatgtcac catcacaaat ttgaatgggt ccccttttga cccctctgag 6001 gacatccccg ctcccctggg tgttcccgac tttgaaggga gggttttcgg tgtcatcagc 6061 caaagagata attacaattt tggtggacaa aaccagcctg caaataggtc tcatgatgct 6121 gtggtcccaa cttacacagc tcaatatact ccaaaattgg gtcaaattca aattggcacg 6181 tggcagactg atgacataac agtcgaccag cctgctaaat tcaccccagt tggccttagt 6241 gacacagacc actttaacca atgggtggtc cccaggtatg ctggtgcatt aaatttgaac 6301 acaaatcttg ccccttccgt tgcaccggtg ttcccaggag agcggttgct tttctttaga 6361 tcatacatcc cccttaaagg tggttttggt aacccagcca ttgattgcct ggtgccacag 6421 gagtggatac agcacttcta ccaagaagct gctccctcac tgagtgaggt cgcccttgtc 6481 aggtatatca atccggacac tggtcgggca ctgtttgagg ccaagctcca tagggctggc 6541 ttcatgacag tctcgagtaa caccagtgcc ccggtggttg tacctgccaa tgggtacttc 6601 agatttgatt cttgggtgaa ccaattttat tctcttgccc ccatgggaac tggcaatggg 6661 cgtagaagga ttcaataatg gctggtgctt ttattgctgg tttagcagga gacgtgctca 6721 gcaatgggct cggctcatta atcaatgcag gtgctaatgt aattaatcag agagcagaat 6781 ttgattttaa tcgacagtta caacaaaatt cttttaatca tgataaggag atgttgcagg 6841 ctcaaataca ggcaaccaag cagctgcagg ctgacatgat ggctataaag cagggggtgt 6901 tgactgctgg cggcttctcc cccgctgatg cagccagggg cgctgtaaat gcgcccatga 6961 cacaagcact agattggaat ggcacaaggt actgggcacc aagttccatg agaacaacat 7021 cctattccgg aagattcaca tcaaccactc caataagaca ggctgatttt caacacaccc 7081 aaacccggcc ttcaagtggt tcttccgtgg cctcctactc cacacagtcc tcaagaccaa 7141 cctcaataac aactgcgggg tcttcaacca cgtccaactc gacccgcagt acaaatctcc 7201 cctcgacagt ttctagggcc acctctagaa ctagtgagtg ggttagagac caaaatagga 7261 atttggaacc ctacatgcgt ggtgccttac aaacagcctt tgtcacacca ccttctagca 7321 gggcatctga tgggacggtc tcaaccgttc ccaagggtgt tttggactcc tggacacctg 7381 cgtttaacac ccgcaggcag ccgctctttg cacacctccg taagaggggg gagtcacaag 7441 cctagtgaaa aggaaaattt taaaatgatt tgatttgcct tt //