Typing tool
|
Complete norovirus genomes
MW305575 | GII.3 | ||
---|---|---|---|
GII.P41 |
ORF1: 1..5075 ORF2: 5056..6702 ORF3: 6702..7466LOCUS MW305575 7518 bp RNA linear VRL 06-JUL-2021 DEFINITION Norovirus GII isolate HK54 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MW305575 VERSION MW305575.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7518) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Genome-wide analyses of human noroviruses reveal coexistence of viral populations evolving under recombination constraints JOURNAL Unpublished REFERENCE 2 (bases 1 to 7518) AUTHORS Tohma,K., Lepore,C.J., Martinez,M., Degiuseppe,J.I., Khamrin,P., Saito,M., Mayta,H., Nwaba,A., Ford-Siltz,L.A., Galeano,M.E., Zimic,M., Stupka,J.A., Gilman,R.H., Maneekarn,N., Ushijima,H., Green,K.Y. and Parra,G.I. TITLE Direct Submission JOURNAL Submitted (23-NOV-2020) CBER/OVRR/DVP/LHV, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA COMMENT ##Assembly-Data-START## Assembly Method :: High-performance Integrated Virtual Environment (HIVE) v. 2 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7518 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="HK54" /isolation_source="feces" /host="Homo sapiens" /db_xref="taxon:122929" /country="Hong Kong" /collection_date="12-Dec-1977" /note="genotype: GII.3[GII.P41]" gene <1..5075 /gene="ORF1" CDS <1..5075 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QPJ59061.1" /translation="AAATSNNDNAKSSSDGVLNSMAVTFKRALGARPKQPPPRETTQK QKPPRPPTPELVKKIPPPPPNGEDELVVSYSVKDGVSGLPELSTVSQPDEANTAFSVP PLNQRENRDAKEPLPGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELTPL SLYWRPVYTPQYLMSPDTLRKLHGELFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQR TTGFFRPYQDWNRKPLPTMDDSKLKKMANIILCALSSLFTRPIKDIIGKLRPLNILNI LASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVV MGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKDEANELA MVRSIEDAVLDLEAIENNHMTSLLKDKDSLATYMRTLDLEEEKARRLSTKSASPDIVG TINALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLARELAKKIAATLT GDQRVGLVPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRI ENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPDVEKAKRDFPGQP DMWKDAFKPDFSHIKLMLAPQGGFDKNGNTPHGKGVMKTLTSGSLIARASGLLHERLD EYELQGPTPTTFNFDQNKVFAFRQLAAENKYGLMDTMRVGSQLKGVKTVSELKQALKN IAIKRCQIVYNGSTYSLESDGKGNVKVEKVQSTTVQTNNELSGALHHLRCARIRYYVK CAQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDAEETINKDGCPKPKDDEEF VISSEDIKVEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQ DRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRN PDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHV IPQGSQEFFGVSIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKR PTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKR GNDYVVIGVHTAAARGGNTVICATQGSEGEAMLEGGDNKGTYCGAPILGPGNAPKLST KTKFWRSSTAPLPPGTYEPAYLGGKDPRIKGGPSLQQVMRDQLKPFMEPRGKPPNPSV LEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGYPHHMRKNECWNGESFTGKL ADQASKANLMYEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLATMVRCARA FGGLMDELKAHCVTLPIRVGMNMNEDGPIIFEKHSRYNYHYDADYSRWDSTQQRAVLA AALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFKISITEGLPSGVPCTSQWNSIAHW LLTLCALSEVTNLSPDTIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRP DKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPSETM IPHSQRPIQLMSLLGEAALHGPSFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFS DLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..965 /gene="ORF1" /product="p48" mat_peptide 966..2063 /gene="ORF1" /product="NTPase" mat_peptide 2064..2600 /gene="ORF1" /product="p22" mat_peptide 2601..2999 /gene="ORF1" /product="VPg" mat_peptide 3000..3542 /gene="ORF1" /product="Pro" mat_peptide 3543..5072 /gene="ORF1" /product="RdRp" gene 5056..6702 /gene="ORF2" CDS 5056..6702 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QPJ59062.1" /translation="MKMASNDAAPSNDGAAGLVPEINNEAMALEPVVGAAIAAPLTGQ QNIIDPWIMNNFVQAPGGEFTVSPRNSPGEVLLNLELGPEINPYLAHLARMYNGYAGG FEVQVVLAGNAFTAGKVIFAAIPPNFPIDNLSAAQITMCPHVIVDVRQLEPINLPMPD VRNNFFHYNQGSDSRLRLIAMLYTPLRANNSGDDVFTVSCRVLTRPSPDFSFNFLVPP TVESKIKPFTLPILTISEMSNSRFPVPIDSLHTSPTENIVVQCQNGRVTLDGELMGTT QLLPSQICAFRGTLTRSTSRASDQADTATPRLFNYYWHIQLDNLNGTPYDPAEDIPAP LGTPDFRGKVFGVASQRNPDSTTRAHEAKVDTTSGRFTPKLGSLEISTESGDFDPNQP TRFTPVGIGVDNEADFQQWSLPDYSGQFTHNMHLAPAVAPNFPGEQLLFFRSQLPSSG GRSNGILDCLVPQEWVQHFYQESAPAQTQVALVRYVNPDTGRVLFEAKLHKLGFMTIA KNGDSPITVPPNGYFRFESWVNPFYTLAPMGTGNGRRRIQ" gene 6702..7466 /gene="ORF3" CDS 6702..7466 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QPJ59063.1" /translation="MAGAFIAGLAGDMLTSTVGSLVNAGANAINQKVDFENNKYLQNA SFNHDKEMLNAQIEATKRLQADMIAIKQGVLTAGGFSPTDAARGAINAPITRVLDWSG TRYWAPSATSTTSMSGGFTSQTVHRTTPNFKTNQAPKSTPSSGSSVRSNSTQLTSLSS HSSGSSRSSGSTVVSSLPSSNRTRDWVNQQNFNLEPHMPGSLRTAFVTPPSSTASSSD TVSTVPKSVLDSWTSAFNTRRQPLFAHLRRRGESNV" ORIGIN 1 ccgctgctgc taccagcaac aacgacaacg caaaatcttc aagtgacgga gtactaaata 61 gtatggctgt cacttttaaa cgagccctcg gggcccggcc caaacagccg cccccgaggg 121 aaacaacaca aaaacaaaaa cccccacgac cgcccactcc ggagttggtc aaaaagatcc 181 cgcctccccc gcccaacggg gaggacgagt tagtggtctc ctatagtgtt aaagatggcg 241 tctccggttt gcctgagctc tctaccgtca gtcaaccaga cgaggccaac acagcattta 301 gtgtcccccc attaaaccaa agggagaaca gagatgctaa ggaaccactg cccggcacca 361 tcttggagat gtgggacgga gagatctacc attatggcct gtacgtggaa cggggtctgg 421 tgcttggggt acacaaacca ccagcggcta ttagtcttgc caaggttgag ttgacaccat 481 tgtctctata ttggagacca gtgtataccc cccagtacct tatgtccccg gacactctca 541 gaaagctaca tggagaacta ttcccttata cggcctttga taataactgt tatgccttct 601 gttgttgggt tttagatcta aacgactctt ggcttagcag gagaatgata cagagaacaa 661 ctggcttttt ccggccctac caggactgga acagaaagcc ccttcccacc atggatgatt 721 ccaaattgaa aaagatggcc aatataatac tgtgtgcttt gtcatcgctg tttaccaggc 781 ccattaagga cataattgga aagttgagac ccctaaacat cctcaatata ttggcttcct 841 gcgattggac ttttgcaggt atagtagaat ccctaatcct cttggcagag ctctttggag 901 ttttctggac gcccccagat gtgtctgcga tgatcgcccc cttactaggt gactacgagc 961 tacagggacc tgaagatctt gctgtggaac tcgtaccagt agtaatgggg gggataggtt 1021 tggttctagg tttcaccaaa gaaaagattg gaaaaatgct atcatctgct gcatccactc 1081 ttagggcttg taaagacctt ggagcatacg ggttggaaat cttaaaattg gtcatgaagt 1141 ggttcttccc gaagaaagat gaggcaaacg agctcgcaat ggtgagatcc atcgaggacg 1201 cagtattgga tctcgaagca attgaaaaca accacatgac ctctttgctc aaagacaagg 1261 acagtttggc aacctacatg agaactctag atcttgaaga ggagaaggct agaagactct 1321 ccaccaagtc tgcctctcct gacattgtgg gcacaatcaa tgccctattg gcgcggatcg 1381 cagccgcccg ctccctggtg catcgggcaa aagaagagct ctctagcagg ccaaggcccg 1441 ttgttgtgat gatatcaggc aaaccaggaa taggaaagac tcacctcgct agagagttag 1501 caaagaaaat tgcagccacc ctcacaggag atcagagggt gggccttgtc ccacggaacg 1561 gtgttgacca ctgggacgca tacaaaggtg agagagtcgt cctttgggat gactacggga 1621 tgagcaatcc cattcacgac gccctcagac tgcaagaact tgctgacacg tgccccctaa 1681 cactaaattg tgataggatt gagaacaagg gaaaagtctt tgacagtgac gctataatca 1741 ttacaactaa tctggccaac ccagcaccac tggactatgt caactttgaa gcatgctcaa 1801 ggcgcattga cttcctcgtg tatgctgatg cccctgatgt tgaaaaggcg aagcgcgact 1861 ttccaggaca acctgatatg tggaaggacg ctttcaaacc cgacttctca cacataaaac 1921 taatgctggc ccctcaaggt gggtttgata agaacggcaa caccccacac ggaaagggcg 1981 tcatgaaaac cctcacatct ggttctctca ttgcacgtgc atcagggctc ctccatgaaa 2041 gattggacga atacgagttg caaggcccaa cgcccacaac cttcaatttc gaccagaaca 2101 aggtctttgc tttcaggcaa ctcgctgctg aaaacaaata cgggttgatg gacactatga 2161 gagtgggaag ccagctcaag ggagtcaaaa ctgtgtcaga gcttaagcag gcgctcaaga 2221 acatcgcaat taaaaggtgt cagatagtct acaatggttc cacatactca cttgaatctg 2281 atggcaaagg taatgtgaaa gttgagaagg tacagagtac aactgtgcaa acaaacaatg 2341 agctatctgg tgcgctacac cacctcaggt gcgccaggat cagatattat gttaagtgtg 2401 cccaggaggc cctttattcc atcatccaaa ttgctggggc cgcgtttgtc accacgcgca 2461 ttgcaaaacg catgaacata caaaatctct ggtctaagcc acaggtggag gatgcagaag 2521 aaaccatcaa taaagatggg tgtccaaagc caaaagatga tgaggaattt gtcatctcgt 2581 ctgaagacat caaagtcgag ggcaagaagg gaaagaacaa gtctggccgt ggcaagaaac 2641 acacagcctt ttcaagcaag ggtctcagtg atgaggagta cgatgaatac aaaagaatta 2701 gagaagaaag aaatggtaag tactccatag aagaatacct tcaggacagg gacaaatact 2761 atgaagaggt ggccatagcc agggctactg aagaggactt ctgtgaggaa gaggaagcca 2821 aaatccgaca gaggattttc agaccaacga ggaaacaacg taaagaggag agggcttctc 2881 ttggcctggt cacaggctca gaaatcagga agaggaaccc agacgacttc aaacccaagg 2941 gaaagttgtg ggctgatgac gacaggagtg tcgattacaa tgagaaactc agctttgaag 3001 ctcccccaag catctggtca agaatagtca actttggttc aggttggggt ttctgggttt 3061 caccaagttt gtttataaca tccacccatg ttatacctca gggatcacag gagttcttcg 3121 gggtttccat taaacagatc caaatccaca aatcgggtga gttctgccga ctaagatttc 3181 caaaaccaat cagaactgat gtgacaggca tgatcctgga ggaaggggcc cctgaaggaa 3241 cagtggccac actactcata aagagaccaa ctggggaact catgccactg gcagctagga 3301 tgggcaccca cgcaactatg aagatccagg gtcgcactgt tgggggccaa atgggaatgc 3361 tcttgacagg atccaacgcc aagagcatgg acctgggtac tacaccaggt gattgtgggt 3421 gtccatatat ctacaaaaga ggaaatgact acgtggtcat tggagtccac actgccgccg 3481 ctcgcggggg gaacaccgtc atctgtgcaa cccagggaag cgagggtgag gccatgcttg 3541 aaggtggtga caacaaaggc acctattgcg gtgccccaat cctaggtcca ggaaatgccc 3601 ccaagctcag cactaagacc aaattttgga ggtcctcaac agctccactc ccacctggca 3661 catacgaacc agcctacctc ggaggtaagg accccaggat caaaggtggc ccctcattac 3721 aacaagtcat gagggatcaa ttaaagccat tcatggaacc caggggcaaa ccaccaaacc 3781 caagtgtgct agaagctgcc aaaaagacca tcattaatgt tcttgaacaa acaatagacc 3841 caccacagaa atggtcattt gcacaagcat gcgcatcgct tgacaaaacc acctccagtg 3901 gctacccgca ccacatgcgg aagaatgaat gttggaatgg agagtccttc acaggaaaat 3961 tggcagacca agcttcaaaa gctaacctaa tgtatgagga aggtaaaaac atgaccccgg 4021 tctacacagg tgccctcaag gatgagctag tcaaaactga caaaatatat ggcaagatta 4081 agaagaggct cctctggggg tcagacttgg caaccatggt ccggtgcgct cgagcattcg 4141 gagggctaat ggatgaactc aaggcccact gcgtcacact ccctattagg gttgggatga 4201 acatgaatga ggatggcccc atcatctttg agaagcactc caggtacaac taccattatg 4261 atgcagatta ctctcggtgg gattcaacac aacagagggc tgtgttagct gcagctctag 4321 aaatcatggt aaaattttcc ccagaaccac acctagccca gatagtcgca gaagaccttt 4381 tgtcccccag tgtgatggac gtgggcgatt ttaaaatatc aatcactgaa gggctcccct 4441 ctggggtgcc ttgcacctca caatggaact ccatcgccca ttggctcctc acactctgtg 4501 cactctctga ggtaacaaat ttatcccctg acaccatcca agcaaattct cttttttctt 4561 tctatggtga tgatgaaatt gtgagcacag atattaaatt ggatccagaa aagctgacag 4621 ctaaattgaa agagtatggg ctaaaaccaa ctcgccctga taagactgaa ggacctctgg 4681 tcatctctga ggacttgaat ggtctgacct tcctgcggag aactgtgacc cgcgacccag 4741 ctggttggtt tggaaaattg gaacagagct caatacttag acaaatgtat tggaccaggg 4801 gccccaatca tgaggacccc tccgaaacaa tgataccaca ttcccaaaga cccatacagc 4861 taatgtccct actaggtgaa gctgcactgc atggcccatc attctacagc aagatcagta 4921 agctagttat tgcagagttg aaggaaggtg gcatggattt ttacgtgccc agacaagagc 4981 caatgtttcg atggatgagg ttctcagact tgagcacgtg ggagggcgat cgcaatctgg 5041 ctcccagttt tgtgaatgaa gatggcgtcg aatgacgctg ctccatctaa cgatggtgcc 5101 gccggcctcg tcccagagat caacaatgag gcaatggcgc tagagccagt ggtgggtgca 5161 gcgatagcag cacccctcac tggccagcaa aacataattg atccctggat tatgaataat 5221 tttgtgcaag cacctggtgg tgagtttaca gtgtcaccta ggaattcccc tggtgaagtg 5281 cttcttaatt tagaattagg tccagaaata aacccctatt tggctcacct tgctaggatg 5341 tacaatggtt atgcaggtgg gtttgaagtg caggtagtcc tggctggaaa tgcgtttaca 5401 gcaggaaagg tgatctttgc agctataccc cccaattttc caattgataa tctgagcgca 5461 gcacaaatta caatgtgccc gcatgtgatt gtggatgtca ggcagctgga accaattaat 5521 cttccgatgc ctgatgtccg caacaatttc tttcattata atcaagggtc tgattcgagg 5581 ttacgcttaa ttgcaatgct gtatacacct cttagggcaa acaattccgg agatgatgtt 5641 tttactgtgt cctgtagagt attaactagg cctagccctg atttctcatt caattttctt 5701 gtcccaccca ctgtggaatc aaagataaaa cccttcaccc tccccattct gactatctct 5761 gaaatgtcta attccaggtt tccagtgcca attgactctc tgcacaccag cccgactgag 5821 aacattgttg tccagtgcca aaatgggcgc gtcactcttg acggtgagtt aatgggtacc 5881 acccaactct tgccgagtca gatatgtgct ttcaggggca cgctcaccag atcaacaagc 5941 agggccagtg accaagccga cacagcaacc cctaggttat tcaattatta ttggcacata 6001 caattggaca atctaaatgg aaccccctac gaccctgcag aggacatacc agcccctctg 6061 ggaacaccag acttccgggg caaggtcttt ggcgtagcca gccagagaaa ccctgacagc 6121 acaacaagag cacatgaagc aaaagtggac acaacatctg gtcgcttcac cccgaaattg 6181 ggctccctag aaatatccac tgaatccggt gactttgacc caaaccaacc aacaagattc 6241 accccagttg gcattggggt tgacaatgag gcagattttc agcaatggtc cttacctgac 6301 tattccggtc agttcactca caacatgcac ttagccccag ctgtcgcccc caattttcct 6361 ggtgagcagc ttcttttctt ccgctcacag ttgccatctt ctggtgggcg gtcaaacggg 6421 attctagact gcctggtccc ccaggaatgg gttcaacact tctaccagga atcagcccct 6481 gcccaaacac aggtggccct ggttaggtat gttaaccctg acactggtag agtgctattt 6541 gaggccaagc tacataaatt aggtttcatg actatagcta agaatggtga ctctccaata 6601 accgtccctc caaatgggta ctttaggttt gaatcttggg tgaacccctt ttacacactt 6661 gcccccatgg gaactggaaa tgggcgtaga aggattcaat aatggctgga gcctttatag 6721 caggattggc tggtgacatg ctcacaagta ctgtgggatc tttagttaat gcaggggcta 6781 atgctatcaa tcaaaaagtt gattttgaaa ataataaata tttacaaaat gcatctttta 6841 atcatgataa ggagatgtta aatgcacaaa ttgaggcaac aaagaggctg caggctgaca 6901 tgattgctat caaacaaggg gtcttgaccg ctggcggctt ttcccccact gatgcagccc 6961 gtggggcaat taatgccccc ataacaagag ttttggactg gagtggaacg aggtactggg 7021 caccaagcgc cacctccaca acctcaatgt caggtggctt cacaagccaa actgtacaca 7081 gaaccacacc aaattttaaa acgaaccagg cccccaagtc cacacccagc agtgggtctt 7141 cagtgagatc aaactcaacc caactcacta gcttgagctc acactcatcc gggtcgtctc 7201 gatccagcgg gtctacggtt gttagctcat tgccatcttc caacaggact agggattggg 7261 tcaatcaaca gaatttcaat ttggaaccac acatgcctgg atctctcagg acagcttttg 7321 tcactccacc atctagtaca gcctctagtt cagacacggt ctcaaccgtg cccaaaagtg 7381 ttttggactc ctggacatct gcgtttaata cgcgcagaca gccgctattc gcacaccttc 7441 gtagaagggg ggagtcaaat gtttagtgaa aagattatct taaatttagt ttagattgga 7501 tttaatttgg aatctttt //