Typing tool
|
Complete norovirus genomes
MK956198 | GII.6 | ||
---|---|---|---|
GII.P7 |
ORF1: 1..2682 ORF2: 2663..4306 ORF3: 4306..5050LOCUS MK956198 5050 bp RNA linear VRL 12-NOV-2019 DEFINITION Norovirus GII isolate G19_008 nonstructural polyprotein (ORF1) gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene, partial cds. ACCESSION MK956198 VERSION MK956198.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 5050) AUTHORS Strubbia,S., Schaeffer,J., Oude Munnink,B.B., Besnard,A., Phan,M.V.T., Nieuwenhuijse,D.F., de Graaf,M., Schapendonk,C.M.E., Wacrenier,C., Cotten,M., Koopmans,M.P.G. and Le Guyader,F.S. TITLE Metavirome Sequencing to Evaluate Norovirus Diversity in Sewage and Related Bioaccumulated Oysters JOURNAL Front Microbiol 10, 2394 (2019) PUBMED 31681246 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 5050) AUTHORS Strubbia,S., Schaeffer,J. and Le Guyader,S.F. TITLE Direct Submission JOURNAL Submitted (22-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. v3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..5050 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_008" /isolation_source="sewage" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="22-Mar-2018" /note="genotype: GII.6-GII.P7" gene <1..2682 /gene="ORF1" CDS <1..2682 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QCT04939.1" /translation="RARYYISCFQDLVYTLIQVAGASFVVNRISKRFCWERWVKPTET QETSESEKEVAQGKWEIEPKDTEPEGKKGKNKKGRGKKHTAFSSKGLSDEEYDEFKRI REERNGKYSIEEYLQDRDRYYEEVAVARATEEDFCEEEEAKIRQRIFRPTKKQRKEER GVLGLVTGSDIRKRRPDDFQPKGNLWADDTRSVDYNERLDFEAPPSVWSRIVPLGTGW GFWVSSNLLITTTHVLPKGIKELFGVEVKQIQIHKSGEFCRFRFPRPIRPDVTGLVLE EGAPEGTVCSILVKRPTGEMIPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDL GTGPGDCGCPYIYKRGNDIVVAGVHTAAARGGNTVICATQGQDGEAVLEGNENLGTYC GAPILGPGKAPKLSTKTKFWRSSPDALPPGTYEPAYLGGKDPRVEKGPSLQQVMRDQL KPFTEPRGKPPRPAVLEEAKKTVMNVLEQTIDPAKPWTYSQACASLDKTTSSGSPHHV RKNDHWNGESFTGPLADQASKANLMYEQAKHVQPVYTAALKDELVKTDKIYRKIKKRL LWGSDLGTMIRCARAFGGLMDSMKASCITLPCRVGMNMNEDGPIIFDKHSKYRYHYDA DYSRWDSTQQRSILSAAMEVMVRFSAEPELAQVVAEDLLAPSQLDVGDFVISVQEGLP SGVPCTSQWNSIAHWILTLSAMAEVSGLSPDVIQAHSCFSFYGDDEIVSTDINLDPMK LTQKLREYGLVPTRPDKTEGPLVITEDLTGLTFLRRSIARDPAGWFGKLDQDSILRQL YWTRGPNHENPYESMVPHSQRATQLMALLGEASLHGPQFYKKVSKMVINEIKSGGLEF YVPRQEAMFRWMRFSDLSTWEGDRNLAPEGVNEDGVE" mat_peptide <1..207 /gene="ORF1" /product="p22" mat_peptide 208..606 /gene="ORF1" /product="VPg" mat_peptide 607..1149 /gene="ORF1" /product="Pro" mat_peptide 1150..2679 /gene="ORF1" /product="RdRp" gene 2663..4306 /gene="ORF2" CDS 2663..4306 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCT04940.1" /translation="MKMASNDAAPSNDGAANLVPEANNEVMALEPVVGASIAAPVVGQ QNIIDPWIRENFVQAPQGEFTVSPRNSPGEMLLNLELGPELNPYLSHLSRMYNGYAGG MQVQVVLAGNAFTAGKIIFAAVPPHFPVENISAAQITMCPHVIVDVRQLEPVLLPLPD IRNRFFHYNQENTPRMRLVAMLYTPLRANSGEDVFTVSCRVLTRPAPDFEFTFLVPPT VESKTKPFTLPILTLGELSNSRFPAPIDMLYTDPNEGIVVQPQNGRCTLDGTLQGTTQ LVPTQICAFRGTLIGQTSRSSDSTDSAPRRRDHPLHVQLKNLDGTQYDPTDEVPAVLG AIDFKGTVFGVASQRDVSGQQVGATRAHEVHINTTDPRYTPKLGSILMHSESDDFVTG QPVRFTPIGMGDNDWHQWELPDYSGRLTLNMNLAPAVAPAFPGERILFFRSIVPSAGG YGSGQIDCLIPQEWVQHFYQEAAPSQSAVALIRYVNPDTGRNIFEAKLHREGFITVAN SGNNPIVVPPNGYFRFEAWVNQFYTLTPMGTGQGRRRNQ" gene 4306..>5050 /gene="ORF3" CDS 4306..>5050 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCT04941.1" /translation="MASAFLAGLAGDVITNGVGSLINAGANAVNQKVEYDFNKQLQMA SFKHDKEMLQSQVLATKQLQQEMMNIRQGVLTAGGFSPADAARGAVNAPMTKILDWNG TRYWAPNSMKTTSYSGQFSSSPVHKSPAPPQHAVSSKSRLQNDSASVYSSPSSVSSQS THSTVLSAGTGSSRSISTSTATPTLSRTSDWVRGQNERLGPFMDGALQTAFVTPPSSK ASSNGTVSTVPKAVLDSWTPMFNTHRQPLF" ORIGIN 1 agggccagat actacatcag ctgcttccaa gaccttgtct acaccctaat acaggtcgcc 61 ggagcatcat ttgttgtcaa caggatctct aagagattct gctgggagag atgggtcaaa 121 ccaaccgaga cccaggaaac gagtgaatcc gaaaaggaag tagcccaagg caaatgggag 181 attgagccca aggacacaga accagaaggt aagaagggta agaacaagaa aggaagaggc 241 aagaaacaca cagctttctc cagcaaaggc cttagtgatg aggagtatga cgagttcaag 301 agaattagag aagaaaggaa cggaaaatac tccattgagg aatacctgca agaccgggat 361 cgctactatg aggaagtggc agttgcccgg gcgacagagg aggacttctg tgaagaggag 421 gaggccaaga taagacaaag aatctttcgt cccacaaaga aacagaggaa ggaagaaagg 481 ggggtgcttg gtttggtcac tggctcagac atcaggaaga ggagaccaga cgattttcaa 541 ccgaagggca acctgtgggc agacgacacc aggagtgtgg attacaacga gagacttgat 601 ttcgaagctc ccccgagtgt ctggtctaga atagtcccat tgggtactgg ctggggcttc 661 tgggtctcat ccaacctcct gatcacaaca acacacgtcc tgcctaaggg gattaaggaa 721 ctttttggag ttgaagtcaa acagatccaa attcataagt ctggagagtt ctgcaggttc 781 agattcccaa gaccaatcag accagatgtc acagggctcg tgttggagga gggtgctcca 841 gaaggcactg tctgctccat actcgtgaag agaccaacag gtgagatgat ccctctggca 901 gtgaggatgg gcacgcatgc atccatgaaa atacagggta gaaccgtcgg tggtcagatg 961 ggaatgcttc tcacaggggc aaatgcgaag aacatggatc ttggcaccgg ccctggtgac 1021 tgtggttgtc cttacatcta caaacgtggc aatgatatag ttgtcgcggg tgtccacacc 1081 gcagcagccc ggggaggcaa cactgtcata tgtgccactc aagggcaaga tggggaggca 1141 gtccttgagg gaaatgagaa ccttggaact tactgcggtg ccccaatttt gggtccaggc 1201 aaggcgccca aacttagcac aaagaccaag ttctggcgtt cgtcaccaga cgctctgccg 1261 ccaggcactt atgagcctgc ctacttggga ggcaaggacc ctagagtaga aaagggacca 1321 tccctgcagc aagtcatgag agaccaacta aaacctttca cagaacccag gggcaagcca 1381 cctagacccg cagtcttaga agaagccaaa aagacagtga tgaatgttct agaacaaacc 1441 attgatcctg ctaagccatg gacctactcc caagcatgcg cttcactgga caagaccacc 1501 tccagtggta gccctcatca tgtcaggaaa aatgaccatt ggaatgggga atcctttact 1561 ggcccccttg cagaccaagc atccaaagcc aacctcatgt acgagcaggc caaacatgtg 1621 cagcccgtgt acacggccgc actcaaagac gagctagtca agactgacaa gatctacagg 1681 aagataaaga aaaggctctt atgggggtcg gatcttggta caatgatcag gtgtgccagg 1741 gcctttggtg gtctcatgga tagcatgaag gcaagttgca taaccctccc atgtagggtg 1801 ggaatgaaca tgaatgaaga tggtcccatc atatttgaca aacactctaa gtataggtac 1861 cattatgatg ctgactattc caggtgggac tcaacccagc aaaggagcat tctctcggcc 1921 gctatggaag tgatggtgcg gttctctgcc gaaccagagc tggcacaagt ggttgcagag 1981 gacctcctgg cacccagcca actagatgtt ggcgactttg tcatctcagt tcaggagggt 2041 ctgccgtcag gggtaccatg tacatcacaa tggaattcaa tagcacattg gatcctaact 2101 ttgagtgcaa tggcagaagt gtcgggtctc tcaccagatg ttattcaagc ccactcctgt 2161 ttctcatttt acggtgatga tgagatcgtc agcactgaca tcaaccttga tcccatgaag 2221 ttgacacaga aactcagaga gtatggccta gtccctactc gacctgacaa aactgagggt 2281 ccccttgtaa taactgaaga tctcaccggc ctaacgttcc tgcgtaggtc aattgcacgg 2341 gatccagctg ggtggtttgg aaaattagac caagactcaa tcctcaggca attgtactgg 2401 acaaggggcc ccaatcatga gaacccatat gagagcatgg tcccccattc ccagcgggcc 2461 acacagctta tggcccttct cggtgaggct tcactgcatg gcccccagtt ttacaagaag 2521 gtcagcaaga tggtcattaa cgaaatcaaa agtggtggtc tggaattcta tgtgcccaga 2581 caagaggcca tgttcagatg gatgagattc tctgacctca gcacatggga gggcgatcgc 2641 aatcttgctc ccgagggtgt gaatgaagat ggcgtcgaat gacgctgctc catcgaatga 2701 tggtgctgcc aacctcgtac cagaggccaa caatgaggtt atggcacttg agccggtggt 2761 gggagcttca attgcagctc ctgtcgtcgg ccaacaaaac ataattgacc cctggattag 2821 agaaaatttt gttcaagcac cacagggtga gttcactgtc tcgccgagga actcgcctgg 2881 tgagatgcta ttaaatcttg aattaggccc agaactcaac ccttacctaa gtcacctgtc 2941 ccgtatgtat aatgggtatg ctggtggcat gcaggttcag gtggtcctag ctgggaatgc 3001 gttcacagct gggaaaatca tctttgccgc tgtgccacca catttccccg tggaaaacat 3061 cagtgcagct caaataacta tgtgccccca tgtgattgtt gatgtgagac aacttgagcc 3121 agtactccta ccccttcctg acataaggaa taggttcttt cattacaatc aggagaacac 3181 tccccggatg agactagtgg ctatgcttta cacccccctg agggccaact ctggtgaaga 3241 tgtgtttact gtctcttgta gggtcttaac ccgtcctgcc cctgattttg aatttacttt 3301 cttggtacca ccaactgttg aatcaaagac taagcctttt acactaccca tattaactct 3361 tggtgagcta tctaattcca gattcccagc cccaatagat atgttgtaca ctgatccaaa 3421 tgagggaatt gtagtccaac cacaaaatgg taggtgcact cttgatggca ctctgcaagg 3481 caccacacaa ctggtcccca cccaaatttg tgctttcagg ggcacactaa ttggccaaac 3541 atcaagatct tcagactcaa ccgactcagc ccctcggagg agggatcacc cactccatgt 3601 gcaattaaag aaccttgatg gcacgcagta tgaccctact gatgaagtgc cagcagtcct 3661 cggtgccatt gatttcaagg ggactgtctt tggggtggcc agtcagaggg acgtgtcagg 3721 acaacaggtg ggagcaactc gagcccatga agtgcacatc aacacaaccg atcctaggta 3781 tacaccaaaa ctagggtcca ttctcatgca ctcagagtcg gacgacttcg tgactggaca 3841 gccggtccgc ttcacaccca taggaatggg cgacaacgac tggcatcagt gggagctgcc 3901 cgactattct ggacgcctaa ccctaaacat gaaccttgcc ccagcagttg ctcctgcatt 3961 tccgggtgag aggattcttt tcttcaggtc aattgtcccg tctgctggtg gctacggctc 4021 tgggcaaata gattgcctca taccacagga gtgggttcag catttctacc aagaagctgc 4081 accatcccaa tctgccgtgg cactcatcag gtatgtcaac cctgacacag gcagaaacat 4141 ctttgaggct aaattgcaca gggaaggctt catcaccgtg gctaattctg gcaacaaccc 4201 cattgttgtc ccccctaatg ggtattttag gtttgaggct tgggtgaatc aattttacac 4261 tttgaccccc atgggaactg gtcaggggcg taggaggaat caataatggc tagtgctttt 4321 cttgcaggtc ttgctggtga cgtcataaca aatggcgttg gatctctaat aaatgctggt 4381 gctaatgcag ttaatcagaa agttgaatat gattttaata aacagcttca aatggcatca 4441 tttaaacatg ataaagagat gttgcaatca caagtgttgg caaccaagca gttgcagcag 4501 gagatgatga acattaggca gggggtgttg accgctggcg gcttctcccc cgcggatgct 4561 gctagagggg ctgtcaatgc cccaatgaca aagattctgg actggaatgg caccaggtat 4621 tgggcaccaa acagcatgaa aaccacaagt tattcaggac agttttctag cagccctgtt 4681 cataagtctc ctgctcctcc ccagcatgct gtttcatcaa agagtagatt gcaaaatgat 4741 tctgctagtg tatatagttc tccttcttct gtttcttcac aatcaactca ttcaacagtg 4801 ctgtcagcag gaactgggtc ctccaggtcc atctccacat ctacagctac ccccaccttg 4861 tctaggacca gcgactgggt taggggacag aacgagaggc ttggcccgtt catggatggt 4921 gctctccaaa ctgcctttgt cacacctcca tcgagcaaag cttcttcaaa tggtacggtc 4981 tcaaccgttc ccaaagctgt tttggactcc tggactccta tgtttaacac ccataggcag 5041 cctctcttcg //