Typing tool

Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| MK907789 | GII.6 | GII.P7 |
ORF1: 1..5059 ORF2: 5040..6683 ORF3: 6683..7431
LOCUS MK907789 7431 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_019 nonstructural polyprotein (ORF1) gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene, partial cds. ACCESSION MK907789 VERSION MK907789.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7431) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7431) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7431 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_019" /isolation_source="sewage" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="30-Mar-2014" /note="genotype: GII.6-GII.P7" gene <1..5059 /gene="ORF1" CDS <1..5059 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QCO93068.1" /translation="GSQNSVNDSINTAPSNKEEVGAFSNIKVGFKKMLGAVPKGTKAP SSEQYCPSVKIGTKTLTVPPEPPNGEDTVQFDAKSETVRGLPDLTTVQNEHENTPYTV PPLSEREHRPATEPLPGTILEMWDGEFYHYSVYVSGGKALGVHKPPAAISLATIELTP ISLYWRPVYTPNYLVCPDTLKGLAGEKFPYTAFSNNCYNFCCWVLELNDTWLSRRSIS RTTGFFKPYQSWNRKPLPTVDDGKIKKVANAILCALGSLFSKPIKDLLGKLKPLNLLH LLASCDWTFAGIVETVILMAELFNIFWTPPDVSSFIASLIGDFELQGPEDLAVELVPV IMGGIGMVLGFTAEKIGRMLSSAASTLRACKDLGNYALDILKLVMKWFFPKKEEKAEM ETLRAIEDAVLDMEAIGNNHLTTLLKDKDSLTAFMKTLDLEEEKARKLSTKSSSPDIV GTINAILARIAAARSLLHKAKEEMFSRIRPVVVMISGRPGIGKTHMARHLAKSIANTM SGDQRVGLIPRNGVDHWDAYRGERVVLWDDYGMGNPVKDALTLQELADTCPVTLNCDR 
IENKGKMFDSDVIIITTNLVNPAPLDYVNFEACSRRVDFLVYAESPEIEKVKRDFPGQ PDMWKDHFKPDFSHIKLTLAPQGGFDKNGNTPHGKGTMRSLTQGSLTARVAGLVHERR DEFQLQGNDLQTYNFDTNRVSAFRKLAADNKYGIMETMRVGTALKSVKTLEDLKVALR DVKFNECEIIYRNSKYRVSSNGKGSVSVDKVEDQTSQTANEVGAALLRLRQARARYYV SCFQDLVYTLIQVAGASFVVNRISKRFCWERWVKPTETQETSESEKEVAQGRWEIEPK DTEPEGKKGKNKKGRGKKHTAFSSKGLSDEEYDEFKRIREERNGKYSIEEYLQDRDRY YEEVAVARATEEDFCEEEEAKIRQRIFRPTKKQRKEERGVLGLVTGSDIRKRRPDDFQ PKGDLWADDTRSVDYNERLDFEAPPSVWSRIVPLGTGWGFWVSSNLLITTTHVLPKGV KELFGVEIKQIQIHKSGEFCRFRFPRPIRPDVTGLVLEEGAPEGTVCSILVKRPTGEM IPLAVRMGTHASMKIQGRTVGGQMGMLLTGANAKNMDLGTGPGDCGCPYIYKRGNDIV VAGVHTAAARGGNTVICATQGQDGEAVLEGNENLGTYCGAPILGPGKAPKLSTKTKFW RSSPDALPPGTYEPAYLGGKDPRVEKGPSLQQVMRDQLKPFTEPRGKPPRPAVLEEAK KTVMNVLEQTIDPAKPWTYSQACASLDKTTSSGSPHHVRKNDHWNGESFTGPLADQAS KANLMYEQAKHVQPVYTAALKDELVKTDKIYKKIKKRLLWGSDLGTMIRCARAFGGLM DSMKASCIALPCRVGMNMNEDGPIIFDKHSKYRYHYDADYSRWDSTQQRSILSAAMEV MVRFSAEPELAQVVAEDLLAPSQLDVGDFVISVQEGLPSGVPCTSQWNSIAHWILTLS AMAEVSGLSPDVVQAHSCFSFYGDDEIVSTDINLDPMKLTQKLREYGLVPTRPDKTEG PLVITEDLTGLTFLRRSVARDPAGWFGKLDQDSILRQLYWTRGPNHENPYESMVPHSQ RATQLMALLGEASLHGPQFYKKVSKMVISEIKSGGLEFYVPRQEAMFRWMRFSDLSTW EGDRNLAPEGVNEDGVE" mat_peptide <1..967 /gene="ORF1" /product="p48" mat_peptide 968..2065 /gene="ORF1" /product="NTPase" mat_peptide 2066..2584 /gene="ORF1" /product="p22" mat_peptide 2585..2983 /gene="ORF1" /product="VPg" mat_peptide 2984..3526 /gene="ORF1" /product="Pro" mat_peptide 3527..5056 /gene="ORF1" /product="RdRp" gene 5040..6683 /gene="ORF2" CDS 5040..6683 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93066.1" /translation="MKMASNDAAPSNDGAANLVPEATNEVMALEPVVGASIAAPVVGQ QNIIDPWIRENFVQAPQGEFTVSPRNSPGEMLLNLELGPELNPYLSHLSRMYNGYAGG MQVQVVLAGNAFTAGKIIFAAVPPHFPVENISAAQITMCPHVIVDVRQLEPVLLPLPD IRNRFFHYNQENTSRMRLVAMLYTPLRANSGEDVFTVSCRVLTRPAPDFEFTFLVPPT VESKTKPFTLPILTLGELSNSRFPAPVDMLYTDPNEAIVVQPQNGRCTLDGTLQGTTQ LVPTQICSFRGTLISQTSRSADSTDSAPRVRNHPLHVQLKNLDGTPYDPTDEVPAVLG AIDFKGTVFGVASQRNTTGSSIGATRAHEVHIDTTNPRYTPKLGSVLMYSESTDFDDG 
QPTRFTPIGMGADDWHQWELPEYSGHLTLNMNLAPAVAPAFPGERILFFRSVVPSAGG YGSGHIDCLIPQEWVQHFYQEAAPSQSAVALIRYVNPDTGRNIFEAKLHREGFITVAN SGNNPIVVPPNGYFRFEAWVNQFYTLTPMGIGQGRRRAQ" gene 6683..>7431 /gene="ORF3" CDS 6683..>7431 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93067.1" /translation="MAGAFLAGLAGDAITSGVGSLINAGANAINQKVEYDFNKQLQLA SFKHDKEMLQSQVLATKQLQQEMMNIRQGVLSAGGFSPTDAARGAVNAPMTKILDWNG TRYFAPNSMKTTSYSGQFSSNPVHRQPTLPRSDPPKIKVNSDSASVYSSASSAPSQST HSTALTAGSSSSRTTSSSAASTLSRTSDWVRGQNERLSPFMDGALQTTFVTPPSSKAS SYGSVSTVPKAVLDSWTPMFNTHRQPLFAHVR" ORIGIN 1 tggcagccaa aactctgtca atgacagtat caacaccgcc ccttctaata aagaagaggt 61 tggtgcattc tccaacatta aagttggttt taagaaaatg ctgggtgccg ttcccaaggg 121 gaccaaagca cccagtagtg agcagtactg cccctcagtt aagattggga ccaagacatt 181 aacagttccc cctgagcccc caaatggtga agacaccgtg cagtttgatg caaagtcgga 241 aacagtgcgc gggctaccag acttgacaac ggtgcagaat gaacacgaaa acacaccata 301 caccgtcccc ccattgagtg aaagggaaca cagaccagcc actgaaccgc ttcctggcac 361 aatactggag atgtgggatg gtgaattcta ccattactct gtgtacgtca gcggtggcaa 421 agccctaggg gttcacaagc cccctgcagc gataagcctc gcgacgatag agctcacacc 481 catatctctt tattggaggc ccgtttacac ccccaactat ctggtctgtc cagacacgtt 541 gaaaggcctt gctggtgaga agttccccta cacggccttc agcaacaact gttacaactt 601 ctgttgctgg gtgcttgaac tcaatgacac gtggcttagc aggagaagca tctctaggac 661 taccggcttc ttcaaaccat atcagtcttg gaataggaaa ccccttccaa ccgttgatga 721 tggaaagatc aagaaggtgg ccaacgcaat tctctgcgca cttggatcac tgttttcaaa 781 accaatcaag gacctattgg gcaaactcaa accattgaac ctgctgcact tgctcgcatc 841 atgtgactgg acgttcgcgg gcatagtaga gacagtcatc ctaatggcgg aacttttcaa 901 catcttttgg acaccgccag atgtttccag tttcatagcc tccctgatag gtgattttga 961 attgcaaggg cctgaggacc tggctgtgga gcttgtcccc gtgatcatgg gtggcatagg 1021 aatggttctc ggctttactg ctgagaagat aggccgcatg ctgtcgtccg ctgcatcaac 1081 actacgggca tgcaaagact tggggaatta tgcccttgac atattgaaac tggttatgaa 1141 gtggttcttc ccaaagaaag aagagaaagc cgagatggaa accctaaggg 
cgattgagga 1201 tgccgtcctg gacatggaag ctataggaaa taaccatctc acaaccctcc tcaaggacaa 1261 ggatagcctc acggctttca tgaagactct agatctagaa gaggagaagg caaggaagct 1321 gtccaccaaa tcatcctcgc cagacatcgt cggcactatt aatgccatac tggccagaat 1381 agctgctgcc aggtccctct tgcacaaggc taaagaagaa atgtttagca ggattagacc 1441 cgtggttgtc atgatatcgg gtagacctgg cattggcaaa acccacatgg caagacactt 1501 ggctaagagt atcgccaaca ccatgagcgg cgaccagaga gtaggactca tcccacgcaa 1561 cggggtcgat cattgggacg cctacagggg agagagggtg gtcttatggg atgattatgg 1621 catgggaaac cctgtcaaag atgccctaac actgcaagag ttggccgaca cctgcccagt 1681 aactctaaac tgtgatagga tcgagaacaa ggggaagatg tttgacagtg atgtcatcat 1741 aatcacgacc aatctagtca acccggcgcc cctcgattat gtaaacttcg aggcgtgctc 1801 cagaagggtc gactttctgg tctatgcaga gtcaccagaa attgagaagg ttaagagaga 1861 tttccccggc caacctgaca tgtggaagga ccattttaag ccagactttt cccacataaa 1921 actgacccta gccccacagg gtggatttga caagaatggc aacaccccgc atggtaaggg 1981 caccatgcga tccctaaccc agggttccct gactgcaagg gttgcaggtc ttgttcatga 2041 gaggagagac gagtttcagc tccaggggaa cgaccttcaa acatacaact ttgacaccaa 2101 cagggtctct gcatttagga agctggccgc agacaacaag tatgggatca tggagactat 2161 gagagtgggc acagcgctta agagtgtcaa gaccttggag gatctgaaag ttgcactgag 2221 ggatgtgaag ttcaatgagt gtgagataat ttacagaaac tccaaatacc gagtctcttc 2281 caatggtaaa ggttctgtct ctgttgacaa agttgaggac caaacatccc agactgcaaa 2341 cgaagtgggt gcagcactcc ttagactcag gcaggcaaga gccagatact atgtcagctg 2401 cttccaagat ctcgtttaca ctctaataca ggtcgccgga gcatcattcg ttgtcaacag 2461 gatctctaaa aggttctgct gggagagatg ggtcaaacca accgagaccc aggaaacgag 2521 tgaatccgaa aaggaagtgg cccaaggcag atgggagatt gagcccaagg acacagagcc 2581 agaaggtaag aagggtaaga acaagaaagg aagaggcaag aaacacacag ctttctccag 2641 taaaggcctg agtgatgagg agtatgacga gttcaagaga atcagagaag aaaggaacgg 2701 aaaatactcc attgaggaat acctgcaaga ccgggatcgc tactatgagg aagtggcagt 2761 tgcccgggcg acagaggagg acttctgtga agaggaggag gccaagataa gacaaagaat 2821 ctttcgcccc acgaagaaac agaggaagga agaaaggggg gtgcttggtt tggtcactgg 
2881 ctcagacatc aggaagagaa gaccagacga ttttcaaccg aagggtgacc tgtgggcaga 2941 cgataccagg agtgtggact acaacgagag acttgatttc gaggcgcctc cgagtgtctg 3001 gtcgagaata gtcccattgg gcactggctg gggcttctgg gtctcatcaa acctcctgat 3061 cacaacaaca cacgtcctgc ctaagggggt taaggaactt tttggagttg aaatcaaaca 3121 gattcaaatc cataagtctg gagagttctg caggttcagg ttcccgagac caattagacc 3181 agatgtcaca gggctcgtgt tggaggaagg cgctccagaa ggcactgtct gctccatact 3241 cgtaaagaga cctacaggtg agatgatccc cttggcagtg aggatgggca cgcatgcatc 3301 tatgaaaata cagggtagga ccgtcggtgg ccagatggga atgctcctca caggggcaaa 3361 tgcgaagaac atggatcttg gtaccggtcc tggtgactgc ggttgccctt acatctacaa 3421 acgtggcaat gacatagttg tcgcgggtgt ccacaccgca gcagcccggg gaggcaacac 3481 tgttatatgt gccactcaag ggcaagatgg ggaagcagtc cttgagggaa atgagaacct 3541 tggcacttac tgcggtgccc caatcttggg tccaggcaag gcgcccaaac ttagcacgaa 3601 gaccaagttc tggcgctcat caccagacgc tttgccgcca ggcacttatg agcctgccta 3661 cttgggaggt aaggacccca gagtggaaaa aggaccatct ctgcagcaag tcatgaggga 3721 ccagctaaaa cctttcacag aacccagggg caagccacct agacctgcag tcttagaaga 3781 ggccaaaaag acagtgatga atgttctaga gcaaaccatt gaccctgcta agccgtggac 3841 ctattcccaa gcatgtgcct cactggacaa gaccacttcc agtggtagcc cccatcatgt 3901 caggaaaaat gaccattgga atggggaatc cttcactggc ccccttgcag accaagcatc 3961 caaagccaac ctcatgtacg agcaggccaa acatgtgcag cccgtgtaca cggccgcact 4021 taaagatgaa ctagtcaaaa ctgacaagat ctacaagaag ataaagaaaa ggctcttgtg 4081 ggggtcggat cttggcacaa tgatcaggtg tgccagggcc tttggcggtc tcatggatag 4141 catgaaggca agttgcatag ccctcccatg tagggtagga atgaacatga atgaagatgg 4201 tcccatcata tttgacaaac attctaagta taggtatcat tatgatgctg actattccag 4261 gtgggactca acccagcaaa ggagcattct ctcggccgct atggaagtga tggtgcggtt 4321 ctctgccgaa ccagagctgg cacaggtggt tgcagaggac ctcctggcac ccagccaact 4381 agatgttggc gatttcgtca tctcagtcca ggagggtctg ccatcagggg ttccatgcac 4441 atcacaatgg aattcaatag cacactggat cctaaccttg agtgcaatgg cagaagtgtc 4501 gggtctctca ccagatgttg ttcaagccca ctcctgtttc tcattctacg gtgatgatga 4561 
gatcgtcagc actgacatca accttgatcc catgaagttg acacagaaac tcagagagta 4621 tggcctggtc cccactcgac ctgacaaaac tgagggtccc ctcgtaataa ctgaagacct 4681 caccggccta acgttcctgc gtaggtcagt tgcacgggac ccagctgggt ggtttgggaa 4741 attggaccaa gactcaatcc tcagacagtt gtactggaca aggggcccca atcatgagaa 4801 cccatatgag agcatggtcc ctcattccca gcgggccaca cagcttatgg cccttctcgg 4861 tgaggcttca ctgcatggcc cccagtttta caagaaggtt agcaagatgg tcattagtga 4921 gatcaaaagt ggtggtctgg aattctatgt gcccagacaa gaggccatgt tcagatggat 4981 gagattttct gacctcagca catgggaggg cgatcgcaat cttgctcccg agggtgtgaa 5041 tgaagatggc gtcgaatgac gctgctccat cgaatgatgg tgccgccaac ctcgtaccag 5101 aggccaccaa tgaggttatg gcacttgagc cggtggtggg agcctcaatt gctgctcctg 5161 tcgtcggcca acaaaatata attgacccct ggattagaga aaattttgtt caggcaccac 5221 agggtgagtt tactgtttca ccaaggaact cacctggtga gatgcttcta aatcttgaat 5281 taggccctga gctcaatcct tatctgagtc acttgtcccg catgtataac ggctatgctg 5341 gtggtatgca ggttcaggtg gtcctagctg ggaatgcgtt cacagctggt aaaatcatct 5401 ttgccgccgt cccaccacat tttcctgttg agaacattag tgcagcccaa attacaatgt 5461 gcccccatgt aattgttgat gtaaggcagc ttgagccagt gcttttgccc ctccctgaca 5521 taaggaatag attctttcat tataatcagg aaaacacctc tcggatgaga cttgtggcta 5581 tgttgtacac accccttaga gcaaattctg gtgaggatgt gtttacagtg tcttgcagag 5641 ttttgacccg ccctgccccc gactttgagt tcacgttctt ggtgccacca actgtggagt 5701 caaagactaa gccctttaca ctccctattc taactcttgg agaactgtcc aactccagat 5761 tccccgctcc ggtagacatg ttgtacactg accctaatga ggcaattgtg gtgcaaccac 5821 aaaatggcag gtgcactcta gatggaacac ttcaagggac cacccaactg gtacccaccc 5881 aaatttgctc cttcaggggc acgctaatta gccagacgtc gaggtctgca gattcaacag 5941 attcagcccc acgggtgagg aaccaccctc tccacgttca gctgaagaac ctcgatggga 6001 caccatatga cccaacagat gaggtgccag cagttttggg tgccatagat ttcaaaggaa 6061 ctgtgtttgg ggtcgctagt caaagaaaca ccacagggag ctctatagga gcaacccgcg 6121 cccatgaagt gcacatagat accacaaacc ctagatatac cccaaagctt ggctctgtgt 6181 tgatgtattc tgaatctact gattttgatg atggacaacc cacccgcttt acccccattg 6241 gcatgggagc 
cgatgattgg caccaatggg aattgcccga gtattctggt caccttactc 6301 ttaacatgaa tttggccccc gcagttgccc ctgctttccc tggtgagcgc attctcttct 6361 tcagatcagt ggtgccatct gctggtggct atggatcagg tcacatagat tgcctcatcc 6421 cacaggagtg ggttcagcat ttctaccagg aggccgctcc atcacagtct gcggtggctc 6481 ttatcagata tgtcaaccct gacactggaa gaaacatctt tgaggcaaaa ctgcatagag 6541 aagggttcat cactgtggca aattctggaa acaaccccat agttgtgccc ccaaatggtt 6601 acttcaggtt tgaggcctgg gtcaatcaat tctacacact cacccccatg ggaattggac 6661 aggggcgcag aagagctcaa taatggcagg agctttctta gctggattgg caggtgacgc 6721 cataacaagt ggtgtcgggt ccctaatcaa tgctggggcc aatgcaatta accaaaaggt 6781 agagtatgat tttaacaaac agctccaatt ggcatccttc aaacatgaca aggagatgtt 6841 gcagtcacaa gtactggcca ctaaacagct ccagcaggaa atgatgaaca taaggcaagg 6901 ggtcttgtcc gctggcggct tttcccctac agatgccgcg agaggtgctg tgaacgcgcc 6961 gatgacaaaa attttggatt ggaatggaac aaggtacttc gccccaaata gtatgaagac 7021 aaccagttat tcgggccagt tttccagcaa ccctgtacat aggcaaccca ctttaccccg 7081 ttctgacccc ccaaaaatca aggttaatag tgattctgct agtgtgtata gttctgcatc 7141 ttctgctcct tcacaatcaa cccactcaac ggctttgact gcggggtcta gctcatctag 7201 gacaacatcc tcttctgcag cttccactct atctagaacc agtgactggg tgagaggaca 7261 aaatgagagg cttagcccgt ttatggatgg tgctctccaa acaacttttg tcacaccacc 7321 atcgagcaag gcttcatcat atgggtcggt ctcaaccgtt cccaaagctg ttttggactc 7381 ctggactcct atgttcaata cccataggca gcctctcttc gctcatgtgc g //
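
The ORF coordinate summary at the top of this record (ORF1: 1..5059, ORF2: 5040..6683, ORF3: 6683..7431) can be checked with a few lines of Python. The following is only a minimal sketch: the helper names `parse_orfs` and `overlap` are hypothetical and are not part of the typing tool, and the coordinates are assumed to be 1-based and inclusive, as in GenBank feature locations.

```python
import re


def parse_orfs(summary):
    """Parse a summary such as 'ORF1: 1..5059 ORF2: 5040..6683'.

    Returns a dict mapping ORF name to an inclusive, 1-based (start, end)
    tuple. Partial-feature markers ('<' and '>') are tolerated and ignored.
    """
    pattern = re.compile(r"(ORF\d+):\s*<?(\d+)\.\.>?(\d+)")
    return {name: (int(start), int(end))
            for name, start, end in pattern.findall(summary)}


def overlap(a, b):
    """Number of positions shared by two inclusive (start, end) ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


summary = "ORF1: 1..5059 ORF2: 5040..6683 ORF3: 6683..7431"
orfs = parse_orfs(summary)

# Length in nucleotides of each annotated ORF span.
lengths = {name: end - start + 1 for name, (start, end) in orfs.items()}

# Overlap between consecutive ORFs.
orf1_orf2 = overlap(orfs["ORF1"], orfs["ORF2"])
orf2_orf3 = overlap(orfs["ORF2"], orfs["ORF3"])

print(lengths)
print("ORF1/ORF2 overlap:", orf1_orf2, "nt")
print("ORF2/ORF3 overlap:", orf2_orf3, "nt")
```

With the coordinates given in this record, the spans work out to 5059, 1644 and 749 nt, with ORF1 and ORF2 sharing 20 nt and ORF2 and ORF3 sharing a single nucleotide at position 6683. Because ORF1 and ORF3 are annotated as partial in this entry, their lengths describe the sequenced portion only, not the full genes.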