Typing tool | Complete norovirus genomes

| Accession | Capsid genotype | Polymerase type |
|---|---|---|
| MK956197 | GII.6 | GII.P7 |
ORF1: 1..1617
ORF2: 1598..3241
ORF3: 3241..3720

LOCUS       MK956197                3720 bp    RNA     linear   VRL 12-NOV-2019
DEFINITION  Norovirus GII isolate G19_007 nonstructural polyprotein (ORF1) gene,
            partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene,
            partial cds.
ACCESSION   MK956197
VERSION     MK956197.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 3720)
  AUTHORS   Strubbia,S., Schaeffer,J., Oude Munnink,B.B., Besnard,A.,
            Phan,M.V.T., Nieuwenhuijse,D.F., de Graaf,M., Schapendonk,C.M.E.,
            Wacrenier,C., Cotten,M., Koopmans,M.P.G. and Le Guyader,F.S.
  TITLE     Metavirome Sequencing to Evaluate Norovirus Diversity in Sewage and
            Related Bioaccumulated Oysters
  JOURNAL   Front Microbiol 10, 2394 (2019)
   PUBMED   31681246
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 3720)
  AUTHORS   Strubbia,S., Schaeffer,J. and Le Guyader,S.F.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. v3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..3720
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_007"
                     /isolation_source="sewage"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="22-Mar-2018"
                     /note="genotype: GII.6-GII.P7"
     gene            <1..1617
                     /gene="ORF1"
     CDS             <1..1617
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QCT04936.1"
                     /translation="AGVHTAAARGGNTVICATQGQDGEAVLEGNENLGTYCGAPILGP
                     GKAPKLSTKTKFWRSSPDALPPGTYEPAYLGGKDPRVEKGPSLQQVMRDQLKPFTEPR
                     GKPPRPAVLEEAKKTVMNVLEQTIDPAKPWTYSQACASLDKTTSSGSPHHVRKNDHWN
                     GESFTGPLADQASKANLMYEQAKHVQPVYTAALKDELVKTDKIYKKIKKRLLWGSDLG
                     TMIRCARAFGGLMDSMKASCITLPCRVGMNMNEDGPIIFDKHSKYRYHYDADYSRWDS
                     TQQRSILSAAMEVMVRFSAEPELAQVVAEDLLAPSQLDVGDFVISVQEGLPSGVPCTS
                     QWNSIAHWILTLSAMAEVSGLSPDVVQAHSCFSFYGDDEIVSTDINLDPMKLTQKLRE
                     YGLVPTRPDKTEGPLVITEDLTGLTFLRRSIARDPAGWFGKLDQDSILRQLYWTRGPN
                     HENPYESMVPHSQRATQLMALLGEASLHGPQFYKKVSKMVINEIKSGGLEFYVPRQEA
                     MFRWMRFSDLSTWEGDRNLAPEGVNEDGVE"
     mat_peptide     <1..84
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     85..1614
                     /gene="ORF1"
                     /product="RdRp"
     gene            1598..3241
                     /gene="ORF2"
     CDS             1598..3241
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCT04937.1"
                     /translation="MKMASNDAAPSNDGAANLVPEANNEVMALEPVVGASIAAPVVGQ
                     QNIIDPWIRENFVQAPQGEFTVSPRNSPGEMLLNLELGPELNPYLSHLSRMYNGHAGG
                     MQVQVVLAGNAFTAGKIIFAAVPPHFPVENISAAQITMCPHVIVDVRQLEPVLLPLPD
                     IRNRFFHYNQENTPRMRLVAMLYTPLRANSGEDVFTVSCRVLTRPAPDFEFTFLVPPT
                     VESKTKPFTLPILTLGELSNSRFPAPIDMLYTDPNEGIVVQPQNGRCTLDGTLQGTTQ
                     LVPTQICAFRGTLIGQTSRSSDSTDSAPRRRDHPLHVQLKNLDGTQYDPTDEVPAVLG
                     AIDFKGTVFGVASQRDVSGQQVGATRAHEVHINTTDPRYTPKLGSILMHSESDDFVTG
                     QPVRFTPIGMGDNDWHQWELPDYSGRLTLNMNLAPAVAPAFPGERILFFRSIVPSAGG
                     YGSGQIDCLIPQEWVQHFYQEAAPSQSAVALIRYVNPDTGRNIFEAKLHREGFITVAN
                     SGNNPIVVPPNGYFRFEAWVNQFYTLTPMGTGQGRRRNQ"
     gene            3241..>3720
                     /gene="ORF3"
     CDS             3241..>3720
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCT04938.1"
                     /translation="MASAFLAGLAGDVITNGVGSLINAGANAVNQKVEYDFNKQLQMA
                     SFKHDKEMLQSQVLATKQLQQEMMNIRQGVLTAGGFSPADAARGAVNAPMTKILDWSG
                     TRYWAPNSMKTTSYSGQFSSSPVHKSPAPPQHAVSSKSRLQNDSASVYSSPSSVSSQS"
ORIGIN
        1 gcgggtgtcc acaccgcagc agcccgggga ggcaacactg tcatatgtgc cactcaaggg
       61 caagatgggg aggcagtcct tgagggaaat gagaaccttg gaacttactg cggtgcccca
      121 attttgggtc caggcaaggc gcccaaactt agcacaaaga ccaagttctg gcgttcgtca
      181 ccagacgctc tgccgccagg cacttatgag cctgcctact tgggaggcaa ggaccctaga
      241 gtagaaaagg gaccatccct gcagcaagtc atgagagacc aactaaaacc tttcacagaa
      301 cccaggggca agccacctag acccgcagtc ttagaagaag ccaaaaagac agtgatgaat
      361 gttctagaac aaaccattga ccctgctaag ccatggacct actcccaagc atgcgcctca
      421 ctggacaaga ccacctccag tggtagccct catcatgtca ggaaaaatga ccattggaat
      481 ggagaatcct ttactggccc ccttgcagac caagcatcca aagccaacct catgtacgag
      541 caggccaaac atgtgcagcc cgtgtacacg gccgcactca aagatgagct agtcaagact
      601 gacaagatct ataagaagat aaagaaaagg ctcttatggg ggtcggatct tggtacaatg
      661 atcaggtgtg ccagggcctt tggtggtctc atggatagca tgaaggcaag ttgcataacc
      721 ctcccatgta gggtgggaat gaacatgaat gaagatggtc ccatcatatt tgacaaacac
      781 tctaagtata ggtaccatta tgatgctgac tattccaggt gggactcaac ccagcaaagg
      841 agcattctct cggccgctat ggaagtgatg gtgcggttct ctgccgaacc agagctggca
      901 caagtggttg cagaggacct cctggcaccc agccaactag atgttggcga ctttgtcatc
      961 tcagttcagg agggtctgcc gtcaggggta ccatgtacat cacaatggaa ttcaatagca
     1021 cattggatcc taactttgag tgcaatggca gaagtgtcgg gtctctcacc agatgttgtt
     1081 caagcccact cctgtttctc attttacggt gatgatgaga tcgtcagcac tgacatcaac
     1141 cttgatccca tgaagttgac acagaaactc agagagtatg gcctagtccc tactcgacct
     1201 gacaaaactg agggtcccct tgtaataact gaagatctca ccggcctaac gttcctgcgt
     1261 aggtcaattg cacgggatcc agctgggtgg tttggaaaat tagaccaaga ctcaatcctc
     1321 aggcaattgt actggacaag gggccccaat catgagaacc catatgagag catggtcccc
     1381 cattcccagc gggccacaca gcttatggcc cttctcggtg aggcttcact gcatggcccc
     1441 cagttttaca agaaggtcag caagatggtc attaacgaaa tcaaaagtgg tggtctggaa
     1501 ttctatgtgc ccagacaaga ggccatgttc agatggatga gattctctga cctcagcaca
     1561 tgggagggcg atcgcaatct tgctcccgag ggtgtgaatg aagatggcgt cgaatgacgc
     1621 tgctccatcg aatgatggtg ctgccaacct cgtaccagag gccaacaatg aggttatggc
     1681 acttgagccg gtggtgggag cctcaattgc tgctcctgtc gtcggccaac aaaacataat
     1741 tgacccctgg attagagaaa attttgttca ggcaccacag ggtgagttta ctgtttcacc
     1801 aaggaactca cctggtgaga tgcttctaaa tcttgaatta ggccctgagc tcaatcctta
     1861 tctgagtcac ttgtcccgca tgtacaatgg tcatgctggc ggcatgcagg ttcaggtggt
     1921 cctagctggg aatgcgttca cagctgggaa aatcatcttt gccgctgtgc caccacattt
     1981 ccccgtggaa aacatcagtg cagctcaaat aactatgtgc ccccatgtga ttgttgatgt
     2041 gagacaactt gagccagtac tcctacccct tcctgacata aggaataggt tctttcatta
     2101 caatcaggag aacactcccc ggatgagact tgtggctatg ctttacaccc ccctgagggc
     2161 caactctggt gaagatgtgt ttactgtctc ttgtagggtc ttaacccgtc ctgcccctga
     2221 ttttgaattt actttcttgg taccaccaac tgttgaatca aagactaagc cttttacact
     2281 acccatatta actcttggtg agctatctaa ttccagattc ccagccccaa tagatatgtt
     2341 gtacactgat ccaaatgagg gaattgtggt ccaaccacaa aatggtaggt gcactcttga
     2401 tggcactctg caaggcacca cacaactggt ccccacccaa atttgtgctt tcagaggcac
     2461 actaattggc caaacatcaa gatcttcaga ctcaaccgac tcagcccctc ggaggaggga
     2521 tcacccactc catgttcaat taaagaacct tgatggcacg cagtatgacc ccactgatga
     2581 agtgccagca gtcctcggtg ccattgattt caaggggact gtctttgggg tggccagtca
     2641 gagggacgtg tcaggacaac aggtgggagc aactcgagcc catgaagtgc acatcaacac
     2701 aaccgatcct aggtatacac caaaactagg gtccattctc atgcactcag agtcggacga
     2761 cttcgtgact ggacagccgg tccgcttcac acccatagga atgggcgaca acgactggca
     2821 tcagtgggag ctgcccgact attctggacg cctaacccta aacatgaacc ttgccccagc
     2881 agttgctcct gcattcccgg gtgagaggat tcttttcttc aggtcaattg tcccgtctgc
     2941 tggtggctac ggctctgggc aaatagattg cctcatacca caggagtggg ttcagcattt
     3001 ctaccaagaa gctgcaccat cccaatctgc cgtggcactc atcaggtatg tcaaccctga
     3061 cacaggcaga aacatctttg aggctaaatt gcacagggaa ggcttcatca ccgtggctaa
     3121 ttctggcaac aaccccattg ttgtcccccc taatgggtat tttaggtttg aggcttgggt
     3181 gaatcaattt tacactttga cccccatggg aactggtcag gggcgtagga ggaatcaata
     3241 atggctagtg cttttcttgc aggtcttgct ggtgacgtca taacaaatgg cgttggatct
     3301 ctaataaatg ctggagctaa tgcagttaat cagaaagttg aatatgattt taataaacag
     3361 cttcaaatgg catcatttaa acatgataaa gagatgttgc aatcacaagt gctggcaacc
     3421 aagcagttgc agcaggagat gatgaacatt aggcaggggg tgttgaccgc tggcggcttc
     3481 tcccccgcgg atgctgctag aggggctgtc aatgccccaa tgacaaagat tctggactgg
     3541 agcggcacca ggtattgggc accaaacagc atgaaaacca caagttattc aggacagttt
     3601 tctagtagcc ctgttcataa gtctcctgct cctccccagc atgctgtttc atcaaagagt
     3661 agattgcaaa atgattctgc tagtgtatat agttctcctt cttctgtttc ttcacaatca
//
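
The ORF coordinates given in the record (ORF1 1..1617, ORF2 1598..3241, ORF3 3241..3720) overlap at both junctions. The following is a minimal Python sketch of that arithmetic, not part of the typing tool itself: the helper names `span_length`, `overlap`, `frame`, and `summarize` are hypothetical, and the frame calculation assumes each ORF is read from its listed start position, which matches the `/codon_start=1` qualifiers in the record.

```python
from itertools import combinations

# ORF coordinates (1-based, inclusive), copied from the record's ORF summary.
ORFS = {
    "ORF1": (1, 1617),
    "ORF2": (1598, 3241),
    "ORF3": (3241, 3720),
}


def span_length(span):
    """Length in nucleotides of an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def frame(span):
    """Reading frame (0, 1, or 2) relative to position 1 of the sequence.

    Assumes translation begins at the span's start (codon_start=1).
    """
    return (span[0] - 1) % 3


def summarize(orfs):
    """Return ({name: (length, frame)}, {(name_a, name_b): overlap})."""
    per_orf = {name: (span_length(s), frame(s)) for name, s in orfs.items()}
    pairs = {
        (a, b): overlap(orfs[a], orfs[b]) for a, b in combinations(orfs, 2)
    }
    return per_orf, pairs


if __name__ == "__main__":
    per_orf, pairs = summarize(ORFS)
    for name, (length, fr) in per_orf.items():
        print(f"{name}: {length} nt ({length // 3} codons), frame {fr}")
    for (a, b), shared in pairs.items():
        print(f"{a}/{b} overlap: {shared} nt")
```

With these coordinates, ORF1 and ORF2 share 20 nt (1598..1617) and sit in different frames, while ORF2 and ORF3 share the single nucleotide at position 3241, where the last base of the ORF2 stop codon is also the first base of the ORF3 start codon.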