Typing tool
|
Complete norovirus genomes
MN416765 | GI.4 | ||
---|---|---|---|
GI.P4 |
ORF1: 1..1763 ORF2: 1747..3381 ORF3: 3381..3903LOCUS MN416765 3903 bp RNA linear VRL 31-DEC-2019 DEFINITION Norovirus GI isolate G19_044 nonstructural polyprotein (ORF1) gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene, partial cds. ACCESSION MN416765 VERSION MN416765.1 KEYWORDS . SOURCE Norovirus GI ORGANISM Norovirus GI Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 3903) AUTHORS Strubbia,S., Schaeffer,J. and Le Guyader,F.S. TITLE Metagenomic to detect norovirus and Human enteric viruses in oysters: impact on hexamer selection and targeted capture-based enrichment JOURNAL Unpublished REFERENCE 2 (bases 1 to 3903) AUTHORS Strubbia,S., Schaeffer,J. and Le Guyader,F.S. TITLE Direct Submission JOURNAL Submitted (06-SEP-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. v3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..3903 /organism="Norovirus GI" /mol_type="genomic RNA" /isolate="G19_044" /isolation_source="digestive tissue" /host="shellfish" /db_xref="taxon:122928" /country="France" /collection_date="11-Nov-2018" /note="genotype: GIP4, GI.4" gene <1..1763 /gene="ORF1" CDS <1..1763 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="QEM24801.1" /translation="IASMKIQGRLVHGQSGMLLTGANAKGMDLGTLPGDCGAPYVYKR NNDWVVCGVHAAATKSGNTVVCAVQAGEGETTLEGGDKGHYAGHEIIKYSKGPALSTK TKFWRSTPEPLPPGVYEPAYLGGRDPRVKGGPSLQQVLRDQLKPFAEPRGRMPEPGLL EAAVETVTSMLEQTMDTPTPWSFSDACQSLDKTTSSGHPYHKRKNDDWNGTSFVGELG EQAAHANNMYELGKSMKPLYTAALKDELVKPEKVYQKIKKRLLWGADLSTVIRAARAF GPFCDAIKPHVIKLPIKVGMNSIEDGPMIYAEHAKYKNHFDADYSAWDSTQNRQIMTE SFAIMCRLTASPELASVVAKDLLTPSEMDVGDYIIRVKEGLPSGFPCTSQVNSINHWL ITLCAMSEVTGLSPDVIQSQSYFSFYGDDEIVSTDIDFDPARLTQVLKEYGLRPTRPD KSEGPIMLRRQVDGLVFLRRTISKDAAGFQGRLDRGSIERQLWWTRGPNHDDPSETLI PHPQRKVQLISLLGEASLHGEKFYRKISSKVIQEIKTGGLEMYVPGWQAMFRWMRFHD LGLWTGDRNLLPEFVNDDGV" mat_peptide <1..236 /gene="ORF1" /product="Pro" mat_peptide 237..1760 /gene="ORF1" /product="RdRp" gene 1747..3381 /gene="ORF2" CDS 1747..3381 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QEM24802.1" /translation="MMMASKDATPSADGATGAGQLVPEVNTADPIPIDPVAGSSTALA TAGQVNLIDPWIINNFVQAPQGEFTISPNNTPGDVLFDLQLGPHLNPFLSHLSQMYNG WVGNMRVRVVLAGNAFTAGKVIICCVPPGFQSRTLSIAQATLFPHVIADVRTLDPVEV PLEDVRNVLYHNNDTQPTMRLLCMLYTPLRTGGASGGTDSFVVAGRVLTCPGPDFNFL FLVPPTVEQKTRPFTVPNIPLKYLSNSRIPNPIEGMSLSPDQTQNVQFQNGRCTIDGQ PLGTTPVSVSQLCKFRGRITSGQRVLNLTELDGSPFMAFAAPAPAGFPDLGSCDWHIE MSKIPNSSTQNNPIVVNSVKPNSQQFVPHLSSITLDDNVSSGGDYIGTIQWTSPPSDS GGANTNFWKIPDYGSSLAEASQLAPAVYPPGFNEVIVYFMASIPGPNQSGSPNLVPCL LPQEYITHFISEQAPIQGEAALLHYVDPDTNRNLGEFKLYPGGYLTCVPNSSSTGPQQ LPLDGVFVFASWVSRFYQLKPVGTAGPARGRLGVRR" gene 3381..>3903 /gene="ORF3" CDS 3381..>3903 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QEM24803.1" /translation="MAQAIIGAIAASAAGSALGAGIQAGAEAALQAQRYQQDLTLQQN SFNHDKEMLGYQVEMSNKLLAKNLNTRYSLLQAGGLSPSDAARAVAGAPVTRLVDWGG VRVAAPQSSATTLRSGNFMTVPLPAQPKQKPLASEGYSNPAYDPMQRTASWVQSQNSS RNWSPYHRQALQTV" ORIGIN 1 ctatagcgtc catgaagatt cagggaaggc tggtgcatgg tcagtctggg atgctgctta 61 ctggagccaa cgctaagggg atggatttag gcaccctacc aggtgattgt ggtgcccctt 121 atgtgtacaa aaggaataat gactgggtgg tttgtggtgt acatgcagcc gccacaaagt 181 caggcaatac tgtagtgtgt gctgtccagg ctggagaggg tgaaaccacc ctagagggtg 241 gtgacaaagg ccactatgca ggccatgaaa taatcaagta tagtaaggga ccagctctat 301 ctactaagac caagttttgg aggtcaaccc ctgaaccatt gccgccaggt gtttatgaac 361 cagcatacct tgggggtcgt gatccaaggg taaaaggtgg gccctctctg cagcaggtct 421 tgcgtgatca gctgaaacca tttgctgaac cgcgaggtcg aatgccagag cctggtctac 481 ttgaggcagc tgttgaaact gtcacgtcca tgcttgaaca aacaatggat acaccaaccc 541 cctggtcttt ttctgatgca tgtcaatcac ttgacaaaac cacaagttcg ggacacccat 601 atcataaaag gaaaaatgat gattggaatg gaacctcatt tgttggagag cttggagaac 661 aggcagcaca tgccaacaac atgtacgaat taggcaaatc tatgaaacca ctatacactg 721 cagccctaaa agatgagtta gtcaagcccg agaaagttta tcaaaagatc aagaagaggc 781 ttctctgggg tgctgatctc tcaacagtta taagagctgc cagggccttt gggccatttt 841 gtgatgccat aaagcctcat gtcattaaac tccccatcaa agtcggtatg aactccatag 901 aggatggccc aatgatttat gcagagcatg caaaatataa gaaccatttt gatgcagatt 961 attcggcctg ggattctacg cagaatagac agattatgac tgagtccttt gcaatcatgt 1021 gccgcctcac ggcatcaccg gaattggctt ctgttgtggc taaagatcta ctaacgcctt 1081 cagaaatgga tgttggtgat tacatcatcc gtgtaaaaga aggcttgcca tcaggattcc 1141 cttgcacatc tcaggtgaac agtataaacc actggctgat cacactgtgt gccatgtctg 1201 aagtcacggg tttgtcaccc gatgtcatac aatcccaatc ttacttttct ttctatggtg 1261 atgatgagat agtctcaaca gacattgatt ttgaccccgc tcgcctcacc caagtgctta 1321 aggaatatgg tttgagaccc accagaccag acaagtcaga agggcccatc atgctcagaa 1381 gacaggtgga tggccttgta ttcttaagga ggacaatttc caaagatgct gcaggctttc 1441 agggaaggct agataggggc tcaattgaaa ggcagttgtg gtggactcgg ggccctaacc 1501 atgatgatcc cagtgaaact ctgattccac acccccagag gaaagttcag cttatatccc 1561 ttcttggtga ggcctccctc catggtgaga agttttacag gaagatctcc agtaaagtca 1621 tacaggagat aaagaccggt ggcttagaaa tgtatgtgcc aggttggcag gccatgttcc 1681 gctggatgcg cttccatgat ctcggattgt ggacaggaga tcgcaatctc ttgcccgaat 1741 tcgtaaatga tgatggcgtc taaggacgct acaccaagcg cagatggcgc cactggcgcc 1801 ggccagctgg taccggaggt taatacagct gaccctatac ctattgaccc tgtggctggc 1861 tcctctacag cccttgccac tgcgggccaa gttaatttga ttgatccctg gataatcaat 1921 aattttgtgc aagcccctca gggcgagttc acaatatccc caaataatac ccccggtgat 1981 gtgctatttg atttgcagtt agggcctcat ctgaaccctt tcctttccca tctttctcag 2041 atgtacaatg gctgggtggg caacatgcga gtgcgcgttg ttttggctgg caatgctttc 2101 acagctggga aggttataat ttgttgtgtt ccccctggct ttcaatctcg cactctttcc 2161 atagctcagg ctactctatt tccccatgtt attgctgatg ttaggaccct tgatcctgta 2221 gaagtgcccc ttgaagatgt taggaatgta ttgtatcaca ataatgacac tcaacctacc 2281 atgcgcctcc tttgcatgtt gtacacccct cttcggactg ggggggcgtc tggtgggact 2341 gattcttttg tggtagctgg gcgtgtgctt acttgcccag gccctgactt taacttcttg 2401 ttcctagttc cccctacagt tgagcaaaag actcgccctt tcactgtgcc taatatccct 2461 ttgaagtacc tgtctaattc taggatccca aaccctattg agggtatgtc tctgtcacct 2521 gaccagaccc aaaatgttca attccagaat ggtaggtgta caattgacgg tcagcccctc 2581 gggaccacac ctgtctcagt tagccagttg tgtaagttta ggggtaggat tacatctgga 2641 cagagggtgc tcaatttgac agaattagat ggttcacctt ttatggcctt tgccgctccc 2701 gcccccgcgg gctttccaga tcttggatct tgtgattggc atattgaaat gagcaaaatt 2761 ccaaactcta gcacccagaa caatccaatc gtagttaatt ctgtcaaacc caacagtcaa 2821 cagtttgtcc cacatttgtc aagtatcacc cttgatgata atgtttctag tgggggcgac 2881 tatattggca ctatacaatg gacctctccc ccttctgatt ccggcggtgc caacacaaac 2941 ttttggaaga ttcctgacta tgggtctagc ctagcggaag cctcacagct ggctcccgct 3001 gtttatccac ctggtttcaa tgaggtgatt gtgtatttta tggcatctat acctggcccc 3061 aatcaatctg gttcccccaa tttggtgcca tgccttctcc cccaggagta tataacacat 3121 ttcatcagtg agcaggcccc cattcagggt gaggctgcct tactccatta tgtagatcca 3181 gacaccaatc gtaatttggg tgagttcaag ttgtatcctg gtggttatct gacttgtgtc 3241 cctaacagtt ctagtacagg acctcaacaa cttcctcttg atggtgtgtt tgtctttgct 3301 tcttgggttt ctagatttta tcaattaaag cctgtgggaa cagccggacc ggctagaggt 3361 aggcttggcg tccgcagata atggcccaag ccattatagg agccattgct gcctccgcgg 3421 ctggcagtgc cctgggtgct ggcatccagg ctggtgctga ggctgcgtta caggcacaga 3481 gatatcaaca agatttgact ttacagcaaa attctttcaa tcatgataaa gagatgttag 3541 gttatcaagt agagatgtct aataagctat tagctaaaaa tcttaatacc cgctattcac 3601 ttctccaggc gggtggtctc tccccctctg atgcggctag ggctgtggcc ggggcccccg 3661 tcactaggtt ggttgactgg ggtggagtcc gtgttgcggc acctcaatca tctgccacca 3721 cactgagatc tggcaatttt atgacagtgc cacttccagc ccagccaaag cagaaacctc 3781 ttgccagcga gggatattcc aatccagctt atgaccccat gcagcgtaca gcttcttggg 3841 ttcaatccca aaattccagc cggaattggt ccccatacca caggcaagcc ctccaaactg 3901 tgt //