Complete norovirus genomes

| MN416765 | GI.4 | GI.P4 |
|---|---|---|

ORF1: 1..1763
ORF2: 1747..3381
ORF3: 3381..3903
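
The ORF coordinates listed above imply that neighbouring reading frames overlap. As a rough illustration, the sketch below computes the overlap lengths and the reading frame of each ORF from those coordinates. The helper names `overlap` and `frame` are hypothetical, and the ORF1 offset is an assumption taken from the `/codon_start=3` qualifier in the record, since ORF1 is 5'-partial.

```python
# ORF coordinates (1-based, inclusive) as listed for MN416765.
# The third value is the codon_start from the record (ORF1 is 5'-partial).
ORFS = {
    "ORF1": (1, 1763, 3),
    "ORF2": (1747, 3381, 1),
    "ORF3": (3381, 3903, 1),
}

def overlap(a, b):
    """Number of nucleotides shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

def frame(start, codon_start):
    """Reading frame (1, 2 or 3) relative to position 1 of the sequence."""
    first_codon_pos = start + codon_start - 1
    return (first_codon_pos - 1) % 3 + 1

overlaps = {
    "ORF1/ORF2": overlap(ORFS["ORF1"], ORFS["ORF2"]),
    "ORF2/ORF3": overlap(ORFS["ORF2"], ORFS["ORF3"]),
}
frames = {name: frame(s, cs) for name, (s, e, cs) in ORFS.items()}

print(overlaps)
print(frames)
```

With these coordinates the sketch reports a 17-nt ORF1/ORF2 overlap and a 1-nt ORF2/ORF3 overlap, with ORF1 and ORF3 in frame 3 and ORF2 in frame 1.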
LOCUS MN416765 3903 bp RNA linear VRL 31-DEC-2019
DEFINITION Norovirus GI isolate G19_044 nonstructural polyprotein (ORF1) gene,
partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene,
partial cds.
ACCESSION MN416765
VERSION MN416765.1
KEYWORDS .
SOURCE Norovirus GI
ORGANISM Norovirus GI
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 3903)
AUTHORS Strubbia,S., Schaeffer,J. and Le Guyader,F.S.
TITLE Metagenomic to detect norovirus and Human enteric viruses in
oysters: impact on hexamer selection and targeted capture-based
enrichment
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 3903)
AUTHORS Strubbia,S., Schaeffer,J. and Le Guyader,F.S.
TITLE Direct Submission
JOURNAL Submitted (06-SEP-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
44311, France
COMMENT ##Assembly-Data-START##
Assembly Method :: SPAdes v. v3.12.0
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..3903
/organism="Norovirus GI"
/mol_type="genomic RNA"
/isolate="G19_044"
/isolation_source="digestive tissue"
/host="shellfish"
/db_xref="taxon:122928"
/geo_loc_name="France"
/collection_date="11-Nov-2018"
/note="genotype: GIP4, GI.4"
gene <1..1763
/gene="ORF1"
CDS <1..1763
/gene="ORF1"
/codon_start=3
/product="nonstructural polyprotein"
/protein_id="QEM24801.1"
/translation="IASMKIQGRLVHGQSGMLLTGANAKGMDLGTLPGDCGAPYVYKR
NNDWVVCGVHAAATKSGNTVVCAVQAGEGETTLEGGDKGHYAGHEIIKYSKGPALSTK
TKFWRSTPEPLPPGVYEPAYLGGRDPRVKGGPSLQQVLRDQLKPFAEPRGRMPEPGLL
EAAVETVTSMLEQTMDTPTPWSFSDACQSLDKTTSSGHPYHKRKNDDWNGTSFVGELG
EQAAHANNMYELGKSMKPLYTAALKDELVKPEKVYQKIKKRLLWGADLSTVIRAARAF
GPFCDAIKPHVIKLPIKVGMNSIEDGPMIYAEHAKYKNHFDADYSAWDSTQNRQIMTE
SFAIMCRLTASPELASVVAKDLLTPSEMDVGDYIIRVKEGLPSGFPCTSQVNSINHWL
ITLCAMSEVTGLSPDVIQSQSYFSFYGDDEIVSTDIDFDPARLTQVLKEYGLRPTRPD
KSEGPIMLRRQVDGLVFLRRTISKDAAGFQGRLDRGSIERQLWWTRGPNHDDPSETLI
PHPQRKVQLISLLGEASLHGEKFYRKISSKVIQEIKTGGLEMYVPGWQAMFRWMRFHD
LGLWTGDRNLLPEFVNDDGV"
mat_peptide <1..236
/gene="ORF1"
/product="Pro"
mat_peptide 237..1760
/gene="ORF1"
/product="RdRp"
gene 1747..3381
/gene="ORF2"
CDS 1747..3381
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QEM24802.1"
/translation="MMMASKDATPSADGATGAGQLVPEVNTADPIPIDPVAGSSTALA
TAGQVNLIDPWIINNFVQAPQGEFTISPNNTPGDVLFDLQLGPHLNPFLSHLSQMYNG
WVGNMRVRVVLAGNAFTAGKVIICCVPPGFQSRTLSIAQATLFPHVIADVRTLDPVEV
PLEDVRNVLYHNNDTQPTMRLLCMLYTPLRTGGASGGTDSFVVAGRVLTCPGPDFNFL
FLVPPTVEQKTRPFTVPNIPLKYLSNSRIPNPIEGMSLSPDQTQNVQFQNGRCTIDGQ
PLGTTPVSVSQLCKFRGRITSGQRVLNLTELDGSPFMAFAAPAPAGFPDLGSCDWHIE
MSKIPNSSTQNNPIVVNSVKPNSQQFVPHLSSITLDDNVSSGGDYIGTIQWTSPPSDS
GGANTNFWKIPDYGSSLAEASQLAPAVYPPGFNEVIVYFMASIPGPNQSGSPNLVPCL
LPQEYITHFISEQAPIQGEAALLHYVDPDTNRNLGEFKLYPGGYLTCVPNSSSTGPQQ
LPLDGVFVFASWVSRFYQLKPVGTAGPARGRLGVRR"
gene 3381..>3903
/gene="ORF3"
CDS 3381..>3903
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QEM24803.1"
/translation="MAQAIIGAIAASAAGSALGAGIQAGAEAALQAQRYQQDLTLQQN
SFNHDKEMLGYQVEMSNKLLAKNLNTRYSLLQAGGLSPSDAARAVAGAPVTRLVDWGG
VRVAAPQSSATTLRSGNFMTVPLPAQPKQKPLASEGYSNPAYDPMQRTASWVQSQNSS
RNWSPYHRQALQTV"
ORIGIN
1 ctatagcgtc catgaagatt cagggaaggc tggtgcatgg tcagtctggg atgctgctta
61 ctggagccaa cgctaagggg atggatttag gcaccctacc aggtgattgt ggtgcccctt
121 atgtgtacaa aaggaataat gactgggtgg tttgtggtgt acatgcagcc gccacaaagt
181 caggcaatac tgtagtgtgt gctgtccagg ctggagaggg tgaaaccacc ctagagggtg
241 gtgacaaagg ccactatgca ggccatgaaa taatcaagta tagtaaggga ccagctctat
301 ctactaagac caagttttgg aggtcaaccc ctgaaccatt gccgccaggt gtttatgaac
361 cagcatacct tgggggtcgt gatccaaggg taaaaggtgg gccctctctg cagcaggtct
421 tgcgtgatca gctgaaacca tttgctgaac cgcgaggtcg aatgccagag cctggtctac
481 ttgaggcagc tgttgaaact gtcacgtcca tgcttgaaca aacaatggat acaccaaccc
541 cctggtcttt ttctgatgca tgtcaatcac ttgacaaaac cacaagttcg ggacacccat
601 atcataaaag gaaaaatgat gattggaatg gaacctcatt tgttggagag cttggagaac
661 aggcagcaca tgccaacaac atgtacgaat taggcaaatc tatgaaacca ctatacactg
721 cagccctaaa agatgagtta gtcaagcccg agaaagttta tcaaaagatc aagaagaggc
781 ttctctgggg tgctgatctc tcaacagtta taagagctgc cagggccttt gggccatttt
841 gtgatgccat aaagcctcat gtcattaaac tccccatcaa agtcggtatg aactccatag
901 aggatggccc aatgatttat gcagagcatg caaaatataa gaaccatttt gatgcagatt
961 attcggcctg ggattctacg cagaatagac agattatgac tgagtccttt gcaatcatgt
1021 gccgcctcac ggcatcaccg gaattggctt ctgttgtggc taaagatcta ctaacgcctt
1081 cagaaatgga tgttggtgat tacatcatcc gtgtaaaaga aggcttgcca tcaggattcc
1141 cttgcacatc tcaggtgaac agtataaacc actggctgat cacactgtgt gccatgtctg
1201 aagtcacggg tttgtcaccc gatgtcatac aatcccaatc ttacttttct ttctatggtg
1261 atgatgagat agtctcaaca gacattgatt ttgaccccgc tcgcctcacc caagtgctta
1321 aggaatatgg tttgagaccc accagaccag acaagtcaga agggcccatc atgctcagaa
1381 gacaggtgga tggccttgta ttcttaagga ggacaatttc caaagatgct gcaggctttc
1441 agggaaggct agataggggc tcaattgaaa ggcagttgtg gtggactcgg ggccctaacc
1501 atgatgatcc cagtgaaact ctgattccac acccccagag gaaagttcag cttatatccc
1561 ttcttggtga ggcctccctc catggtgaga agttttacag gaagatctcc agtaaagtca
1621 tacaggagat aaagaccggt ggcttagaaa tgtatgtgcc aggttggcag gccatgttcc
1681 gctggatgcg cttccatgat ctcggattgt ggacaggaga tcgcaatctc ttgcccgaat
1741 tcgtaaatga tgatggcgtc taaggacgct acaccaagcg cagatggcgc cactggcgcc
1801 ggccagctgg taccggaggt taatacagct gaccctatac ctattgaccc tgtggctggc
1861 tcctctacag cccttgccac tgcgggccaa gttaatttga ttgatccctg gataatcaat
1921 aattttgtgc aagcccctca gggcgagttc acaatatccc caaataatac ccccggtgat
1981 gtgctatttg atttgcagtt agggcctcat ctgaaccctt tcctttccca tctttctcag
2041 atgtacaatg gctgggtggg caacatgcga gtgcgcgttg ttttggctgg caatgctttc
2101 acagctggga aggttataat ttgttgtgtt ccccctggct ttcaatctcg cactctttcc
2161 atagctcagg ctactctatt tccccatgtt attgctgatg ttaggaccct tgatcctgta
2221 gaagtgcccc ttgaagatgt taggaatgta ttgtatcaca ataatgacac tcaacctacc
2281 atgcgcctcc tttgcatgtt gtacacccct cttcggactg ggggggcgtc tggtgggact
2341 gattcttttg tggtagctgg gcgtgtgctt acttgcccag gccctgactt taacttcttg
2401 ttcctagttc cccctacagt tgagcaaaag actcgccctt tcactgtgcc taatatccct
2461 ttgaagtacc tgtctaattc taggatccca aaccctattg agggtatgtc tctgtcacct
2521 gaccagaccc aaaatgttca attccagaat ggtaggtgta caattgacgg tcagcccctc
2581 gggaccacac ctgtctcagt tagccagttg tgtaagttta ggggtaggat tacatctgga
2641 cagagggtgc tcaatttgac agaattagat ggttcacctt ttatggcctt tgccgctccc
2701 gcccccgcgg gctttccaga tcttggatct tgtgattggc atattgaaat gagcaaaatt
2761 ccaaactcta gcacccagaa caatccaatc gtagttaatt ctgtcaaacc caacagtcaa
2821 cagtttgtcc cacatttgtc aagtatcacc cttgatgata atgtttctag tgggggcgac
2881 tatattggca ctatacaatg gacctctccc ccttctgatt ccggcggtgc caacacaaac
2941 ttttggaaga ttcctgacta tgggtctagc ctagcggaag cctcacagct ggctcccgct
3001 gtttatccac ctggtttcaa tgaggtgatt gtgtatttta tggcatctat acctggcccc
3061 aatcaatctg gttcccccaa tttggtgcca tgccttctcc cccaggagta tataacacat
3121 ttcatcagtg agcaggcccc cattcagggt gaggctgcct tactccatta tgtagatcca
3181 gacaccaatc gtaatttggg tgagttcaag ttgtatcctg gtggttatct gacttgtgtc
3241 cctaacagtt ctagtacagg acctcaacaa cttcctcttg atggtgtgtt tgtctttgct
3301 tcttgggttt ctagatttta tcaattaaag cctgtgggaa cagccggacc ggctagaggt
3361 aggcttggcg tccgcagata atggcccaag ccattatagg agccattgct gcctccgcgg
3421 ctggcagtgc cctgggtgct ggcatccagg ctggtgctga ggctgcgtta caggcacaga
3481 gatatcaaca agatttgact ttacagcaaa attctttcaa tcatgataaa gagatgttag
3541 gttatcaagt agagatgtct aataagctat tagctaaaaa tcttaatacc cgctattcac
3601 ttctccaggc gggtggtctc tccccctctg atgcggctag ggctgtggcc ggggcccccg
3661 tcactaggtt ggttgactgg ggtggagtcc gtgttgcggc acctcaatca tctgccacca
3721 cactgagatc tggcaatttt atgacagtgc cacttccagc ccagccaaag cagaaacctc
3781 ttgccagcga gggatattcc aatccagctt atgaccccat gcagcgtaca gcttcttggg
3841 ttcaatccca aaattccagc cggaattggt ccccatacca caggcaagcc ctccaaactg
3901 tgt
//
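
The `/codon_start` qualifiers in the feature table determine where translation begins within each CDS. The sketch below is a minimal stand-alone translator using the standard genetic code. It illustrates the qualifier on the first nucleotides of ORF1 (`codon_start=3`) and ORF2 (`codon_start=1`), copied from the ORIGIN block. The function name `translate` and the short excerpts are illustrative choices, not part of the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(nt, codon_start=1):
    """Translate a nucleotide string, starting at the 1-based codon_start offset."""
    nt = nt.lower().replace("u", "t")[codon_start - 1:]
    protein = []
    # Step through complete codons only; unknown codons become 'X'.
    for i in range(0, len(nt) - 2, 3):
        protein.append(CODON_TABLE.get(nt[i:i + 3], "X"))
    return "".join(protein)

orf1_head = "ctatagcgtccatgaagatt"      # positions 1..20 of the sequence
orf2_head = "atgatgatggcgtctaaggacgct"  # positions 1747..1770 of the sequence

print(translate(orf1_head, codon_start=3))
print(translate(orf2_head))
```

The two printed peptides can be compared against the start of the corresponding `/translation` qualifiers in the feature table.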