Typing tool

Complete norovirus genomes

| MK907788 | GII.4 Sydney | GII.P31 |
|---|---|---|

ORF1: 1..5038
ORF2: 5019..6641
ORF3: 6641..6826
LOCUS MK907788 6826 bp RNA linear VRL 02-NOV-2019
DEFINITION Norovirus GII isolate G19_017 nonstructural polyprotein (ORF1)
gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3)
gene, partial cds.
ACCESSION MK907788
VERSION MK907788.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 6826)
AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
Guyader,S.
TITLE Optimisation of agnostic metagenomic approaches to characterise
human enteric viruses in sewage
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 6826)
AUTHORS Le Guyader,S. and Strubbia,S.
TITLE Direct Submission
JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
44311, France
COMMENT ##Assembly-Data-START##
Assembly Method :: SPAdes v. 3.12.0
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..6826
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="G19_017"
/isolation_source="sewage"
/db_xref="taxon:122929"
/geo_loc_name="France: Nantes"
/collection_date="08-Jan-2014"
/note="genotype: GII.4-GII.Pe"
gene <1..5038
/gene="ORF1"
CDS <1..5038
/gene="ORF1"
/codon_start=2
/product="nonstructural polyprotein"
/protein_id="QCO93064.1"
/translation="KSSSDGVLSSMAVTFKRALGARPKQPPPKEIPPRPPRPPTPDLV
KKIPPPPPNGEDELVVSYRAKDGVSGLPELTTVRQPEETNTAFSVPPLNQRESRDAKE
PLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELAPLSLFWRPVYTPQY
LISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWN
RKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIV
ESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTK
EKIGKMLSSAASTLRACKDLGAYGLEILKLIMKWFFPKKEEANELAMVRSIEDAVLDL
EAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINSLLARIAAA
RSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNG
VDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKVFDSDAI
IITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFS
HIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQGPALTTF
NFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNG
GSYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKCVQEALYSIIQI
AGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGCLKPKDDEEFVVSSDDIKTEGK
KGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIA
RATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWA
DDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVP
IKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRPTGELMPLAARM
GTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTA
AARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSAKTKFWRSSTTPL
PPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVL
EQTIDPPQRWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASKANLMFE
EGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHC
VTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPE
PHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTD
LSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVVSED
LDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMS
LLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLA
PSFVNEDGVE"
mat_peptide <1..928
/gene="ORF1"
/product="p48"
mat_peptide 929..2026
/gene="ORF1"
/product="NTPase"
mat_peptide 2027..2563
/gene="ORF1"
/product="p22"
mat_peptide 2564..2962
/gene="ORF1"
/product="VPg"
mat_peptide 2963..3505
/gene="ORF1"
/product="Pro"
mat_peptide 3506..5035
/gene="ORF1"
/product="RdRp"
gene 5019..6641
/gene="ORF2"
CDS 5019..6641
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QCO93065.1"
/translation="MKMASSDVNPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSRNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6641..>6826
/gene="ORF3"
CDS 6641..>6826
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QCO93063.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKE"
ORIGIN
1 aaaatcttca agtgacggtg tgctttctag catggctgtc acttttaagc gggccctcgg
61 ggcgcggcct aaacagccgc ccccgaagga gataccaccc agacccccgc gaccacccac
121 gccagacttg gttaaaaaga tccctcctcc cccacccaac ggggaggatg aactagtggt
181 ctcttaccgc gccaaagatg gcgtttccgg actgcctgag ctcaccactg tcagacaacc
241 ggaagaaacc aacacggcgt tcagtgtccc cccactcaac caaagggaga gcagggacgc
301 caaggagcca ctaactggga caatcattga aatgtgggat ggagaaatct accattacgg
361 cctgtacgtg gaacgaggtc tcatacttgg tgtgcacaag ccaccggcag ccattagcct
421 tgccaaggtc gagctagcac cgctctcttt gttctggaga cctgtataca ccccccagta
481 tctcatctct ccagacactc ttaggagatt acatggagag tcattcccct acactgcatt
541 tgacaacaat tgctacgcct tttgttgttg ggtattagac ctaaacgact catggctaag
601 caggagaatg attcagagaa caacaggctt cttcaggccg taccaggatt ggaacaggaa
661 acccctcccc actatggatg attccaaatt aaagaaggta gccaacatat tcttgtgcac
721 tttgtcttca ctattcacca gacccattaa ggacataata gggaagttga aacctcttaa
781 catactcaac attctggcca catgtgattg gaccttcgca ggcatagtgg aatccttaat
841 actcttggca gaactctttg gagttttctg gacaccccca gatgtgtctg cgatgatcgc
901 ccccttgcta ggtgattatg aactgcaagg acctgaggac cttgcagtgg aactggtccc
961 aatagtgatg ggggggatag gtttggtgct aggatttacc aaagagaaaa tcggaaagat
1021 gctatcatcc gctgcatcca ctttaagagc ttgtaaagac cttggtgcat acggactgga
1081 aatcttaaaa ttgatcatga agtggttctt cccaaagaaa gaggaagcaa atgaactggc
1141 tatggtgaga tccatcgagg atgcagtact agacctcgag gcaattgaaa acaaccacat
1201 gaccacccta ctcaaagaca aagacagctt ggcaacctac atgagaaccc ttgaccttga
1261 ggaggagaaa gccagaaaac tctcaaccaa atctgcttca cccgatattg tgggcacaat
1321 caactctctt ctggcaagaa tcgctgctgc acgctcccta gtgcatcggg cgaaagaaga
1381 gctctccagc aggccgagac ctgtcgttgt gatgatatcg ggaagaccag ggatagggaa
1441 aactcacctt gccagggagc tggccaagaa gatcgcggcc tccctcacag gggaccagcg
1501 tgtgggtctt atcccacgca atggtgtcga ccactgggac gcatacaagg gcgaaagagt
1561 tgtcctatgg gacgactatg gaatgagcaa ccccatccat gatgccctca ggctgcagga
1621 gcttgctgac acttgccccc tcacgctaaa ttgtgacaga attgagaaca aagggaaagt
1681 ctttgacagt gatgccataa ttatcaccac caacctggcc aacccagcac cactggatta
1741 tgtcaacttt gaagcgtgct cgagacgcat tgatttcctc gtgtacgcag aagcccctga
1801 ggtggagaag gcaaagcgcg acttcccagg tcaacctgac atgtggaaga acgctttcag
1861 tcctgacttc tcacacataa aactgtcatt ggctccacag ggtggttttg acaagaacgg
1921 caacaccccg catggaaaag gggtcatgaa gaccctcacc actggctccc tcatcgcccg
1981 agcatcaggg ttactccatg agaggctaga tgaatatgaa ctgcaaggcc cagccctcac
2041 cactttcaac tttgaccgca acaagatact tgcttttaga cagcttgctg ctgaaaacaa
2101 gtatgggctg atggacacaa tgagagttgg aaaacagctc aaggatgtca agaccatgtc
2161 agacctcaaa caagcactca agaacatcgc gatcaagaag tgccagatag tgtacaatgg
2221 tggctcctac acacttgagg ctgatggcaa gggtagtgtg aaagttgaca aagtgcaaag
2281 tgccactgtg cagaccaaca atgaattagc cggtgcccta caccacctaa ggtgcgctag
2341 aatcagatac tatgttaagt gcgtccagga ggcgctgtat tccatcatcc aaatcgctgg
2401 ggctgcattc gtcaccacgc gcatcgctaa gcgcatgaat atacagaatc tctggtccaa
2461 gccacaggtg gaagacacag aagagatggc caacaaagat ggttgcctaa aacccaaaga
2521 tgatgaagag tttgtcgtct catccgacga catcaaaact gagggcaaga aagggaaaaa
2581 caagtccggc cgtggcaaga agcacacagc cttttcaagt aaagggctca gtgatgagga
2641 gtacgatgag tacaagagaa tcagagaaga aaggaatggt aagtactcca tagaagagta
2701 ccttcaggac agagacaggt actacgagga ggtggccatt gccagggcaa ccgaagagga
2761 cttctgtgaa gaagaagagg ccaaaatccg gcagagaatt ttcagaccaa caagaaaaca
2821 acgcaaagaa gagagggcct ctctcggctt ggtcacaggc tctgaaatca ggaagagaaa
2881 cccagaagac ttcaaaccca agggaaagct gtgggctgat gacgacagaa gtgttgacta
2941 taatgagaaa ctcaactttg aggccccacc aagcatctgg tcgcgaatag tcaactttgg
3001 ttcaggctgg ggcttctggg tctcccccag tctgtttata acatcaaccc atgtcatacc
3061 ccaaggtgca aaagagttct tcggagtccc tatcaagcaa atccagatac acaagtcagg
3121 tgaattctgc cggttgagat tcccaaagcc aatcagaact gatgtgacgg gcatgattct
3181 agaagaaggt gcgcccgagg ggaccgtggc cacactgctc atcaagagac caactggaga
3241 gctcatgcct ctggcagcca gaatggggac ccatgcaacc atgaaaattc aggggcgcac
3301 agttggaggg caaatgggta tgctcctgac aggatccaac gccaagagta tggacctagg
3361 cacaacgcca ggcgactgcg gctgccccta catctacaag agggggaatg actacgtggt
3421 cataggagtc catacggccg ctgcccgtgg aggaaacact gtcatatgtg ccacccaggg
3481 gagtgaggga gaagccacac ttgaaggagg tgatagtaaa gggacatact gtggcgcacc
3541 gatcttgggc ccagggagcg ctccgaagct cagtgccaag actaagtttt ggagatcatc
3601 cacaacacca ctcccacctg gcacctacga accagcctac ctcggtggca aagaccctag
3661 agtcaaaggt ggcccttcat tgcaacaagt tatgagggac cagctgaagc cattcacaga
3721 acccagaggc aaaccaccaa gaccaaatgt gttggaagct gccaagaaaa ccatcatcaa
3781 tgtccttgag caaacaattg atccacccca aagatggtca tttgcgcaag cttgcgcatc
3841 ccttgacaaa accacctcca gcggccaccc gcaccacatg cggaaaaacg actgttggaa
3901 tggggagtcc ttcacaggaa aattggctga tcaagcctcc aaggccaacc taatgtttga
3961 agagggaaag aacatgactc cagtctacac aggtgcactt aaagatgagt tggtgaagac
4021 cgataaagtt tatggtaagg tcaagaagag gcttctgtgg ggttcagatc tggcgaccat
4081 gatacggtgc gcccgagctt ttggaggcct tatggatgaa ctcaaggcgc actgtgtcac
4141 acttcctgtc agagttggta tgaacatgaa tgaggatggc cccatcatct ttgagaagca
4201 ctccagatat agatatcact atgatgctga ttattcccgg tgggactcaa cacaacaaag
4261 ggatgtgcta gcagcagcac tagaaatcat ggttaagttc tctccagaac cacacctggc
4321 ccagatagtt gcagaagacc tcctttcccc tagcgtgatg gatgtaggtg actttcaaat
4381 atcaataagt gagggtctcc cctctggggt accttgtacc tcccagtgga attccatcgc
4441 ccactggctc ctcaccctgt gtgcactctc tgaagtcacg gacctgtccc ccgatatcat
4501 tcaggccaac tcccttttct ccttctatgg tgatgatgag attgtaagca cagacataaa
4561 attggaccca gagaagctga cagcaaagct caaggagtac gggctgaaac caacccgccc
4621 cgacaaaact gaaggacccc ttgttgtctc tgaagacctg gatggcctga cattcctccg
4681 gagaactgtg acccgtgatc cagctggctg gtttggaaaa ttggaacaaa gttcaattct
4741 caggcaaatg tactggacca ggggtcccaa ccatgaagat ccatttgaaa caatgatacc
4801 acactcccaa agacccatac aattgatgtc cttgctgggc gaggctgcgc tccacggccc
4861 ggcattctat agcaaaatta gcaaattagt cattgcagag ttgaaggaag gtggcatgga
4921 tttttacgta cccagacaag agccaatgtt cagatggatg aggttctcag atctgagcac
4981 gtgggagggc gatcgcaatc tggctcccag ttttgtgaat gaagatggcg tcgagtgacg
5041 tcaacccatc tgatgggtcc gcagccaacc tcgtcccaga ggtcaacaat gaggttatgg
5101 ctctggagcc cgttgttggt gccgccattg cggcacctgt agcgggccaa caaaatgtaa
5161 ttgacccctg gattagaaat aattttgtac aagcccctgg tggagagttt acagtatccc
5221 ctagaaacgc tccaggtgaa atactatgga gcgcgccctt gggccctgat ctaaacccct
5281 acctatccca tttggccaga atgtacaatg gttatgcagg tggttttgaa gtgcaggtaa
5341 ttctcgcggg gaacgcgttc accgccggga aggtcatatt tgcagcagtc ccaccaaatt
5401 ttccaactga aggcttgagc cccagccagg tcactatgtt cccccatata gtagtagatg
5461 ttaggcaact agaacctgtg ttgattccct tacccgatgt taggaataat ttctatcatt
5521 acaatcaatc aaatgacccc accattaagt tgatagcaat gttgtacaca ccacttaggg
5581 ctaataatgc tggggatgat gtcttcacag tttcttgccg agttctcacg agaccatccc
5641 ccgattttga tttcatattt ctagtgccac ccacagttga gtcaagaact aaaccattct
5701 ctgtcccagt tttaactgtt gaggagatga ccaattcaag attccccatt cctttggaaa
5761 agttgttcac gggtcccagc agtgcctttg ttgtccaacc acaaaacggt aggtgcacga
5821 ctgatggcgt gctcctaggc accacccaac tgtctcctgt caacatctgc accttcagag
5881 gagatgtcac ccatatcaca ggtagtcgta actacacaat gaatttggct tctcaaaatt
5941 ggagcaatta tgacccaaca gaagaaatcc cagcccccct aggaactcca gattttgtgg
6001 ggaagattca aggcgtgctc acccaaacca caaggacaga tggctcaaca cgcggccaca
6061 aagccacagt gtacactggg agcgccgact ttgctccaaa actgggtaga gttcaatttg
6121 aaactgacac agaccatgat tttgaagcta accaaaacac aaaattcacc ccagttggtg
6181 tcatccaaga tggtagcacc acccaccgaa atgaacccca acagtgggtg ctcccaagtt
6241 actcaggcag aaatactcct aatgtgcatc tggcccccgc tgtagccccc acttttccgg
6301 gtgagcaact tctcttcttc agatccacca tgcccggatg cagcgggtac cccaacatgg
6361 atttggactg tctgctcccc caggaatggg tgcagtactt ctaccaagag gcagccccag
6421 cacaatctga tgtggctctg ctaagatttg tgaatccaga cacaggtagg gttttgtttg
6481 agtgtaagct tcataaatca ggctatgtta cagtggctca cactggccaa catgatttgg
6541 ttatcccccc caatggttat tttaggtttg attcctgggt caaccagttt tacacgcttg
6601 cccccatggg aaatggaacg gggcgtagac gtgcactata atggctggag ctttctttgc
6661 tggattggca tctgatgtcc ttggctctgg acttggttcc cttatcaatg ccggggctgg
6721 ggccatcaac caaaaagttg agtttgaaaa taacagaaaa ttgcaacaag catccttcca
6781 atttagcagc aatctacaac aggcttcctt tcaacatgac aaagaa
//
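
The ORF coordinates listed for this record can be sanity-checked with a little arithmetic. The following is a minimal sketch, not part of the typing tool: the `ORFS` dictionary and the helper names `coding_length` and `overlap` are illustrative assumptions, and the coordinates are copied by hand from the feature table above (1-based, inclusive, with `/codon_start` as the third value).

```python
# Coordinates copied from the feature table of MK907788 (1-based, inclusive).
# The tuple layout and helper names are illustrative, not part of any tool.
ORFS = {
    "ORF1": (1, 5038, 2),     # 5'-partial CDS, /codon_start=2
    "ORF2": (5019, 6641, 1),  # complete CDS
    "ORF3": (6641, 6826, 1),  # 3'-partial CDS
}


def coding_length(start, end, codon_start):
    """Nucleotides in the reading frame, skipping bases before codon_start."""
    return end - start + 1 - (codon_start - 1)


def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


for name, (start, end, codon_start) in ORFS.items():
    nt = coding_length(start, end, codon_start)
    codons, remainder = divmod(nt, 3)
    print(f"{name}: {nt} nt in frame, {codons} codons, {remainder} nt left over")

print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

Each in-frame length divides evenly by three. ORF1 and ORF2 share positions 5019..5038, and ORF2 and ORF3 share the single position 6641. For the complete ORF2 CDS the codon count includes the terminal stop codon, so the translated VP1 is one residue shorter than the codon count.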