![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| MK907785 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P31 |
ORF1: 1..5038
ORF2: 5019..6641
ORF3: 6641..7447
LOCUS MK907785 7497 bp RNA linear VRL 02-NOV-2019
DEFINITION Norovirus GII isolate G19_014 nonstructural polyprotein (ORF1)
gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
cds.
ACCESSION MK907785
VERSION MK907785.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7497)
AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
Guyader,S.
TITLE Optimisation of agnostic metagenomic approaches to characterise
human enteric viruses in sewage
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7497)
AUTHORS Le Guyader,S. and Strubbia,S.
TITLE Direct Submission
JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
44311, France
COMMENT ##Assembly-Data-START##
Assembly Method :: SPAdes v. 3.12.0
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7497
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="G19_014"
/isolation_source="sewage"
/db_xref="taxon:122929"
/geo_loc_name="France: Nantes"
/collection_date="08-Jan-2014"
/note="genotype: GII.4-GII.Pe"
gene <1..5038
/gene="ORF1"
CDS <1..5038
/gene="ORF1"
/codon_start=2
/product="nonstructural polyprotein"
/protein_id="QCO93054.1"
/translation="KSSSDGVLSSMAVTFKRALGARPKQPPPKEIPPRPPRPPTPDLV
KKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPEETNTAFSVPPLNQRESRDAKE
PLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELTPLSLFWRPVYTPQY
LISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWN
RKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIV
ESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTK
EKIGKMLSSAASTLRACKDLGAYGLEILKLIMKWFFPKKEEANELAMVRSIEDAVLDL
EAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINSLLARIAAA
RSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNG
VDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKVFDSDAI
IITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFS
HIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQGPALTTF
NFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNG
GSYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKCVQEALYSIIQI
AGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGCLKPKDDEEFVVSSDDIKTEGK
KGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIA
RATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWA
DDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVP
IKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRPTGELMPLAARM
GTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTA
AARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWRSSTTPL
PPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVL
EQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASKANLMFE
EGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHC
VTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPE
PHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTD
LSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVISED
LDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMS
LLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLA
PSFVNEDGVE"
mat_peptide <1..928
/gene="ORF1"
/product="p48"
mat_peptide 929..2026
/gene="ORF1"
/product="NTPase"
mat_peptide 2027..2563
/gene="ORF1"
/product="p22"
mat_peptide 2564..2962
/gene="ORF1"
/product="VPg"
mat_peptide 2963..3505
/gene="ORF1"
/product="Pro"
mat_peptide 3506..5035
/gene="ORF1"
/product="RdRp"
gene 5019..6641
/gene="ORF2"
CDS 5019..6641
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QCO93055.1"
/translation="MKMASSDVNPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVL"
gene 6641..7447
/gene="ORF3"
CDS 6641..7447
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QCO93056.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPTRGSSSKSS
NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN
1 aaaatcttca agtgacggtg tgctttctag catggctgtc acttttaagc gggccctcgg
61 ggcgcggcct aaacagccgc ccccgaagga gataccaccc agacccccgc gaccacccac
121 gccagacttg gttaaaaaga tccctcctcc cccacccaac ggggaggatg aactagtggt
181 ctcttacagc gccaaagatg gcgtttccgg actgcctgag ctcaccactg tcagacaacc
241 ggaagaaacc aacacggcgt tcagtgtccc cccactcaac caaagggaga gcagggacgc
301 caaggagcca ctaactggaa caattattga aatgtgggat ggagaaatct accattacgg
361 cctgtacgtg gaacgaggtc ttatacttgg tgtgcacaag ccaccggcag ccatcagcct
421 tgccaaggtc gagctaacac cgctctcttt gttctggaga cctgtataca ccccccagta
481 tctcatctct ccagacactc ttaggagatt acatggagag tcattcccct acactgcatt
541 tgacaacaat tgctacgcct tttgttgttg ggtattagac ctaaacgact catggctaag
601 caggagaatg attcagagaa caacaggctt cttcaggccg taccaggatt ggaacaggaa
661 acccctcccc actatggatg attccaaatt aaagaaggta gccaacatat tcttgtgcac
721 tttgtcttca ctattcacca gacccattaa ggacataata gggaagttga aacctcttaa
781 catactcaac attctggcca catgtgattg gaccttcgca ggcatagtgg aatccttaat
841 actcttggca gaactctttg gagttttctg gacaccccca gatgtgtctg cgatgatcgc
901 ccccttgcta ggtgattatg aactgcaagg acctgaggac cttgcagtgg aactggtccc
961 aatagtgatg ggggggatag gtttggtgct aggatttacc aaagagaaaa tcggaaagat
1021 gctatcatcc gctgcatcca ctttaagagc ttgtaaagac cttggtgcat acggactgga
1081 aatcttaaaa ttgatcatga agtggttctt cccaaagaag gaggaagcaa atgaactggc
1141 tatggtgaga tccatcgagg atgcagtact agacctcgag gcaattgaaa acaaccacat
1201 gaccaccctg ctcaaagaca aagacagctt ggcaacctac atgagaaccc ttgaccttga
1261 ggaggagaaa gccagaaaac tctcaaccaa atctgcttca cccgatattg tgggcacaat
1321 caactctctt ctggcaagaa tcgctgctgc acgctcccta gtgcatcggg cgaaagaaga
1381 gctctccagc aggccgagac ctgtcgttgt gatgatatcg ggaagaccag ggatagggaa
1441 aactcacctt gccagggagc tggccaagaa gatcgcggcc tccctcacag gggaccagcg
1501 tgtgggtctt atcccacgca atggtgtcga ccactgggac gcatacaagg gcgaaagagt
1561 tgtcctatgg gacgactatg gaatgagcaa ccccatccat gatgccctca ggctgcagga
1621 gcttgctgac acttgccccc tcacgctaaa ttgtgacaga attgagaaca aagggaaagt
1681 ctttgacagt gatgccataa ttatcaccac caacctggcc aacccagcac cactggatta
1741 tgtcaacttt gaagcgtgct cgagacgcat tgatttcctc gtgtacgcag aagcccctga
1801 ggtggagaag gcaaagcgcg acttcccagg tcaacctgac atgtggaaga acgctttcag
1861 tcctgacttc tcacacataa aactgtcatt ggctccacag ggtggttttg acaagaacgg
1921 caacaccccg catggaaaag gggtcatgaa gaccctcacc actggctccc tcatcgcccg
1981 agcatcaggg ttactccatg agaggctaga tgaatatgaa ctgcaaggcc cagccctcac
2041 cactttcaac tttgaccgca acaagatact tgcttttaga cagcttgctg ctgaaaacaa
2101 gtatgggctg atggacacaa tgagagttgg aaaacagctc aaggatgtca agaccatgtc
2161 agacctcaaa caagcactca agaacatcgc gatcaagaag tgccagatag tgtacaatgg
2221 tggctcctac acacttgagg ctgatggcaa gggtagtgtg aaagttgaca aagtgcaaag
2281 tgccactgtg cagaccaaca atgaactagc cggtgcccta caccacctaa ggtgcgctag
2341 aatcagatac tatgttaagt gcgtccagga ggcgctgtat tccatcatcc aaatcgctgg
2401 ggctgcgttc gtcaccacgc gcatcgctaa gcgcatgaat atacagaatc tctggtccaa
2461 gccacaggtg gaagacacag aagagatggc caacaaagat ggttgcctaa aacccaaaga
2521 tgatgaagag tttgtcgtct catccgacga catcaaaact gagggcaaga aagggaaaaa
2581 caagtccggc cgtggcaaga agcacacagc cttttcaagt aaagggctca gtgatgagga
2641 gtacgatgag tacaagagaa tcagagaaga aaggaatggt aagtactcca tagaagagta
2701 ccttcaggac agagacaggt actacgagga ggtggccatt gccagggcaa ccgaagagga
2761 cttctgtgaa gaagaagagg ccaaaatccg gcagagaatt ttcagaccaa caagaaaaca
2821 acgcaaagaa gagagggcct ctctcggctt ggtcacaggc tctgaaatca ggaagagaaa
2881 cccagaagac ttcaaaccca agggaaagct gtgggctgat gacgacagaa gtgttgacta
2941 taatgagaaa ctcaactttg aggccccacc aagcatctgg tcgcgaatag tcaactttgg
3001 ttcaggctgg ggcttctggg tctcccccag tctgtttata acatcaaccc atgtcatacc
3061 ccaaggtgca aaagagttct tcggagtccc tatcaagcaa atccagatac acaagtcagg
3121 tgaattctgc cggttgagat tcccaaagcc aatcagaact gatgtgacgg gcatgattct
3181 agaagaaggt gcgcccgagg ggaccgtggc cacactgctc atcaagagac caactggaga
3241 gctcatgcct ctggcagcca gaatggggac ccatgcaacc atgaaaattc aggggcgcac
3301 agttggaggg caaatgggta tgctcctgac aggatccaac gccaagagta tggacctagg
3361 cacaacgcca ggcgactgcg gctgccccta catctacaag agggggaatg actacgtggt
3421 cataggagtc catacggccg ctgcccgtgg aggaaacact gtcatatgtg ccacccaggg
3481 gagtgaggga gaagccacac ttgaaggagg tgacagtaaa gggacatact gtggcgcacc
3541 aatcttgggc ccagggagcg ctccgaagct cagtaccaaa actaagtttt ggagatcatc
3601 cacaacacca ctcccacctg gcacctacga accagcctac ctcggtggca aagaccctag
3661 agtcaaaggt ggcccttcat tgcaacaagt tatgagggac cagctgaagc cattcacaga
3721 acccagaggc aaaccaccaa gaccaaatgt gttggaagct gccaagaaaa ccatcatcaa
3781 tgtccttgag caaacaattg atccacccca aaaatggtca tttgcgcaag cttgcgcatc
3841 ccttgacaaa accacctcca gcggccaccc gcaccacatg cggaaaaacg actgttggaa
3901 tggggagtcc ttcacaggaa aattggctga tcaagcctcc aaggccaacc taatgtttga
3961 agagggaaag aacatgactc cagtctacac aggtgcactt aaagatgagt tggtgaagac
4021 cgataaagtt tatggtaagg tcaagaagag gcttctgtgg ggttcagatc tggcgaccat
4081 gatacggtgc gcccgagctt ttggaggcct tatggatgaa ctcaaggcgc actgtgtcac
4141 acttcctgtc agagttggta tgaacatgaa tgaggatggc cccatcatct ttgagaagca
4201 ctccagatat agatatcact atgatgctga ttattcccgg tgggactcaa cacaacaaag
4261 ggatgtgcta gcagcagcac tagaaatcat ggttaagttc tctccagaac cacacctggc
4321 ccagatagtt gcagaagacc tcctttcccc tagcgtgatg gatgtaggtg actttcaaat
4381 atcaataagt gagggtctcc cctctggggt accttgtacc tcccagtgga attccatcgc
4441 ccactggctc ctcaccctgt gtgcactctc tgaagtcacg gacctgtccc ccgatatcat
4501 tcaggccaac tcccttttct ccttctatgg tgatgatgag attgtaagca cagacataaa
4561 attggaccca gagaagctga cagcaaagct caaggagtac gggctgaaac caacccgccc
4621 cgacaaaact gaaggacccc ttgttatctc tgaagacctg gatggcctga cattcctccg
4681 gagaactgtg acccgtgatc cagctggctg gtttggaaaa ttggaacaaa gttcaattct
4741 caggcaaatg tactggacca ggggtcccaa ccatgaagac ccatttgaaa caatgatacc
4801 acactcccaa agacccatac aattgatgtc cttgctgggc gaggctgcgc tccacggccc
4861 ggcattctat agcaaaatta gcaaattagt cattgcagag ttgaaggaag gtggcatgga
4921 tttttacgta cccagacaag agccaatgtt cagatggatg aggttctcag atctgagcac
4981 gtgggagggc gatcgcaatc tggctcccag ttttgtgaat gaagatggcg tcgagtgacg
5041 tcaacccatc tgatgggtcc gcagccaacc tcgtcccaga ggtcaacaat gaggttatgg
5101 ctctggagcc cgttgttggt gccgccattg cggcacctgt agcgggccaa caaaatgtaa
5161 ttgacccctg gattagaaat aattttgtac aagcccctgg tggagagttt acagtatccc
5221 ctagaaacgc tccaggtgaa atactatgga gcgcgccctt gggccctgat ctaaacccct
5281 acctatccca tttggccaga atgtacaatg gttatgcagg tggttttgaa gtgcaggtaa
5341 ttctcgcggg gaacgcgttc accgccggga aggtcatatt tgcagcagtc ccaccaaatt
5401 ttccaactga aggcttgagc cccagccagg tcactatgtt cccccatata gtagtagatg
5461 ttaggcaact agaacctgtg ttgattccct tacccgatgt taggaataat ttctatcatt
5521 acaatcaatc aaatgacccc accattaagt tgatagcaat gttgtacaca ccacttaggg
5581 ctaataatgc tggggatgat gtcttcacag tttcttgccg agttctcacg agaccatccc
5641 ccgattttga tttcatattt ctagtgccac ccacagttga gtcaagaact aaaccattct
5701 ctgtcccagt tttaactgtt gaggagatga ccaattcaag attccccatt cctttggaaa
5761 agttgttcac gggtcccagc agtgcctttg ttgtccaacc acaaaacggt aggtgcacga
5821 ctgatggcgt gctcctaggc accacccaac tgtctcctgt caacatctgc accttcagag
5881 gagatgtcac ccatatcaca ggtagtcgta actacacaat gaatttggct tctcaaaatt
5941 ggaacaacta tgacccaaca gaagaaatcc cagcccctct aggaactcca gactttgtgg
6001 ggaagattca aggcgtgctc acccaaacca caaggacaga tggctcaaca cgcggccaca
6061 aagccacagt gtacactggg agcgccgact ttgctccaaa actgggtaga gttcaatttg
6121 aaactgacac agaccatgat tttgaagcta accaaaacac aaagttcacc ccagttggtg
6181 tcatccaaga tggtagcacc acccaccgaa atgaacccca acagtgggtg ctcccaagtt
6241 actcaggcag aaatactcct aatgtgcatc tggcccccgc tgtggccccc acttttccgg
6301 gcgagcaact tctcttcttc agatccacca tgcccggatg cagcgggtac cccaacatgg
6361 atttggactg tctgctcccc caggaatggg tgcagtactt ctaccaagag gcagccccag
6421 cacaatctga tgtggctctg ctaagatttg tgaatccaga cacaggtagg gttttgtttg
6481 agtgtaagct tcataaatca ggttatgtta cagtggctca cactggccaa catgatttgg
6541 ttatcccccc caatggttat tttaggtttg attcctgggt caaccagttt tacacgcttg
6601 cccccatggg aaatggaacg gggcgtagac gtgtactata atggctggag ctttctttgc
6661 tggattggca tctgatgtcc ttggctctgg acttggatcc cttatcaatg ctggggctgg
6721 ggccatcaac caaaaagttg agtttgaaaa taacagaaaa ttgcaacaag catccttcca
6781 atttagcagc aatctacaac aggcttcctt tcaacatgac aaagagatgc tccaagcaca
6841 aattgaggcc accaaaaggc tacaacagga aatgatgaaa gttaagcagg caatgctcct
6901 agagggtggg ttctctgaga cagatgcagc ccgcggggca atcaacgccc ccatgacaaa
6961 agctttggac tggagcggga caaggtactg ggctcccgat gctaggacta caacatacaa
7021 tgcaggccgc ttttccaccc ctcaaccatc gggggcactg ccaggaagag ctaatcttag
7081 ggatgctgtc cctactcggg gttcctccag taagtcttct aattcttcta ctgctacttc
7141 tgtgtactca aatcaaacca cttcaacgag acttggttct acagctggtt ctggtaccag
7201 tgtctcgagc ttcccgtcaa ctgcaaggac taggagctgg gttgaggatc aaagtaggaa
7261 tttgtcacct ttcatgaggg gggcccacaa catatcgttt gtcaccccac catctagcag
7321 atcctctagc caaggcacag tctcaaccgt gcctaaagag attttggact cctggactgg
7381 cgctttcaac acgcgcaggc agccactctt cgctcacatt cgtaagcgag gggagtcacg
7441 ggcgtaatga gaaaagacaa aattgattat ctttcttttc tttagtgtct tttaaaa
//