Typing tool
Complete norovirus genomes
| Accession | Genotype | P-type |
|---|---|---|
| MK907797 | GII.4 Sydney | GII.P31 |

ORF1: 1..4677
ORF2: 4658..6280
ORF3: 6280..6976
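
The ORF coordinates listed above are 1-based and inclusive, so lengths, overlaps, and reading frames can be derived from them directly. The following is a minimal sketch of that arithmetic, not part of the record itself; the helper names (`orf_length`, `overlap`, `frame`) are hypothetical and chosen for illustration. Note that the record marks ORF1 as 5'-partial and ORF3 as 3'-partial, so their lengths describe only the sequenced portion.

```python
from itertools import combinations

# ORF coordinates (1-based, inclusive) copied from the listing above.
ORFS = {
    "ORF1": (1, 4677),
    "ORF2": (4658, 6280),
    "ORF3": (6280, 6976),
}


def orf_length(start, end):
    """Number of nucleotides in an inclusive 1-based range."""
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def frame(start):
    """Reading frame (0, 1 or 2) of a start position, relative to position 1."""
    return (start - 1) % 3


if __name__ == "__main__":
    for name, (start, end) in ORFS.items():
        print(name, "length:", orf_length(start, end), "frame:", frame(start))
    for (n1, r1), (n2, r2) in combinations(ORFS.items(), 2):
        print(n1, "/", n2, "overlap:", overlap(r1, r2), "nt")
```

With these coordinates, ORF1 and ORF2 share 20 nucleotides (4658..4677) and ORF2 and ORF3 share a single nucleotide (6280). ORF2 starts in a different frame from ORF1 and ORF3.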
LOCUS MK907797 6976 bp RNA linear VRL 02-NOV-2019
DEFINITION Norovirus GII isolate G19_033 nonstructural polyprotein (ORF1)
gene, partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3)
gene, partial cds.
ACCESSION MK907797
VERSION MK907797.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 6976)
AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
Guyader,S.
TITLE Optimisation of agnostic metagenomic approaches to characterise
human enteric viruses in sewage
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 6976)
AUTHORS Le Guyader,S. and Strubbia,S.
TITLE Direct Submission
JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
44311, France
COMMENT ##Assembly-Data-START##
Assembly Method :: SPAdes v. 3.12.0
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..6976
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="G19_033"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="France: Nantes"
/collection_date="13-May-2014"
/note="genotype: GII.4-GII.Pe"
gene <1..4677
/gene="ORF1"
CDS <1..4677
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="QCO93088.1"
/translation="LHVERGLILGVHKPPAAISLAKVELAPLSLFWRPVYTPQYLISP
DTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWNRKPL
PTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIVESLI
LLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTKEKIG
KMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELAMVRSIEDAVLDLEAIE
NNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGTINSLLARIAAARSLV
HRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNGVDHW
DAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIENKGKVFDSDAIIITT
NLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFSHIKL
SLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLLHERLDEYELQGPALTTFNFDR
NKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNGGTYT
LEADGKGSVKVDRVQSATVQTNNELAGALHHLRCARIRYYVKCVQEALYSIIQIAGAA
FVTTRIAKRMNIQNLWSKPQVEDTEETANKDGCLKPKDDEEFVVSSDDIKTEGKKGKN
KSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIARATE
EDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWADDDR
SVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVPIKQI
QIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTLLIKRPTGELMPLAARMGTHA
TMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTAAARG
GNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSAPKLSTKTKFWRSSTTPLPPGT
YEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVLEQTI
DPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKLADQASKANLMFEEGKN
MTPVYTGALKDELVKTEKVYGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHCVTLP
VRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPEPHLA
QTVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTDLSPD
IIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVISEDLDGL
TFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMSLLGE
AALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFV
NEDGVE"
mat_peptide <1..567
/gene="ORF1"
/product="p48"
mat_peptide 568..1665
/gene="ORF1"
/product="NTPase"
mat_peptide 1666..2202
/gene="ORF1"
/product="p22"
mat_peptide 2203..2601
/gene="ORF1"
/product="VPg"
mat_peptide 2602..3144
/gene="ORF1"
/product="Pro"
mat_peptide 3145..4674
/gene="ORF1"
/product="RdRp"
gene 4658..6280
/gene="ORF2"
CDS 4658..6280
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QCO93089.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPCQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEVNQNTKFTPVGVIQDG
GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6280..>6976
/gene="ORF3"
CDS 6280..>6976
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QCO93090.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEYENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKPS
NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQG"
ORIGIN
1 ctgcacgtgg aacgaggtct tatacttggt gtgcacaaac caccggcagc cattagcctt
61 gccaaggtcg agctagcacc gctctctttg ttctggagac ccgtatacac cccacagtat
121 ctcatctctc cagacactct taggagatta catggagagt cgttccccta cactgcattc
181 gacaacaatt gctacgcctt ttgttgttgg gtattagacc taaacgactc atggctgagc
241 aggagaatga ttcaaagaac aacaggcttc ttcaggccgt accaggattg gaacaggaaa
301 cccctcccca ctatggatga ttccaaatta aagaaggtag ccaacatatt cttgtgcact
361 ttgtcttcac tattcaccag acccattaag gacataatag ggaagttgaa acctcttaac
421 atccttaaca ttctggctac atgtgattgg accttcgcag gcatagtgga atccttaata
481 ctcttggcag aactctttgg agtcttctgg acacccccag atgtgtctgc gatgatcgcc
541 cccttgctag gtgattatga actgcaagga cctgaggacc ttgcagtgga attggtccca
601 atagtgatgg gggggatagg tttggtgcta ggatttacca aagagaaaat cggaaagatg
661 ctatcatccg ccgcatccac tttaagagct tgtaaagacc ttggtgcata cggactggaa
721 atcttaaaat tggtcatgaa gtggttcttc ccaaagaaag aggaagcaaa tgaactggct
781 atggtgagat ccatcgagga tgcagtacta gacctcgagg caattgaaaa caaccacatg
841 accaccctac tcaaagacaa agacagcttg gcaacctaca tgagaaccct tgaccttgag
901 gaggagaaag ccagaaaact ctcaaccaaa tctgcttcac ccgatattgt gggcacaatc
961 aactctcttc tggcaagaat cgctgctgca cgttccctag tgcatcgggc gaaagaagag
1021 ctctccagca ggccgagacc tgtcgttgtg atgatatcgg gaagaccagg gatagggaaa
1081 actcaccttg ccagggagct ggccaagaag atcgcggcct ctctcacagg ggaccagcgt
1141 gtgggtctta tcccacgcaa tggtgtcgac cactgggacg catacaaggg cgaaagagtt
1201 gtcctatggg acgactatgg aatgagcaac cccatccatg atgccctcag gttgcaggag
1261 cttgctgaca cttgccccct cacgctaaat tgtgacagaa ttgagaacaa ggggaaagtc
1321 tttgacagtg atgccataat tatcaccacc aatctggcca acccagcacc actggattat
1381 gtcaactttg aagcgtgctc gagacgcatt gatttcctcg tgtacgcaga agcccctgag
1441 gtggagaagg caaagcgcga cttcccaggt caacctgaca tgtggaagaa cgctttcagt
1501 cctgacttct cacacataaa actgtcattg gctccacagg gtggtttcga caagaacggc
1561 aacaccccgc atggaaaagg ggtcatgaag accctcacca ctggctccct catcgcccga
1621 gcatcagggt tactccatga gaggctagat gaatatgaac tgcaaggccc agccctcacc
1681 actttcaact ttgaccgcaa caagatactt gcttttagac agcttgctgc tgaaaacaag
1741 tatgggttga tggacacaat gagagttgga aaacagctca aggatgtcaa gaccatgtca
1801 gacctcaaac aagcactcaa gaacatcgcg atcaagaagt gccagatagt gtacaatggt
1861 ggcacctaca cacttgaagc tgatggcaag ggtagtgtga aagttgacag agtgcaaagt
1921 gccactgtgc agaccaacaa tgaactagcc ggtgccctac accacctaag gtgcgctaga
1981 atcagatact atgttaagtg cgtccaggag gcactgtatt ccatcatcca aatcgctggg
2041 gctgcattcg tcaccacgcg catcgctaag cgcatgaata tacaaaatct ctggtccaag
2101 ccacaggtgg aagacacaga agagacggcc aacaaagatg gttgcctgaa acccaaagat
2161 gatgaagagt ttgtcgtctc atccgacgac atcaaaactg agggcaagaa agggaagaac
2221 aagtccggcc gtggcaagaa gcacacggcc ttttcaagta aagggctcag tgatgaggag
2281 tacgatgagt acaagagaat cagagaagaa aggaatggta agtactccat agaagagtac
2341 cttcaggaca gagacaggta ctacgaggag gtggccattg ccagggcaac cgaagaggac
2401 ttctgtgaag aagaagaggc caaaatccgg cagagaattt ttaggccaac aaggaaacaa
2461 cgcaaagaag aaagggcctc tctcggcttg gtcacaggct ctgaaatcag gaagagaaac
2521 ccagaagact tcaaacccaa gggaaagttg tgggctgatg atgacagaag tgttgactac
2581 aatgagaaac tcaactttga agccccacca agcatctggt cgcggatagt caactttggc
2641 tcaggctggg gcttctgggt ctcccccagt ctgtttataa catcaaccca tgtcataccc
2701 caaggtgcaa aagagttctt cggagtccct atcaagcaaa tccagataca caagtcaggt
2761 gaattctgcc ggttgagatt cccaaagcca atcagaactg atgtgacggg catgattcta
2821 gaagaaggtg cgcccgaggg gaccgtggtc acactgctca tcaagagacc aactggagag
2881 ctcatgcccc tggcagccag aatggggacc catgcaacca tgaaaattca ggggcgcaca
2941 gttggagggc aaatgggtat gctcctgaca gggtccaacg ccaagagtat ggacctaggc
3001 acaacaccag gcgactgcgg ctgcccctac atctacaaga gggggaatga ctacgtggtc
3061 ataggagtcc atacggccgc tgcccgtgga ggaaacactg tcatatgtgc cacccagggg
3121 agtgagggag aagccacact cgaaggaggt gacagtaaag ggacatactg tggcgcacca
3181 atcttgggcc cagggagcgc tccgaagctc agcaccaaga ctaagttttg gagatcgtcc
3241 acaacaccac tcccacctgg cacctacgaa ccagcctatc tcggtggcaa agaccctaga
3301 gtcaaaggtg gcccttcatt gcaacaagtt atgagggacc agctgaagcc attcacagaa
3361 cccagaggta aaccaccaag accaaatgtg ttggaagctg ccaagaaaac catcatcaat
3421 gtccttgagc aaacaattga tccaccccaa aaatggtcat ttgcgcaagc ttgcgcatcc
3481 cttgacaaaa ccacctccag cggccacccg caccacatgc ggaaaaacga ctgttggaat
3541 ggggagtcct tcacaggaaa attggctgat caagcctcca aggccaacct aatgtttgaa
3601 gagggaaaga acatgactcc agtctacaca ggtgcactta aagatgagtt ggtaaagacc
3661 gaaaaagttt atggtaaggt caagaagagg cttctgtggg gttcagatct ggcgaccatg
3721 atacggtgcg cccgagcttt tggaggcctt atggatgaac tcaaggcaca ctgtgtcaca
3781 cttcctgtca gagttggtat gaacatgaat gaggatggcc ccatcatctt tgagaagcac
3841 tccagatata gatatcacta tgatgctgat tattcccggt gggactcaac acagcaaagg
3901 gatgtgctag cagcagcact agaaatcatg gttaagttct ctccagaacc acacctggcc
3961 cagacagttg cagaagacct cctttcccct agcgtgatgg atgtaggtga ctttcaaata
4021 tcaataagtg agggtctccc ctctggggta ccttgtacct cccagtggaa ttccatcgcc
4081 cactggctcc tcactctgtg tgcactctct gaagtcacgg acctgtcccc tgatatcatt
4141 caggccaact cccttttctc cttctatggt gatgatgaga ttgtaagcac agacataaag
4201 ttggacccag agaagctgac agcaaaactt aaggagtatg ggctgaaacc aacccgcccc
4261 gacaaaactg aaggacccct tgttatctct gaagacctgg atggcctgac attcctccgg
4321 agaactgtga cccgtgatcc agctggctgg tttggaaaat tggaacaaag ttcaattctc
4381 aggcaaatgt actggaccag gggtcccaac catgaagatc cttttgaaac aatgatacca
4441 cactcccaaa gacccataca attgatgtcc ttgctgggcg aggctgcgct ccacggcccg
4501 gcattctata gcaaaattag caaattagtc attgcagagt tgaaggaagg tggcatggat
4561 ttttacgtac ccagacaaga gccaatgttc agatggatga gattctcaga tctgagcacg
4621 tgggagggcg atcgcaatct ggctcccagt tttgtgaatg aagatggcgt cgagtgacgc
4681 caacccatct gatgggtccg cagccaacct cgtcccagag gtcaacaatg aggttatggc
4741 tctggagccc gttgttggtg ccgccattgc ggcacccgta gcgggccaac aaaatgtaat
4801 tgacccctgg attagaaata attttgtaca agcccctggt ggagagttta cagtatcccc
4861 tagaaacgct ccaggtgaaa tactatggag cgcacccttg ggccctgatc taaatcccta
4921 cctatcccat ttggccagaa tgtacaatgg ttatgcaggt ggttttgaag tgcaggtaat
4981 tctcgcgggg aacgcgttca ccgccgggaa ggttatattt gcagcagtcc caccaaattt
5041 tccaactgaa ggcttgagcc cctgccaggt tactatgttc ccccatatag tagtagatgt
5101 taggcaacta gaacctgtgt tgattccctt acccgatgtt aggaataatt tctatcatta
5161 caatcaatca aatgacccca ccattaagtt gatagcaatg ttgtatacac cacttagggc
5221 taacaatgct ggggatgatg tcttcacagt ttcttgccga gttctcacga gaccttcccc
5281 cgattttgat ttcatatttc tagtgccacc cacagttgag tcaagaacta aaccattctc
5341 tgtcccagtt ttaactgttg aggagatgac caattcaaga ttccccattc ctttggaaaa
5401 gttgttcacg ggtcccagca gtgcctttgt tgtccaacca caaaacggta ggtgcacgac
5461 tgatggcgtg ctcctaggca ccacccaact gtctcccgtc aacatctgca ccttcagagg
5521 agatgtcacc catatcacag gtagtcataa ctacacaatg aatttggctt ctcaaaattg
5581 gagcaattat gacccaacag aagaaatccc agcccctcta ggaactccag actttgtggg
5641 gaagattcaa ggcgtgctca cccaaaccac aaggacagat ggctcaacac gcggccacaa
5701 agccacagtg tacactggga gcgccgactt tgctccaaaa ctgggtagag ttcaatttga
5761 aactgacaca aaccatgatt ttgaagttaa tcaaaacaca aagttcaccc cagttggtgt
5821 catccaagat ggtggcacca cccaccgaaa tgaaccccaa cagtgggtgc tcccaagtta
5881 ctcaggcaga aatactccta atgtgcatct ggcccccgct gtagccccca cttttccggg
5941 tgagcaactt ctcttcttca gatccaccat gcccggatgc agcgggtacc ccaacatgga
6001 tttggactgt ctgctccccc aggaatgggt gcagtacttc taccaagagg cagccccagc
6061 acaatctgat gtggctctgc taagatttgt gaatccagac acaggtaggg ttttgtttga
6121 gtgtaagctt cataaatcag gctatgttac agtggctcac actggccaac atgatttggt
6181 tatccccccc aatggttatt ttaggtttga ttcctgggtc aaccaatttt acacgcttgc
6241 ccccatggga aatggaacgg ggcgtagacg tgcattataa tggctggagc tttctttgct
6301 ggattggcat ctgatgtcct tggctctgga cttggttccc ttatcaatgc tggggctggg
6361 gccatcaatc aaaaagttga gtatgaaaat aacagaaaat tgcaacaagc atccttccaa
6421 tttagcagca atctacaaca ggcttctttt caacatgaca aagagatgct ccaagcacaa
6481 attgaggcca ccaaaaggct acaacaagaa atgatgaaag ttaagcaggc aatgctccta
6541 gagggtgggt tctctgagac agatgcagcc cgcggggcaa tcaacgcccc catgacaaaa
6601 gctttggact ggagcgggac aaggtactgg gctcccgatg ctaggactac aacatacaat
6661 gcaggccgct tttccacccc tcaaccatcg ggggcactgc caggaagagc taatcttagg
6721 gatgctgtcc ctgctcgggg ttcctccagc aagccttcta attcttctac tgccacttct
6781 gtgtactcaa atcaaactac ttcaacgaga cttggttcta cagctggttc tggtaccagt
6841 gtctcgagct tcccgtcaac tgcaaggact aggagctggg ttgaggatca aagtaggaat
6901 ttgtcacctt tcatgagggg ggcccacaac atatcgtttg tcaccccacc atctagcaga
6961 tcctctagcc aaggga
//