Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| MK907796 | GII.4 Sydney | GII.P31 |
ORF1: 1..5092
ORF2: 5073..6695
ORF3: 6695..7501
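
The three ORF ranges above are 1-based and inclusive, and adjacent ORFs share bases. As a minimal sketch of how the span lengths and overlaps follow from those coordinates, the snippet below does the arithmetic in plain Python. The names `ORFS`, `span_length`, and `overlap` are illustrative helpers and are not part of the typing tool or of GenBank.

```python
# ORF coordinates copied from the summary above (1-based, inclusive).
# The dictionary and helper names are assumptions made for this sketch.
ORFS = {
    "ORF1": (1, 5092),
    "ORF2": (5073, 6695),
    "ORF3": (6695, 7501),
}


def span_length(start, end):
    """Number of bases covered by an inclusive 1-based range."""
    return end - start + 1


def overlap(a, b):
    """Number of bases shared by two inclusive ranges (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


lengths = {name: span_length(*coords) for name, coords in ORFS.items()}
orf1_orf2_overlap = overlap(ORFS["ORF1"], ORFS["ORF2"])
orf2_orf3_overlap = overlap(ORFS["ORF2"], ORFS["ORF3"])

print(lengths)            # {'ORF1': 5092, 'ORF2': 1623, 'ORF3': 807}
print(orf1_orf2_overlap)  # 20
print(orf2_orf3_overlap)  # 1
```

ORF2 and ORF3 span 1623 and 807 bases, which is 541 and 269 codons including the stop codon. ORF1 is annotated as partial in the record, so its 5092-base span is not a whole number of codons.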
LOCUS MK907796 7557 bp RNA linear VRL 02-NOV-2019
DEFINITION Norovirus GII isolate G19_032 nonstructural polyprotein (ORF1)
gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
cds.
ACCESSION MK907796
VERSION MK907796.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7557)
AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
Guyader,S.
TITLE Optimisation of agnostic metagenomic approaches to characterise
human enteric viruses in sewage
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7557)
AUTHORS Le Guyader,S. and Strubbia,S.
TITLE Direct Submission
JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
44311, France
COMMENT ##Assembly-Data-START##
Assembly Method :: SPAdes v. 3.12.0
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7557
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="G19_032"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="France: Nantes"
/collection_date="12-May-2014"
/note="genotype: GII.4-GII.Pe"
gene <1..5092
/gene="ORF1"
CDS <1..5092
/gene="ORF1"
/codon_start=2
/product="nonstructural polyprotein"
/protein_id="QCO93085.1"
/translation="ASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPKQP
PPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRHPEEAN
TAFSVPPLSQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAK
VELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLS
RRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKP
LNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAV
ELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKE
EANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSA
SPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKK
IAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLT
LNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKR
DFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGL
LHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDL
KQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDRVQSATVQTNNELAGALHHLRCAR
IRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETANKDGCLKP
KDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYS
IEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGS
EIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLF
ITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVV
TLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGC
PYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGS
APKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGK
PPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGE
SFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTEKVYGKVKKRLLWGSDLATM
IRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQ
QRDVLAAALEIMVKFSPEPHLAQTVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQW
NSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYG
LKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHE
DPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMF
RWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide <1..982
/gene="ORF1"
/product="p48"
mat_peptide 983..2080
/gene="ORF1"
/product="NTPase"
mat_peptide 2081..2617
/gene="ORF1"
/product="p22"
mat_peptide 2618..3016
/gene="ORF1"
/product="VPg"
mat_peptide 3017..3559
/gene="ORF1"
/product="Pro"
mat_peptide 3560..5089
/gene="ORF1"
/product="RdRp"
gene 5073..6695
/gene="ORF2"
CDS 5073..6695
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QCO93086.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEVNQNTKFTPVGVIQDG
GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
gene 6695..7501
/gene="ORF3"
CDS 6695..7501
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QCO93087.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEYENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKPS
NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN
1 ggcgtctaac gacgcttccg ctgccgctgt tgccaacagc aacaacgaca tcgcaaaatc
61 ttcaagtgac ggtgtgtttt ctaacatggc tgtcactttt aagcgggccc tcggggcgcg
121 gcctaaacag ccgcccccga aggaaatacc acccagaccc ccgcgaccac ccacaccaga
181 attggtcaaa aagatccctc ctcccccacc caacggggag gatgaactag tggtctctta
241 cagcgccaaa gatggcgttt ccggactgcc tgagctcacc actgtcagac atccggaaga
301 agccaacacg gcgttcagtg tccccccact cagccaaagg gaaagcaggg acgccaagga
361 gccactaact gggacaatca ttgaaatgtg ggatggagaa atctaccatt acggcctgta
421 cgtggaacga ggtcttatac ttggtgtgca caaaccaccg gcagccatta gccttgccaa
481 ggtcgagcta gcaccgctct ctttgttctg gagacccgta tacaccccac agtatctcat
541 ctctccagac actcttagga gattacatgg agagtcgttc ccctacactg catttgacaa
601 caattgctac gccttttgtt gttgggtatt agacctaaac gactcatggc tgagcaggag
661 aatgattcaa agaacaacag gcttcttcag gccgtaccag gattggaaca ggaaacccct
721 ccccactatg gatgattcca aattaaagaa ggtagccaac atattcttgt gcactttgtc
781 ttcactattc accagaccca ttaaggacat aatagggaag ttgaaacctc ttaacatcct
841 taacattctg gctacatgtg attggacctt cgcaggcata gtggaatcct taatactctt
901 ggcagaactc tttggagtct tctggacacc cccagatgtg tctgcgatga tcgccccctt
961 gctaggtgat tatgaactgc aaggacctga ggaccttgca gtggaattgg tcccaatagt
1021 gatggggggg ataggtttgg tgctaggatt taccaaagag aaaatcggaa agatgctatc
1081 atccgccgca tccactttaa gagcttgtaa agaccttggt gcatacggac tggaaatctt
1141 aaaattggtc atgaagtggt tcttcccaaa gaaagaggaa gcaaatgaac tggctatggt
1201 gagatccatc gaggatgcag tactagacct cgaggcaatt gaaaacaacc acatgaccac
1261 cctactcaaa gacaaagaca gcttggcaac ctacatgaga acccttgacc ttgaggagga
1321 gaaagccaga aaactctcaa ccaaatctgc ttcacccgat attgtgggca caatcaactc
1381 tcttctggca agaatcgctg ctgcacgttc cctagtgcat cgggcgaaag aagagctctc
1441 cagcaggccg agacctgtcg ttgtgatgat atcgggaaga ccagggatag ggaaaactca
1501 ccttgccagg gagctggcca agaagatcgc ggcctctctc acaggggacc agcgtgtggg
1561 tcttatccca cgcaatggtg tcgaccactg ggacgcatac aagggcgaaa gagttgtcct
1621 atgggacgac tatggaatga gcaaccccat ccatgatgcc ctcaggttgc aggagcttgc
1681 tgacacttgc cccctcacgc taaattgtga cagaattgag aacaagggga aagtctttga
1741 cagtgatgcc ataattatca ccaccaatct ggccaaccca gcaccactgg attatgtcaa
1801 ctttgaagcg tgctcgagac gcattgattt cctcgtgtac gcagaagccc ctgaggtgga
1861 gaaggcaaag cgcgacttcc caggtcaacc tgacatgtgg aagaacgctt tcagtcctga
1921 cttctcacac ataaaactgt cattggctcc acagggtggt ttcgacaaga acggcaacac
1981 cccgcatgga aaaggggtca tgaagaccct caccactggc tccctcatcg cccgagcatc
2041 agggttactc catgagaggc tagatgaata tgaactgcaa ggcccagccc tcaccacttt
2101 caactttgac cgcaacaaga tacttgcttt tagacagctt gctgctgaaa acaagtatgg
2161 gttgatggac acaatgagag ttggaaaaca gctcaaggat gtcaagacca tgtcagacct
2221 caaacaagca ctcaagaaca tcgcgatcaa gaagtgccag atagtgtaca atggtggcac
2281 ctacacactt gaagctgatg gcaagggtag tgtgaaagtt gacagagtgc aaagtgccac
2341 tgtgcagacc aacaatgaac tagccggtgc cctacaccac ctaaggtgcg ctagaatcag
2401 atactatgtt aagtgcgtcc aggaggcact gtattccatc atccaaatcg ctggggctgc
2461 attcgtcacc acgcgcatcg ctaagcgcat gaatatacaa aatctctggt ccaagccaca
2521 ggtggaagac acagaagaga cggccaacaa agatggttgc ctgaaaccca aagatgatga
2581 agagtttgtc gtctcatccg acgacatcaa aactgagggc aagaaaggga agaacaagtc
2641 cggccgtggc aagaagcaca cggccttttc aagtaaaggg ctcagtgatg aggagtacga
2701 tgagtacaag agaatcagag aagaaaggaa tggtaagtac tccatagaag agtaccttca
2761 ggacagagac aggtactacg aggaggtggc cattgccagg gcaaccgaag aggacttctg
2821 tgaagaagaa gaggccaaaa tccggcagag aatttttagg ccaacaagga aacaacgcaa
2881 agaagaaagg gcctctctcg gcttggtcac aggctctgaa atcaggaaga gaaacccaga
2941 agacttcaaa cccaagggaa agttgtgggc tgatgatgac agaagtgttg actacaatga
3001 gaaactcaac tttgaagccc caccaagcat ctggtcgcgg atagtcaact ttggctcagg
3061 ctggggcttc tgggtctccc ccagtctgtt tataacatca acccatgtca taccccaagg
3121 tgcaaaagag ttcttcggag tccctatcaa gcaaatccag atacacaagt caggtgaatt
3181 ctgccggttg agattcccaa agccaatcag aactgatgtg acgggcatga ttctagaaga
3241 aggtgcgccc gaggggaccg tggtcacact gctcatcaag agaccaactg gagagctcat
3301 gcccctggca gccagaatgg ggacccatgc aaccatgaaa attcaggggc gcacagttgg
3361 agggcaaatg ggtatgctcc tgacagggtc caacgccaag agtatggacc taggcacaac
3421 accaggcgac tgcggctgcc cctacatcta caagaggggg aatgactacg tggtcatagg
3481 agtccatacg gccgctgccc gtggaggaaa cactgtcata tgtgccaccc aggggagtga
3541 gggagaagcc acactcgaag gaggtgacag taaagggaca tactgtggcg caccaatctt
3601 gggcccaggg agcgctccga agctcagcac caagactaag ttttggagat cgtccacaac
3661 accactccca cctggcacct acgaaccagc ctatctcggt ggcaaagacc ctagagtcaa
3721 aggtggccct tcattgcaac aagttatgag ggaccagctg aagccattca cagaacccag
3781 aggtaaacca ccaagaccaa atgtgttgga agctgccaag aaaaccatca tcaatgtcct
3841 tgagcaaaca attgatccac cccaaaaatg gtcatttgcg caagcttgcg catcccttga
3901 caaaaccacc tccagcggcc acccgcacca catgcggaaa aacgactgtt ggaatgggga
3961 gtccttcaca ggaaaattgg ctgatcaagc ctccaaggcc aacctaatgt ttgaagaggg
4021 aaagaacatg actccagtct acacaggtgc acttaaagat gagttggtaa agaccgaaaa
4081 agtttatggt aaggtcaaga agaggcttct gtggggttca gatctggcga ccatgatacg
4141 gtgcgcccga gcttttggag gccttatgga tgaactcaag gcacactgtg tcacacttcc
4201 tgtcagagtt ggtatgaaca tgaatgagga tggccccatc atctttgaga agcactccag
4261 atatagatat cactatgatg ctgattattc ccggtgggac tcaacacagc aaagggatgt
4321 gctagcagca gcactagaaa tcatggttaa gttctctcca gaaccacacc tggcccagac
4381 agttgcagaa gacctccttt cccctagcgt gatggatgta ggtgactttc aaatatcaat
4441 aagtgagggt ctcccctctg gggtaccttg tacctcccag tggaattcca tcgcccactg
4501 gctcctcact ctgtgtgcac tctctgaagt cacggacctg tcccctgata tcattcaggc
4561 caactccctt ttctccttct atggtgatga tgagattgta agcacagaca taaagttgga
4621 cccagagaag ctgacagcaa aacttaagga gtatgggctg aaaccaaccc gccccgacaa
4681 aactgaagga ccccttgtta tctctgaaga cctggatggc ctgacattcc tccggagaac
4741 tgtgacccgt gatccagctg gctggtttgg aaaattggaa caaagttcaa ttctcaggca
4801 aatgtactgg accaggggtc ccaaccatga agatcctttt gaaacaatga taccacactc
4861 ccaaagaccc atacaattga tgtccttgct gggcgaggct gcgctccacg gcccggcatt
4921 ctatagcaaa attagcaaat tagtcattgc agagttgaag gaaggtggca tggattttta
4981 cgtacccaga caagagccaa tgttcagatg gatgagattc tcagatctga gcacgtggga
5041 gggcgatcgc aatctggctc ccagttttgt gaatgaagat ggcgtcgagt gacgccaacc
5101 catctgatgg gtccgcagcc aacctcgtcc cagaggtcaa caatgaggtt atggctctgg
5161 agcccgttgt tggtgccgcc attgcggcac ccgtagcggg ccaacaaaat gtaattgacc
5221 cctggattag aaataatttt gtacaagccc ctggtggaga gtttacagta tcccctagaa
5281 acgctccagg tgaaatacta tggagcgcac ccttgggccc tgatctaaat ccctacctat
5341 cccatttggc cagaatgtac aatggttatg caggtggttt tgaagtgcag gtaattctcg
5401 cggggaacgc gttcaccgcc gggaaggtta tatttgcagc agtcccacca aattttccaa
5461 ctgaaggctt gagccccagc caggttacta tgttccccca tatagtagta gatgttaggc
5521 aactagaacc tgtgttgatt cccttacccg atgttaggaa taatttctat cattataatc
5581 aatcaaatga ccccaccatt aagttgatag caatgttgta tacaccactt agggctaaca
5641 atgctgggga tgatgtcttc acagtttctt gccgagttct cacgagacct tcccccgatt
5701 ttgatttcat atttctagtg ccacccacag ttgagtcaag aactaaacca ttctctgtcc
5761 cagttttaac tgttgaggag atgaccaatt caagattccc cattcctttg gaaaagttgt
5821 tcacgggtcc cagcagtgcc tttgttgtcc aaccacaaaa cggtaggtgc acgactgatg
5881 gcgtgctcct aggcaccacc caactgtctc ccgtcaacat ctgcaccttc agaggagatg
5941 tcacccatat cacaggtagt cataactaca caatgaattt ggcttctcaa aattggagca
6001 attatgaccc aacagaagaa atcccagccc ctctaggaac tccagacttt gtggggaaga
6061 ttcaaggcgt gctcacccaa accacaagga cagatggctc aacacgcggc cacaaagcca
6121 cagtgtacac tgggagcgcc gactttgctc caaaactggg tagagttcaa tttgaaactg
6181 acacaaacca tgattttgaa gttaatcaaa acacaaagtt caccccagtt ggtgtcatcc
6241 aagatggtgg caccacccac cgaaatgaac cccaacagtg ggtgctccca agttactcag
6301 gcagaaatac tcctaatgtg catctggccc ccgctgtagc ccccactttt ccgggtgagc
6361 aacttctctt cttcagatcc accatgcccg gatgcagcgg gtaccccaac atggatttgg
6421 actgtctgct cccccaggaa tgggtgcagt acttctacca agaggcagcc ccagcacaat
6481 ctgatgtggc tctgctaaga tttgtgaatc cagacacagg tagggttttg tttgagtgta
6541 agcttcataa atcaggctat gttacagtgg ctcacactgg ccaacatgat ttggttatcc
6601 cccccaatgg ttattttagg tttgattcct gggtcaacca attttacacg cttgccccca
6661 tgggaaatgg aacggggcgt agacgtgcat tataatggct ggagctttct ttgctggatt
6721 ggcatctgat gtccttggct ctggacttgg ttcccttatc aatgctgggg ctggggccat
6781 caatcaaaaa gttgagtatg aaaataacag aaaattgcaa caagcatcct tccaatttag
6841 cagcaatcta caacaggctt cttttcaaca tgacaaagag atgctccaag cacaaattga
6901 ggccaccaaa aggctacaac aagaaatgat gaaagttaag caggcaatgc tcctagaggg
6961 tgggttctct gagacagatg cagcccgcgg ggcaatcaac gcccccatga caaaagcttt
7021 ggactggagc gggacaaggt actgggctcc cgatgctagg actacaacat acaatgcagg
7081 ccgcttttcc acccctcaac catcgggggc actgccagga agagctaatc ttagggatgc
7141 tgtccctgct cggggttcct ccagcaagcc ttctaattct tctactgcca cttctgtgta
7201 ctcaaatcaa actacttcaa cgagacttgg ttctacagct ggttctggta ccagtgtctc
7261 gagcttcccg tcaactgcaa ggactaggag ctgggttgag gatcaaagta ggaatttgtc
7321 acctttcatg aggggggccc acaacatatc gtttgtcacc ccaccatcta gcagatcctc
7381 tagccaaggc acagtctcaa ccgtgcctaa agagattttg gactcctgga ctggcgcttt
7441 caacacgcgc aggcagccac tcttcgctca cattcgtaag cgaggggagt cacgggcgta
7501 atgtgaaaag acaaaattga ttatctttct tttctttagt gtcttttaaa aaaaaaa
//
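
The ORIGIN block stores the sequence as numbered lines of 60 bases, and the ORF1/ORF2 boundary (5073..5092) falls entirely within the line that starts at position 5041. The sketch below, which assumes the standard genetic code and uses hypothetical helper names (`parse_origin_line`, `translate`, `slice_1based`), parses that single line and translates the start of ORF2 and the end of ORF1 so they can be compared with the `/translation` qualifiers in the record. It is a small illustration only, not a replacement for a full GenBank parser.

```python
# Standard genetic code in the usual compact TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_origin_line(line):
    """Split one ORIGIN line into (1-based start position, uppercase bases)."""
    fields = line.split()
    return int(fields[0]), "".join(fields[1:]).upper()


def translate(seq):
    """Translate complete codons only; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# ORIGIN line copied verbatim from the record above.
origin_line = ("5041 gggcgatcgc aatctggctc ccagttttgt "
               "gaatgaagat ggcgtcgagt gacgccaacc")
line_start, bases = parse_origin_line(origin_line)


def slice_1based(start, end):
    """Return bases start..end (1-based, inclusive) from the parsed line."""
    return bases[start - line_start:end - line_start + 1]


# ORF2 begins at 5073; take its first nine codons.
orf2_peptide = translate(slice_1based(5073, 5099))
# ORF1 ends at 5092; take its last three codons in the ORF1 reading frame.
orf1_tail = translate(slice_1based(5084, 5092))

print(orf2_peptide)  # MKMASSDAN
print(orf1_tail)     # VE*
```

The first result matches the start of the VP1 translation, and the second matches the last two residues of the ORF1 translation followed by the stop codon, which confirms that the two ORFs are read in different frames across their 20-base overlap.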