Typing tool
|
Complete norovirus genomes
MK907796 | GII.4 Sydney | ||
---|---|---|---|
GII.P31 |
ORF1: 1..5092 ORF2: 5073..6695 ORF3: 6695..7501LOCUS MK907796 7557 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_032 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK907796 VERSION MK907796.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7557) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7557) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7557 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_032" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="12-May-2014" /note="genotype: GII.4-GII.Pe" gene <1..5092 /gene="ORF1" CDS <1..5092 /gene="ORF1" /codon_start=2 /product="nonstructural polyprotein" /protein_id="QCO93085.1" /translation="ASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPKQP PPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRHPEEAN TAFSVPPLSQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAK VELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLS RRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKP LNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAV ELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKE EANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSA SPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKK IAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLT LNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKR DFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGL LHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDL KQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDRVQSATVQTNNELAGALHHLRCAR IRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETANKDGCLKP KDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYS IEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGS EIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLF ITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVV TLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGC PYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGS APKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGK PPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGE SFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTEKVYGKVKKRLLWGSDLATM IRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQ QRDVLAAALEIMVKFSPEPHLAQTVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQW NSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYG LKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHE DPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMF RWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..982 /gene="ORF1" /product="p48" mat_peptide 983..2080 /gene="ORF1" /product="NTPase" mat_peptide 2081..2617 /gene="ORF1" /product="p22" mat_peptide 2618..3016 /gene="ORF1" /product="VPg" mat_peptide 3017..3559 /gene="ORF1" /product="Pro" mat_peptide 3560..5089 /gene="ORF1" /product="RdRp" gene 5073..6695 /gene="ORF2" CDS 5073..6695 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93086.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEVNQNTKFTPVGVIQDG GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6695..7501 /gene="ORF3" CDS 6695..7501 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93087.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEYENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKPS NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 ggcgtctaac gacgcttccg ctgccgctgt tgccaacagc aacaacgaca tcgcaaaatc 61 ttcaagtgac ggtgtgtttt ctaacatggc tgtcactttt aagcgggccc tcggggcgcg 121 gcctaaacag ccgcccccga aggaaatacc acccagaccc ccgcgaccac ccacaccaga 181 attggtcaaa aagatccctc ctcccccacc caacggggag gatgaactag tggtctctta 241 cagcgccaaa gatggcgttt ccggactgcc tgagctcacc actgtcagac atccggaaga 301 agccaacacg gcgttcagtg tccccccact cagccaaagg gaaagcaggg acgccaagga 361 gccactaact gggacaatca ttgaaatgtg ggatggagaa atctaccatt acggcctgta 421 cgtggaacga ggtcttatac ttggtgtgca caaaccaccg gcagccatta gccttgccaa 481 ggtcgagcta gcaccgctct ctttgttctg gagacccgta tacaccccac agtatctcat 541 ctctccagac actcttagga gattacatgg agagtcgttc ccctacactg catttgacaa 601 caattgctac gccttttgtt gttgggtatt agacctaaac gactcatggc tgagcaggag 661 aatgattcaa agaacaacag gcttcttcag gccgtaccag gattggaaca ggaaacccct 721 ccccactatg gatgattcca aattaaagaa ggtagccaac atattcttgt gcactttgtc 781 ttcactattc accagaccca ttaaggacat aatagggaag ttgaaacctc ttaacatcct 841 taacattctg gctacatgtg attggacctt cgcaggcata gtggaatcct taatactctt 901 ggcagaactc tttggagtct tctggacacc cccagatgtg tctgcgatga tcgccccctt 961 gctaggtgat tatgaactgc aaggacctga ggaccttgca gtggaattgg tcccaatagt 1021 gatggggggg ataggtttgg tgctaggatt taccaaagag aaaatcggaa agatgctatc 1081 atccgccgca tccactttaa gagcttgtaa agaccttggt gcatacggac tggaaatctt 1141 aaaattggtc atgaagtggt tcttcccaaa gaaagaggaa gcaaatgaac tggctatggt 1201 gagatccatc gaggatgcag tactagacct cgaggcaatt gaaaacaacc acatgaccac 1261 cctactcaaa gacaaagaca gcttggcaac ctacatgaga acccttgacc ttgaggagga 1321 gaaagccaga aaactctcaa ccaaatctgc ttcacccgat attgtgggca caatcaactc 1381 tcttctggca agaatcgctg ctgcacgttc cctagtgcat cgggcgaaag aagagctctc 1441 cagcaggccg agacctgtcg ttgtgatgat atcgggaaga ccagggatag ggaaaactca 1501 ccttgccagg gagctggcca agaagatcgc ggcctctctc acaggggacc agcgtgtggg 1561 tcttatccca cgcaatggtg tcgaccactg ggacgcatac aagggcgaaa gagttgtcct 1621 atgggacgac tatggaatga gcaaccccat ccatgatgcc ctcaggttgc aggagcttgc 1681 tgacacttgc cccctcacgc taaattgtga cagaattgag aacaagggga aagtctttga 1741 cagtgatgcc ataattatca ccaccaatct ggccaaccca gcaccactgg attatgtcaa 1801 ctttgaagcg tgctcgagac gcattgattt cctcgtgtac gcagaagccc ctgaggtgga 1861 gaaggcaaag cgcgacttcc caggtcaacc tgacatgtgg aagaacgctt tcagtcctga 1921 cttctcacac ataaaactgt cattggctcc acagggtggt ttcgacaaga acggcaacac 1981 cccgcatgga aaaggggtca tgaagaccct caccactggc tccctcatcg cccgagcatc 2041 agggttactc catgagaggc tagatgaata tgaactgcaa ggcccagccc tcaccacttt 2101 caactttgac cgcaacaaga tacttgcttt tagacagctt gctgctgaaa acaagtatgg 2161 gttgatggac acaatgagag ttggaaaaca gctcaaggat gtcaagacca tgtcagacct 2221 caaacaagca ctcaagaaca tcgcgatcaa gaagtgccag atagtgtaca atggtggcac 2281 ctacacactt gaagctgatg gcaagggtag tgtgaaagtt gacagagtgc aaagtgccac 2341 tgtgcagacc aacaatgaac tagccggtgc cctacaccac ctaaggtgcg ctagaatcag 2401 atactatgtt aagtgcgtcc aggaggcact gtattccatc atccaaatcg ctggggctgc 2461 attcgtcacc acgcgcatcg ctaagcgcat gaatatacaa aatctctggt ccaagccaca 2521 ggtggaagac acagaagaga cggccaacaa agatggttgc ctgaaaccca aagatgatga 2581 agagtttgtc gtctcatccg acgacatcaa aactgagggc aagaaaggga agaacaagtc 2641 cggccgtggc aagaagcaca cggccttttc aagtaaaggg ctcagtgatg aggagtacga 2701 tgagtacaag agaatcagag aagaaaggaa tggtaagtac tccatagaag agtaccttca 2761 ggacagagac aggtactacg aggaggtggc cattgccagg gcaaccgaag aggacttctg 2821 tgaagaagaa gaggccaaaa tccggcagag aatttttagg ccaacaagga aacaacgcaa 2881 agaagaaagg gcctctctcg gcttggtcac aggctctgaa atcaggaaga gaaacccaga 2941 agacttcaaa cccaagggaa agttgtgggc tgatgatgac agaagtgttg actacaatga 3001 gaaactcaac tttgaagccc caccaagcat ctggtcgcgg atagtcaact ttggctcagg 3061 ctggggcttc tgggtctccc ccagtctgtt tataacatca acccatgtca taccccaagg 3121 tgcaaaagag ttcttcggag tccctatcaa gcaaatccag atacacaagt caggtgaatt 3181 ctgccggttg agattcccaa agccaatcag aactgatgtg acgggcatga ttctagaaga 3241 aggtgcgccc gaggggaccg tggtcacact gctcatcaag agaccaactg gagagctcat 3301 gcccctggca gccagaatgg ggacccatgc aaccatgaaa attcaggggc gcacagttgg 3361 agggcaaatg ggtatgctcc tgacagggtc caacgccaag agtatggacc taggcacaac 3421 accaggcgac tgcggctgcc cctacatcta caagaggggg aatgactacg tggtcatagg 3481 agtccatacg gccgctgccc gtggaggaaa cactgtcata tgtgccaccc aggggagtga 3541 gggagaagcc acactcgaag gaggtgacag taaagggaca tactgtggcg caccaatctt 3601 gggcccaggg agcgctccga agctcagcac caagactaag ttttggagat cgtccacaac 3661 accactccca cctggcacct acgaaccagc ctatctcggt ggcaaagacc ctagagtcaa 3721 aggtggccct tcattgcaac aagttatgag ggaccagctg aagccattca cagaacccag 3781 aggtaaacca ccaagaccaa atgtgttgga agctgccaag aaaaccatca tcaatgtcct 3841 tgagcaaaca attgatccac cccaaaaatg gtcatttgcg caagcttgcg catcccttga 3901 caaaaccacc tccagcggcc acccgcacca catgcggaaa aacgactgtt ggaatgggga 3961 gtccttcaca ggaaaattgg ctgatcaagc ctccaaggcc aacctaatgt ttgaagaggg 4021 aaagaacatg actccagtct acacaggtgc acttaaagat gagttggtaa agaccgaaaa 4081 agtttatggt aaggtcaaga agaggcttct gtggggttca gatctggcga ccatgatacg 4141 gtgcgcccga gcttttggag gccttatgga tgaactcaag gcacactgtg tcacacttcc 4201 tgtcagagtt ggtatgaaca tgaatgagga tggccccatc atctttgaga agcactccag 4261 atatagatat cactatgatg ctgattattc ccggtgggac tcaacacagc aaagggatgt 4321 gctagcagca gcactagaaa tcatggttaa gttctctcca gaaccacacc tggcccagac 4381 agttgcagaa gacctccttt cccctagcgt gatggatgta ggtgactttc aaatatcaat 4441 aagtgagggt ctcccctctg gggtaccttg tacctcccag tggaattcca tcgcccactg 4501 gctcctcact ctgtgtgcac tctctgaagt cacggacctg tcccctgata tcattcaggc 4561 caactccctt ttctccttct atggtgatga tgagattgta agcacagaca taaagttgga 4621 cccagagaag ctgacagcaa aacttaagga gtatgggctg aaaccaaccc gccccgacaa 4681 aactgaagga ccccttgtta tctctgaaga cctggatggc ctgacattcc tccggagaac 4741 tgtgacccgt gatccagctg gctggtttgg aaaattggaa caaagttcaa ttctcaggca 4801 aatgtactgg accaggggtc ccaaccatga agatcctttt gaaacaatga taccacactc 4861 ccaaagaccc atacaattga tgtccttgct gggcgaggct gcgctccacg gcccggcatt 4921 ctatagcaaa attagcaaat tagtcattgc agagttgaag gaaggtggca tggattttta 4981 cgtacccaga caagagccaa tgttcagatg gatgagattc tcagatctga gcacgtggga 5041 gggcgatcgc aatctggctc ccagttttgt gaatgaagat ggcgtcgagt gacgccaacc 5101 catctgatgg gtccgcagcc aacctcgtcc cagaggtcaa caatgaggtt atggctctgg 5161 agcccgttgt tggtgccgcc attgcggcac ccgtagcggg ccaacaaaat gtaattgacc 5221 cctggattag aaataatttt gtacaagccc ctggtggaga gtttacagta tcccctagaa 5281 acgctccagg tgaaatacta tggagcgcac ccttgggccc tgatctaaat ccctacctat 5341 cccatttggc cagaatgtac aatggttatg caggtggttt tgaagtgcag gtaattctcg 5401 cggggaacgc gttcaccgcc gggaaggtta tatttgcagc agtcccacca aattttccaa 5461 ctgaaggctt gagccccagc caggttacta tgttccccca tatagtagta gatgttaggc 5521 aactagaacc tgtgttgatt cccttacccg atgttaggaa taatttctat cattataatc 5581 aatcaaatga ccccaccatt aagttgatag caatgttgta tacaccactt agggctaaca 5641 atgctgggga tgatgtcttc acagtttctt gccgagttct cacgagacct tcccccgatt 5701 ttgatttcat atttctagtg ccacccacag ttgagtcaag aactaaacca ttctctgtcc 5761 cagttttaac tgttgaggag atgaccaatt caagattccc cattcctttg gaaaagttgt 5821 tcacgggtcc cagcagtgcc tttgttgtcc aaccacaaaa cggtaggtgc acgactgatg 5881 gcgtgctcct aggcaccacc caactgtctc ccgtcaacat ctgcaccttc agaggagatg 5941 tcacccatat cacaggtagt cataactaca caatgaattt ggcttctcaa aattggagca 6001 attatgaccc aacagaagaa atcccagccc ctctaggaac tccagacttt gtggggaaga 6061 ttcaaggcgt gctcacccaa accacaagga cagatggctc aacacgcggc cacaaagcca 6121 cagtgtacac tgggagcgcc gactttgctc caaaactggg tagagttcaa tttgaaactg 6181 acacaaacca tgattttgaa gttaatcaaa acacaaagtt caccccagtt ggtgtcatcc 6241 aagatggtgg caccacccac cgaaatgaac cccaacagtg ggtgctccca agttactcag 6301 gcagaaatac tcctaatgtg catctggccc ccgctgtagc ccccactttt ccgggtgagc 6361 aacttctctt cttcagatcc accatgcccg gatgcagcgg gtaccccaac atggatttgg 6421 actgtctgct cccccaggaa tgggtgcagt acttctacca agaggcagcc ccagcacaat 6481 ctgatgtggc tctgctaaga tttgtgaatc cagacacagg tagggttttg tttgagtgta 6541 agcttcataa atcaggctat gttacagtgg ctcacactgg ccaacatgat ttggttatcc 6601 cccccaatgg ttattttagg tttgattcct gggtcaacca attttacacg cttgccccca 6661 tgggaaatgg aacggggcgt agacgtgcat tataatggct ggagctttct ttgctggatt 6721 ggcatctgat gtccttggct ctggacttgg ttcccttatc aatgctgggg ctggggccat 6781 caatcaaaaa gttgagtatg aaaataacag aaaattgcaa caagcatcct tccaatttag 6841 cagcaatcta caacaggctt cttttcaaca tgacaaagag atgctccaag cacaaattga 6901 ggccaccaaa aggctacaac aagaaatgat gaaagttaag caggcaatgc tcctagaggg 6961 tgggttctct gagacagatg cagcccgcgg ggcaatcaac gcccccatga caaaagcttt 7021 ggactggagc gggacaaggt actgggctcc cgatgctagg actacaacat acaatgcagg 7081 ccgcttttcc acccctcaac catcgggggc actgccagga agagctaatc ttagggatgc 7141 tgtccctgct cggggttcct ccagcaagcc ttctaattct tctactgcca cttctgtgta 7201 ctcaaatcaa actacttcaa cgagacttgg ttctacagct ggttctggta ccagtgtctc 7261 gagcttcccg tcaactgcaa ggactaggag ctgggttgag gatcaaagta ggaatttgtc 7321 acctttcatg aggggggccc acaacatatc gtttgtcacc ccaccatcta gcagatcctc 7381 tagccaaggc acagtctcaa ccgtgcctaa agagattttg gactcctgga ctggcgcttt 7441 caacacgcgc aggcagccac tcttcgctca cattcgtaag cgaggggagt cacgggcgta 7501 atgtgaaaag acaaaattga ttatctttct tttctttagt gtcttttaaa aaaaaaa //