Complete norovirus genomes
Accession | Genotype | P-type
---|---|---
MK907798 | GII.3 | GII.P21
ORF1: 1..5099
ORF2: 5080..6726
ORF3: 6726..7480

LOCUS       MK907798                7480 bp    RNA     linear   VRL 02-NOV-2019
DEFINITION  Norovirus GII isolate G19_034 nonstructural polyprotein (ORF1) gene,
            partial cds; VP1 (ORF2) gene, complete cds; and VP2 (ORF3) gene,
            partial cds.
ACCESSION   MK907798
VERSION     MK907798.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7480)
  AUTHORS   Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and
            Le Guyader,S.
  TITLE     Optimisation of agnostic metagenomic approaches to characterise
            human enteric viruses in sewage
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7480)
  AUTHORS   Le Guyader,S. and Strubbia,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
            44311, France
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.12.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7480
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="G19_034"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="France: Nantes"
                     /collection_date="10-Jan-2008"
                     /note="genotype: GII.3-GII.P21"
     gene            <1..5099
                     /gene="ORF1"
     CDS             <1..5099
                     /gene="ORF1"
                     /codon_start=3
                     /product="nonstructural polyprotein"
                     /protein_id="QCO93092.1"
                     /translation="KMASNDASAAAAANSNNDNAKSSSDGVLSNMAVTFKRALGARSK
                     QPPPRDKPPKPPRPPTPELVKAIPPPPPNGEDEPIISYNVKGGVSGLPELSTVTQLEE
                     SSTAFSVPPLSQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISL
                     AKVELTPLSLYWRPVYTPQYLISPDTLRKLHGETFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKVKKVANVVLCALSSLFTRPIKDIIGKL
                     KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEKNELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDMEEEKARKLSTK
                     SASPDIVGTINALLSRIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELA
                     KKIASSLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP
                     LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPEVEKA
                     KRDFPGQPDMWKDTFKPDFSHIKLTLAPQGGFDKNGNTPHGKGVMKTLTASSLVARAS
                     GLLHERLDEYELQGPTPTTFNFDRNKVLAFRQLAAENKYGLVDTMRVGSQLKNVKTMT
                     ELKQALKNISVKKCQLVYGGGTYTLESDGKGNVHVEKVNNTSVQTNNELSGALHHLRC
                     ARIRYYVKCVQEALYSILQIAGAAFITTRIAKRMNIQNLWSKPQAEDPEETNNEDGCP
                     KPKNDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGR
                     YSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVT
                     GSEVRKRNPDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPS
                     LFITSTHVIPQGAQEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGT
                     VATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDC
                     GCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGNEGEAMLEGGDNKGTYCGAPILGP
                     GNAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPR
                     GKPPKPSVLEAAKKTIINVLEQTIDPPQKWTYAQACASLDKTTSSGHPYHMRKNDCWN
                     GETFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGTIKKRLLWGSDLS
                     TMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWDS
                     TQQRAVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCTS
                     QWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTTKLKE
                     YGLKPTRPDKTEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTKGPN
                     HEDPFETMIPHSQRPIQLMSLLGEAALHGPSFYSKISKLVISELKEGGMDFYVPRQEP
                     MFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..989
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     990..2087
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2088..2624
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2625..3023
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3024..3566
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3567..5096
                     /gene="ORF1"
                     /product="RdRp"
     gene            5080..6726
                     /gene="ORF2"
     CDS             5080..6726
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QCO93093.1"
                     /translation="MKMASNDAAPSNDGAAGLVPEINNEAMALEPVAGAAIAAPLTGQ
                     QNIIDPWIMNNFVQAPGGEFTVSPRNSPGEVLLNLELGPEINPYLAHLARMYNGYAGG
                     FEVQVVLAGNAFTAGKIIFAAIPPNFPIDNLSAAQITMCPHVIVDVRQLEPVNLPMPD
                     VRNNFFHYNQGSDSRLRLIAMLYTPLRANNSGDDVFTVSCRVLTRPSPDFSFNFLVPP
                     TVESKTKPFSLPILTISEMSNSRFPVPIDSLHTSPTENIVVQCQNGRVTLDGELMGTT
                     QLLPSQICALRGVLTRSTSRASDQADTATPRLFNYYWHIQLDNLNGTPYDPAEDIPGP
                     LGTPDFRGKVFGVASQRNPDATTRAHEAKIDTTSGRFTPKLGSLEISTESDDFDQNKP
                     TRFTPVGIGVDHEADFQQWTLPDYAGQFTHNMNLAPAVAPNFPGEQLLFFRSQLPSSG
                     GRSNGILDCLVPQEWVQHFYQESAPSQSQVALVRYINPDTGRVLFEAKLHKLGFMTIA
                     KNGDSPITVPPNGYFRFESWVNPFYTLAPMGTGNGRRRIQ"
     gene            6726..>7480
                     /gene="ORF3"
     CDS             6726..>7480
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QCO93091.1"
                     /translation="MAGAFIAGLAGDMLTNTVGSLVNAGANAINQTIDFENNKYLQNA
                     SFNHDREMLNAQIEATKRLQADMIAIKQGVLTAGGFSPTDAARGAINAPMTKVLDWNG
                     TRYWAPGATSTTSMSGGFTNQTVHRSTPNFKTNQAPKSTPSSGSSVRSNSTQITSLSS
                     HSSGSSRSSGSTVVSSMPSSNRTRDWVNQQNFNLEPHMPGSLRTAFVTPPSSTASSSG
                     TVSTVPKNVLDSWTSAFNTRRQPLFAHLRRRGES"
ORIGIN
        1 tgaagatggc gtctaacgac gcttccgctg ccgctgctgc caatagcaac aacgacaacg
       61 caaaatcttc aagtgacgga gtattatcta atatggctgt cacttttaaa cgagccctcg
      121 gggcgcggtc taaacagccg cccccgaggg acaaaccacc aaaaccccca agaccaccca
      181 caccagagtt ggttaaggca atcccccctc ccccacccaa cggggaggac gaaccaatca
      241 tttcttacaa cgtcaaaggg ggtgtttctg gtttgcctga actctcaact gtcacccaac
      301 tggaagagag ctctacagca ttcagcgttc cccctcttag tcagagggag aacagagatg
      361 caaaggaacc attgactgga accatcctgg agatgtggga cggggagatc taccattatg
      421 gcttatacgt ggaacgaggg ttagtgctcg gtgtacacaa accaccagca gccatcagcc
      481 ttgctaaggt tgagctgacc cccttgtctt tgtattggag accagtgtac accccacagt
      541 acctcatttc ccctgacact ctcagaaagt tgcatgggga gacgttccct tatacagcct
      601 ttgacaacaa ctgttatgcc ttctgctgtt gggtacttga cctgaacgat tcatggctga
      661 gcagaaggat gatacagagg acaacaggct tcttccgacc ctaccaggac tggaacagga
      721 aacccctccc cacgatggat gactccaaag tgaaaaaggt ggccaatgtt gtcctctgtg
      781 ctctctcatc attgtttaca agaccaatta aggacattat tgggaagttg aaacccctaa
      841 acattctcaa catcctagcc acatgtgatt ggacttttgc aggcatagta gaatccctga
      901 tccttttggc tgaactgttt ggagttttct ggacaccccc agatgtgtct gcaatgatcg
      961 ctcccttact gggtgattat gaactacagg gacccgagga cctcgctgtg gaactcgtac
     1021 ccgtagtaat gggagggata ggtttggtgc tgggattcac caaagagaaa atcggcaaaa
     1081 tgctttcatc tgctgcttca accctgaggg catgcaaaga tctcggtgca tatgggttgg
     1141 agatcctcaa attggttatg aaatggtttt tcccaaagaa agaagagaaa aacgaattgg
     1201 caatggtgag atccatcgag gatgcagtgc tagatcttga ggccattgag aacaaccata
     1261 tgacagctct actcaaggat aaagacagcc ttgcaaccta catgaggact cttgacatgg
     1321 aggaggagaa ggcgaggaaa ctttccacca agtcggcctc gccggatatt gtgggcacga
     1381 taaacgccct actgtcaagg attgcagctg cccggtctct agtgcacagg gctaaggagg
     1441 agctgtcaag caggccccga ccggtcgttg taatgatttc agggagaccg ggtataggga
     1501 aaacccatct agctagagag ttggcaaaga agatcgcctc ctcactcaca ggtgaccaga
     1561 gggtgggtct tatcccacgt aacggggtcg accactggga tgcatacaaa ggcgaaagag
     1621 tcgttctctg ggacgactac gggatgagca accctatcca cgacgccctc agactccagg
     1681 agctcgctga tacctgtcct cttacactca actgtgacag gattgagaac aaaggcaaag
     1741 tctttgacag tgacgccata ataataacca ccaatttggc caacccagcg ccactggatt
     1801 acgtcaactt tgaagcttgc tcaagacgca tagacttcct cgtctatgct gatgcccctg
     1861 aagttgagaa ggctaaacgg gacttcccag gccaaccaga catgtggaaa gacaccttta
     1921 aacccgactt ctcacatata aaactgacat tagccccgca aggaggtttt gacaagaatg
     1981 gtaacactcc tcatgggaag ggtgtcatga agaccctcac tgccagttcc ctcgttgccc
     2041 gagcatcagg gctcctccac gagagattag acgaatatga gctgcagggc ccaactccca
     2101 caacattcaa cttcgaccgc aacaaggtgc ttgcttttag gcagcttgct gctgaaaaca
     2161 agtacggtct tgttgacaca atgagggtcg ggtcacagct caagaatgtc aaaactatga
     2221 cagaactcaa gcaggccctc aagaacatct cagtcaagaa atgtcagctt gtgtatggtg
     2281 ggggcacgta cacacttgaa tctgatggca aaggcaatgt gcatgtcgaa aaggtgaaca
     2341 acaccagtgt gcaaactaac aacgagctct ccggggcttt gcaccatctc aggtgtgcca
     2401 gaatcaggta ctatgttaag tgtgtccagg aagctctcta ctccatcttg caaattgccg
     2461 gggctgcatt cattaccacg cgcattgcaa agcgcatgaa catacaaaac ctctggtcca
     2521 aaccacaagc agaagatccg gaggagacta acaacgagga tggttgtcca aaacctaaaa
     2581 atgatgagga gtttgtcatc tcctctgatg acatcaaaac cgagggtaag aaaggaaaga
     2641 acaagactgg ccgtggtaag aagcacacag ccttttccag caaaggactc agtgatgagg
     2701 agtacgatga gtataagaga attagggaag agagaaatgg caggtactcc atagaggaat
     2761 acctacagga cagagacaaa tactatgagg aggtggccat agccagggca actgaggaag
     2821 acttctgcga agaagaggag gctaaaatcc ggcaaaggat attcagacca acaaggaaac
     2881 aacgcaaaga ggaaagagcc tctcttggtt tggttacagg ttctgaggtc aggaagagaa
     2941 acccagatga cttcaagccc aaagggaaac tatgggccga tgatgacagg agtgttgact
     3001 acaatgagaa acttagtttc gaggctccac caagcatttg gtcacgaata gtcaactttg
     3061 gctcagggtg gggcttttgg gtctcgccca gcctcttcat aacatcaacc catgtcattc
     3121 cccaaggcgc acaggagttc tttggagtgc ccatcaaaca aatacaaatt cacaagtcag
     3181 gtgagttctg ccggcttagg ttcccaaaac caatcaggac agatgtcaca ggcatgatct
     3241 tggaggaagg tgctccagaa ggcactgttg ccacacttct catcaagaga ccaactgggg
     3301 aactcatgcc cttggcagcc agaatgggca cccacgcgac catgaaaatt cagggtcgca
     3361 ctgttggtgg acagatgggc atgctactca cagggtctaa cgctaagagt atggatttgg
     3421 gcacaactcc tggcgattgt ggttgtccct atatctacaa gagagggaac gactacgtgg
     3481 tcattggagt tcacactgcc gccgctcgtg gaggaaacac cgtcatctgt gcaacccaag
     3541 gaaacgaggg tgaggccatg ctagaaggtg gtgacaacaa gggaacttat tgtggagcac
     3601 caatattagg ccctggaaat gcccccaaac tcagcaccaa aaccaaattc tggaggtctt
     3661 ccaccacccc cctgccaccc ggaacctatg agccagctta tctgggtggc aaggacccta
     3721 gagtgaaagg tggcccctca ctgcaacagg ttatgaggga ccaactaaaa ccattcactg
     3781 agcccagagg caaaccaccc aaaccaagtg tgctagaagc cgccaagaag actataatca
     3841 atgtgcttga acaaacaata gacccacctc aaaaatggac atacgcacaa gcgtgtgcat
     3901 cactagacaa gaccacttcc agtggccacc cttaccacat gcggaagaac gattgctgga
     3961 atggggaaac tttcacagga aaactggcag accaagcatc aaaggctaac ctaatgtttg
     4021 aggaaggaaa gaacatgacc ccagtataca caggagctct gaaagatgag ctagtcaaga
     4081 ctgataagat ctatgggaca atcaagaaga gactcctgtg gggttcagac ctatcaacca
     4141 tgatacggtg tgcacgagcc ttcggtgggc taatggacga gctcaaggcc cattgcgtca
     4201 cactaccagt cagggttggt atgaacatga atgaggatgg acccataata tttgagaaac
     4261 attccagata caaataccat tatgatgcag attactcccg ctgggactca acacaacaaa
     4321 gagcagtgct ggctgcagcc ctggaaataa tggtcaaatt ctcaccagaa ccccatctgg
     4381 cccaagtagt tgcagaagac ctcttgtccc ccagtgtgat ggatgtgggt gatttcaaga
     4441 tatcaatcaa cgagggatta ccctctggtg ttccttgcac ctcacaatgg aactccattg
     4501 cccactggct cctcacacta tgtgcactgt ctgaagtcac agacctgtcc cctgacatca
     4561 tccaggcgaa ttccctgttc tccttttatg gtgatgatga aatagtgagc acagacatca
     4621 aactggaccc agagaaattg acaacaaaat tgaaggaata cgggctcaaa ccaacccgtc
     4681 ctgataaaac agaaggaccc ttaattatct ctgaagattt ggatggcctg accttcttac
     4741 ggagaacagt gacccgtgat ccggccgggt ggtttggcaa actggaccaa agttcaatac
     4801 tcaggcagat gtactggacc aagggaccaa accatgagga cccctttgaa acaatgatac
     4861 cacactccca aagacccata caactgatgt cattactggg tgaagctgca ttgcatggtc
     4921 catcgttcta cagtaaaatc agcaaattgg tcatctcaga attgaaagag ggtggaatgg
     4981 atttttacgt gcccagacaa gaaccaatgt tcaggtggat gagattctca gatttgagca
     5041 cgtgggaggg cgatcgcaat ctggctccca gttttgtgaa tgaagatggc gtcgaatgac
     5101 gccgctccat ctaatgatgg tgccgccggc ctcgtcccag agatcaataa tgaggcaatg
     5161 gcgctagagc cagtggcggg tgcagcgata gcagcacccc tcactggtca gcaaaatata
     5221 attgatccct ggattatgaa taattttgtg caagcacctg gtggtgagtt tacagtatcc
     5281 cctagaaatt cccctggtga agttcttctt aatttggaat tgggcccaga aataaatccc
     5341 tatttggccc atcttgctag aatgtataat ggttatgcag gtggatttga agtgcaggtg
     5401 gtcctagctg gaaatgcgtt tacagcagga aagataatct ttgcagctat tccccccaat
     5461 tttccaattg acaatctaag tgcagcacag attacaatgt gcccacatgt gattgtggat
     5521 gtcagacagt tggaaccagt caacctcccg atgcctgacg ttcgtaacaa cttctttcat
     5581 tacaatcaag ggtccgattc gagattgcgc ctaattgcaa tgctatacac acctcttagg
     5641 gcaaataatt ctggggatga tgtttttact gtgtcttgca gagtgctaac tagacctagt
     5701 cctgacttct catttaattt ccttgtgcca cctactgtgg agtcaaagac aaaacccttt
     5761 tccctcccta ttctgactat ctctgaaatg tccaattcta ggttcccagt accaattgat
     5821 tctctgcaca ccagtcctac tgagaatatt gttgtccagt gccagaatgg gcgcgtcacc
     5881 cttgatggtg agttgatggg caccactcaa ctcttaccta gccaaatctg tgctctcagg
     5941 ggcgttctca ccagatcaac aagcagggcc agtgaccagg ccgatacagc aacccctaga
     6001 ttgtttaatt attattggca catacaattg gataatctaa atggaactcc ttatgatcct
     6061 gcagaagaca taccaggccc cctagggaca ccagatttcc ggggcaaagt ctttggcgtg
     6121 gccagccaga gaaatcctga tgccacgact agggcacatg aagcaaagat agacacaaca
     6181 tctggccgtt tcactccaaa gctaggctca ttagagatat ccactgaatc tgatgatttt
     6241 gatcaaaaca aaccaacaag attcacccca gttggcattg gggttgacca tgaggcagac
     6301 ttccaacaat ggactctacc cgactacgct ggccagttca cccacaacat gaacttagcc
     6361 ccagctgttg ctcccaactt ccctggtgag cagctccttt tcttccgctc acagttgcca
     6421 tcttctggtg ggcgatccaa cgggattcta gactgcctgg tcccccaaga atgggtacag
     6481 cacttctacc aagaatcagc cccctctcaa tctcaagtgg ccctggttag gtatatcaac
     6541 cctgacactg gtagagtgtt atttgaggcc aagctgcaca aattaggttt catgactata
     6601 gccaagaatg gtgactctcc aataacagtc cctccaaatg gatactttag gtttgaatct
     6661 tgggtgaacc ccttttacac acttgccccc atgggaactg ggaatgggcg tagaaggatt
     6721 caataatggc tggagctttt atagcaggat tggctggtga catgctcaca aatactgtag
     6781 gatctttagt taatgcaggg gctaatgcca ttaatcaaac aattgatttt gaaaataata
     6841 aatatttgca aaatgcttct ttcaatcatg atagggagat gttgaatgca caaattgagg
     6901 caacaaagag gttacaggct gacatgattg ctatcaaaca aggggttttg accgctggcg
     6961 gcttctcccc tactgatgca gcccgtgggg caattaatgc ccccatgaca aaagtcctag
     7021 attggaatgg aacgagatac tgggcaccag gtgccacctc cacaacctcg atgtcgggtg
     7081 gctttacaaa tcaaactgtg cacagatcca caccaaattt taaaacgaac caggctccca
     7141 aatccacacc cagcagtggg tcttcagtga ggtcaaattc aacccaaatc actagcctga
     7201 gctcacactc gtccgggtcg tctcgatcca gcgggtctac agttgtcagc tcaatgccat
     7261 cctctaacag gactagggac tgggtcaacc aacaaaattt taatttggaa ccacacatgc
     7321 ctggatctct taggacagct tttgtcactc caccatctag tacagcctct agctcaggca
     7381 cagtctcaac tgtgcccaaa aatgttttgg actcctggac atctgcgttt aacacgcgca
     7441 gacagccgct attcgcacac cttcgcagaa ggggggagtc
//
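
The ORF coordinates in this record (ORF1 1..5099, ORF2 5080..6726, ORF3 6726..7480) overlap at both junctions. The following is a minimal Python sketch of the coordinate arithmetic, assuming 1-based inclusive positions as in the feature table. The dictionary layout and the helper names `orf_length`, `overlap` and `reading_frame` are illustrative only and are not part of the typing tool or of any library.

```python
# ORF coordinates copied from the record (1-based, inclusive).
# codon_start values are the /codon_start qualifiers of each CDS feature.
ORFS = {
    "ORF1": {"start": 1, "end": 5099, "codon_start": 3},
    "ORF2": {"start": 5080, "end": 6726, "codon_start": 1},
    "ORF3": {"start": 6726, "end": 7480, "codon_start": 1},
}


def orf_length(orf):
    """Number of nucleotides spanned by the feature."""
    return orf["end"] - orf["start"] + 1


def overlap(a, b):
    """Number of nucleotides shared by two features (0 if disjoint)."""
    shared_start = max(a["start"], b["start"])
    shared_end = min(a["end"], b["end"])
    return max(0, shared_end - shared_start + 1)


def reading_frame(orf):
    """Frame (0, 1 or 2) of the first complete codon, relative to base 1."""
    first_codon_base = orf["start"] + orf["codon_start"] - 1
    return (first_codon_base - 1) % 3


if __name__ == "__main__":
    for name, orf in ORFS.items():
        print(name, orf_length(orf), "nt, frame", reading_frame(orf))
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

Under these assumptions the sketch reports a 20-nt ORF1/ORF2 overlap (positions 5080..5099) and a 1-nt ORF2/ORF3 overlap (position 6726). ORF1 and ORF3 are annotated as partial in the record, so their lengths describe only the sequenced portion.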