Typing tool
|
Complete norovirus genomes
MK907799 | GII.4 Sydney | ||
---|---|---|---|
GII.P4 New Orleans |
ORF1: 1..4899 ORF2: 4880..6502 ORF3: 6502..7308LOCUS MK907799 7390 bp RNA linear VRL 02-NOV-2019 DEFINITION Norovirus GII isolate G19_035 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION MK907799 VERSION MK907799.1 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7390) AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le Guyader,S. TITLE Optimisation of agnostic metagenomic approaches to characterise human enteric viruses in sewage JOURNAL Unpublished REFERENCE 2 (bases 1 to 7390) AUTHORS Le Guyader,S. and Strubbia,S. TITLE Direct Submission JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES 44311, France COMMENT ##Assembly-Data-START## Assembly Method :: SPAdes v. 3.12.0 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7390 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="G19_035" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="France: Nantes" /collection_date="01-Oct-2008" /note="genotype: GII.4-GII.P4_New Orleans 2009" gene <1..4899 /gene="ORF1" CDS <1..4899 /gene="ORF1" /codon_start=1 /product="nonstructural polyprotein" /protein_id="QCO93096.1" /translation="IPPPPPNGEDEIVVSYSVKDGVSGLPDLSTVRQPEESNTAFSVP PLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELAPL SLYWRPVYTPQYLISPDTLKKLSGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQR TTGFFRPYQDWNRKPLPTMDDSKIKKVANIFLCALSSLFTRPIKDIIGKIRPLNILNI LASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVV MGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELA IVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVG TINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREVAKRIAASLT GDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRI ENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPDVEKAKRDFPGQP DMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKSLTTGSLIARASGLLHERLD EFELQGPALTTFNFDRNKVLAFRQLAAENKYGLLDTMRVGKQLKDVKTMPELKQALKN VSIKKCQIVYSGCTYMLDSDGKGNVKVDRIQSATVQTNNELVGALHHLRCARIRYYVK CVQEALYSIIQIAGAAFVTTRIAKRMNIQDLWSKPQVENTEETTSKDGCPRPKDDEEF VISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQ DRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERVSLGLVTGSEIRKRN PDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHV IPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTLLIKR STGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKR GNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILGPGSAPTLST KTKFWRSSTASLPPGTYEPAYLGGKDPRVKGGPSLQQVMREQLKPFTEPRGKPPKPSV LEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKL ADQASKANLMFEEGKNMTPVYTAALKDELVKTDKIYGKIKKRLLWGSDLATMIRCARA FGGLMDELKAHCVTLPIRVGMNMNEDGPIIFERHSRYTYHYDADYSRWDSTQQRAVLA AALEIMVKFSPEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCTSQWNSIAHW LLTLCALSEVTNLSPDTIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLREYGLKPTRP DKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHGDPSETM IPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFS DLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..789 /gene="ORF1" /product="p48" mat_peptide 790..1887 /gene="ORF1" /product="NTPase" mat_peptide 1888..2424 /gene="ORF1" /product="p22" mat_peptide 2425..2823 /gene="ORF1" /product="VPg" mat_peptide 2824..3366 /gene="ORF1" /product="Pro" mat_peptide 3367..4896 /gene="ORF1" /product="RdRp" gene 4880..6502 /gene="ORF2" CDS 4880..6502 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QCO93094.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGAPDFVGKIQGML TQTTRADGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRVV" gene 6502..7308 /gene="ORF3" CDS 6502..7308 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QCO93095.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVLARGSSSKSY NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPLMRGAHNI SFVTPPSSRSSSQGTVSTVPKEVLNSWTGAFNTRRQPLFAHIRKRGESRV" ORIGIN 1 attccccccc ccccacccaa cggagaggat gaaatagtgg tctcttatag tgtcaaagat 61 ggtgtttccg gcttgcctga cctttccacc gtcaggcagc cggaagaatc taacacggcc 121 ttcagtgtcc ctccactcaa ccagagggag aatagagatg ctaaggaacc actcactgga 181 acaattctgg aaatgtggga cggggaaatc taccattatg gcctgtatgt ggagcgaggt 241 cttgtactag gcgtgcacaa accgccagct gccattagcc tcgctaaggt tgagttagca 301 ccactctcat tgtactggag acctgtgtac actcctcagt acctcatctc tccagacact 361 ctcaagaaat tgtccggaga aacgttcccc tacacagcct ttgacaacaa ctgctatgcc 421 ttttgttgct gggtcctgga cctaaatgac tcgtggctga gcaggagaat gatccagaga 481 acaactggtt tcttcaggcc ctaccaagac tggaatagga aaccccttcc cactatggat 541 gactccaaaa taaagaaggt ggccaacata tttctgtgtg ctctgtcctc gctattcacc 601 aggcccataa aagatataat agggaagata aggcctctta acatcctcaa catcttagcc 661 tcatgtgatt ggacctttgc gggtatagtg gagtccctga tactcttggc agaactcttt 721 ggagttttct ggacaccccc agatgtgtct gcgatgattg cccccttact tggtgattac 781 gagctacaag ggcctgagga ccttgcagtg gagctcgtcc ccgtggtgat ggggggaatt 841 ggtttggtgc taggattcac caaagagaag attgggaaga tgttgtcatc agctgcgtcc 901 accttaagag cttgcaaaga ccttggtgca tatgggctag agatcctaaa gttagtcatg 961 aagtggttct tcccgaagaa ggaggaggcg aatgagctgg ctatagtgag gtccatcgag 1021 gatgcagtcc tggatctcga agcaattgaa aacaatcata tgaccacctt gcttaaagat 1081 aaagacagtc tggcaaccta catgagaaca cttgaccttg aagaggagaa agccaggaaa 1141 ctctcaacca aatctgcctc acccgacatc gtgggcacaa tcaacgctct cctggcgaga 1201 atcgctgccg cacgttctct ggtgcatcga gcgaaggagg agctttccag cagaccaaga 1261 cctgtggtgt tgatgatatc aggcaggcca ggaataggga agactcacct cgctagggaa 1321 gtggctaaga gaatcgcagc ctcccttaca ggagaccaac gtgttggtct catcccacgc 1381 aatggcgtcg accattggga tgcgtacaag ggggagaggg tcgtcctatg ggacgattat 1441 ggaatgagca accctattca cgatgccctc aggctgcaag aactcgctga cacttgcccc 1501 ctcactctga actgtgacag gattgaaaat aaaggaaagg tctttgacag cgatgtcatc 1561 attatcacca ccaatctggc caacccagcc ccactggact atgtcaactt tgaagcatgc 1621 tcgaggcgca ttgacttcct cgtgtatgca gaagcccctg atgtcgaaaa ggcgaagcgt 1681 gacttcccag gccagcctga catgtggaag aacgctttca gttctgattt ctcacacata 1741 aaactagcac tggccccaca gggtggtttc gacaagaacg ggaacacccc acatggaaag 1801 ggcgtcatga agtctctcac cactggctcc cttattgccc gggcatcagg gctgctccat 1861 gagaggttag atgaatttga actgcagggc ccagctctca ccaccttcaa tttcgatcgc 1921 aataaagtgc tagcctttag acagcttgct gctgaaaata aatatggatt gttggacaca 1981 atgagggttg ggaaacagct caaggacgtc aaaaccatgc cagaactcaa acaagcactc 2041 aagaatgtct caatcaagaa gtgtcaaata gtgtatagtg gttgcaccta catgcttgat 2101 tctgatggca agggcaatgt gaaagttgac aggatccaaa gcgccaccgt gcagaccaac 2161 aatgagctgg ttggtgccct gcaccacttg aggtgcgcca gaatcagata ctatgtcaag 2221 tgtgtccagg aagccctgta ttccatcatt caaattgctg gggctgcatt tgtcaccacg 2281 cgcattgcca agcgcatgaa catacaagac ctatggtcca agccacaagt ggaaaacaca 2341 gaggagacta ccagcaagga cgggtgccca agacccaagg atgatgagga gtttgtcatt 2401 tcgtccgacg acatcaaaac tgagggaaaa aaagggaaga acaagactgg ccgtggcaag 2461 aagcacacag cattttcaag caaaggcctc agtgatgaag agtacgatga atacaagagg 2521 atcagagaag aaaggaatgg caagtactct atagaagagt accttcagga cagggacaaa 2581 tactatgaag aggtggccat tgccagagcg actgaggaag acttctgtga agaggaggaa 2641 gccaagatcc gacaaaggat ctttaggcca acaaggaaac aacgcaagga ggaaagagtc 2701 tctctcggtt tggtcacggg ttctgaaatt aggaaaagaa acccagatga cttcaaaccc 2761 aaggggaaat tgtgggctga cgatgacagg agtgtggact acaatgagaa actcagtttt 2821 gaggccccgc caagcatttg gtcaagaata gtcaactttg gttcaggctg gggattctgg 2881 gtctccccca gcttgttcat aacatcaacc catgttatac cccagggcgc aaaggagttc 2941 tttggagtcc ccatcaaaca aatacaggta cacaagtcag gcgagttctg tcgcttgaga 3001 ttccctaaac caattaggac tgatgtgacg ggtatgatct tagaagaagg cgcacctgag 3061 ggcaccgtgg tcacactact catcaaaagg tccactgggg aacttatgcc cctagcagct 3121 aggatgggga cccatgcgac catgaagatc caagggcgca ctgttggggg ccagatgggc 3181 atgcttctga caggatccaa cgccaagagt atggacctgg gtaccacacc aggtgattgt 3241 ggctgcccct acatctacaa gagaggtaat gactatgtgg tcattggagt ccacacggct 3301 gccgcacgtg gggggaacac tgtcatatgt gccacccaag ggagtgaagg agaggctaca 3361 cttgagggtg gtgacaacaa ggggacatac tgtggtgcac caatcctagg cccagggagt 3421 gccccaacac ttagcaccaa gaccaaattc tggaggtcgt ccacagcatc actcccacct 3481 ggcacctatg aaccagccta tcttggtggc aaggacccta gagttaaggg tggcccttca 3541 ctgcagcaag tcatgaggga acagttgaag ccattcacag agcccagggg taagccacca 3601 aagccaagtg tgttagaagc tgccaagaaa accatcatta atgtccttga gcaaacaatt 3661 gatccacctg agaaatggtc gttcgcacaa gcttgcgcgt ctcttgacaa gaccacttcc 3721 agtggtcatc cgcaccacat gcggaaaaac gactgctgga acggggagtc cttcacaggc 3781 aagctggcag accaggcttc taaggccaac ctgatgtttg aagaagggaa gaacatgacc 3841 ccagtctaca cagctgcgct caaggatgag ttagttaaaa ctgacaaaat ttatggtaag 3901 atcaagaaga ggcttctttg gggctcggac ttggcgacca tgatccggtg tgctcgagca 3961 ttcggaggcc taatggatga actcaaagcg cactgtgtca cacttcccat tagagttggc 4021 atgaatatga atgaggatgg ccccatcatc ttcgagaggc attccaggta cacgtaccac 4081 tatgatgctg attactctcg atgggattca acacaacaga gagccgtgtt ggcagcagcc 4141 ctagaaatca tggtaaaatt ctccccagaa ccacatttgg ctcaggtagt tgctgaagac 4201 cttctttctc ctagcgtggt ggacgtgggc gacttcacaa tatcaatcaa cgagggcctt 4261 ccctctgggg tgccttgcac ctcccaatgg aactccatcg cccactggct tctcactctc 4321 tgtgcgctct ccgaagtcac aaacctgtct cctgatacca tacaggctaa ttctctcttc 4381 tctttttatg gtgatgatga aattgttagc acagacataa aattggaccc agagaaattg 4441 acagcaaagc tcagagaata tgggttaaag ccaacccgcc ctgacaaaac tgaagggccc 4501 cttgtcatct ctgaagacct gaatggtcta actttcctgc ggagaactgt gacccgcgac 4561 ccagctggtt ggtttggaaa actggagcag agttcaatac tcaggcaaat gtactggact 4621 aggggtccca accatggaga cccatctgaa actatgattc cacactccca aaggcccata 4681 caattgatgt ccctactggg ggaggccgct ctccacggcc cagcatttta cagtaaaatt 4741 agcaaattgg tcattgcaga gctaaaagaa ggtggtatgg atttttacgt gcccagacaa 4801 gagccaatgt tcagatggat gagattctca gatctgagca cgtgggaggg cgatcgcaat 4861 ctggctccca gttttgtgaa tgaagatggc gtcgagtgac gccaacccat ctgatgggtc 4921 cgcagccaac ctcgtcccag aggtcaacaa tgaggttatg gctctggagc ccgttgttgg 4981 tgccgccatt gcggcacctg tagcgggcca acaaaatgta attgacccct ggattagaaa 5041 taattttgta caagcccctg gtggagagtt tacagtgtcc cccagaaacg ctccaggtga 5101 aatactatgg agcgcgccct taggccctga tctaaatccc tacctatccc atttggccag 5161 aatgtataat ggttatgcag gtggttttga agtgcaggta attctcgcgg ggaacgcgtt 5221 caccgccggg aaagtcatat ttgcagcagt cccaccaaat ttcccaactg aaggcttgag 5281 ccccagccag gtcactatgt tcccccatat agtagtagat gttaggcaac tagaacctgt 5341 gttgattccc ttacccgatg ttaggaataa tttctaccat tacaatcaat caaatgaccc 5401 caccattaag ttgatagcaa tgttgtacac accacttagg gctaataatg ctggggacga 5461 tgtcttcaca gtttcttgcc gagttctcac gagaccatcc cccgattttg atttcatatt 5521 tctagtgcca cccacagttg agtctagaac taaaccattc tctgtcccag ttttaactgt 5581 tgaggagatg accaattcaa gattccccat ccctctggaa aagttgttca cgggtcccag 5641 cagtgccttt gttgttcaac cacaaaatgg taggtgcacg actgatggcg tgctcctagg 5701 caccacccaa ttgtctcctg tcaacatctg caccttcaga ggggatgtca cccacattac 5761 aggtagtcgt aactacacaa tgaatttggc ttctcaaaat tggaacaatt atgacccaac 5821 agaagaaatc ccagcccctc taggagctcc agattttgtg gggaagattc aaggcatgct 5881 cacccaaacc acaagggcag atggctcaac acgcggccac aaagccacgg tgtacactgg 5941 gagcgccgac tttgctccaa aactgggcag agttcaattt gaaactgaca cagaccatga 6001 ttttgaagct aaccaaaaca caaagttcac cccagtcggt gtcatccaag atggcagcac 6061 cacccaccga aatgaacccc aacagtgggt gctcccaagt tactcaggca gaaatactca 6121 caatgttcat ctggcccccg ctgtagcccc tacttttccg ggtgagcaac ttctcttctt 6181 taggtccact atgcccggat gtagcgggta ccccaacatg gatttggact gtctgctccc 6241 ccaggaatgg gtgcagtact tctaccaaga ggcagcccca gcacaatctg atgtggctct 6301 gctaagattt gtgaatccag acacaggtag ggttttgttt gagtgtaaac ttcataaatc 6361 aggctatgtt acagtggctc acactggcca acatgatttg gttatccccc ccaatggtta 6421 ttttaggttt gattcctggg tcaaccagtt ctacacgctt gcccccatgg gaaatggagc 6481 ggggcgtaga cgtgtagtat aatggctgga gctttctttg ctggattagc atctgatgtc 6541 cttggctctg gacttggctc ccttatcaat gctggggctg gggccatcaa ccaaaaagtt 6601 gagtttgaaa ataacagaaa attgcaacaa gcatccttcc aatttagcag caatctacaa 6661 caggcttcct ttcaacatga caaagaaatg ctccaagcac aaattgaggc caccaaaaag 6721 ctacaacagg aaatgatgag agttaagcag gcaatgctcc tagagggtgg gttctctgag 6781 acagatgcag cccgcggggc aatcaacgcc cccatgacaa aagctttgga ctggagcggg 6841 acaaggtact gggctcctga tgctaggacc acaacataca atgctggccg cttttccacc 6901 cctcaaccat cgggggcact gccaggaaga gctaatctta gggatgctgt ccttgctcgg 6961 ggttcctcta gcaaatctta taactcttct actgctactt ctgtgtactc aaatcaaacc 7021 acttcaacga gacttggttc tacagctggt tctggtacca gtgtctcgag tctcccgtca 7081 actgcaagga ctaggagctg ggttgaggat caaagtagga atttgtcccc tctcatgagg 7141 ggggcccaca acatatcgtt tgtcacccca ccatctagca gatcctctag ccaaggcaca 7201 gtctcaaccg tgcctaaaga ggttttgaac tcctggactg gcgctttcaa cacgcgcagg 7261 cagcctctct tcgctcatat tcgtaaacga ggggagtcac gggtgtaatg tgaaaagaca 7321 aaattgatta tctttctttc tctttagtgt cttttaaaaa aaaaaaaaaa aaaaaaaaaa 7381 aaaaaaaaaa //