Complete norovirus genomes

| Accession | Genotype | P-type |
|---|---|---|
| OP727614 | GII.4 Sydney | GII.P31 |

ORF1: 6..5105
ORF2: 5086..6708
ORF3: 6708..7512

LOCUS       OP727614                7512 bp    RNA     linear   VRL 01-NOV-2022
DEFINITION  Norovirus GII isolate OES4 nonstructural polyprotein (ORF1) and VP1
            (ORF2) genes, complete cds; and VP2 (ORF3) gene, partial cds.
ACCESSION   OP727614
VERSION     OP727614.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7512)
  AUTHORS   Kelly,D., Allen,D.J. and Iturriza-Gomara,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-OCT-2022) Clinical Infection, Microbiology and
            Immunity, Institute of Infection and Global Health, 8 West Derby
            Street, Liverpool, Please select. L697BE, United Kingdom
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: SPAdes v. 3.15.4
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7512
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="OES4"
                     /isolation_source="stool sample"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="United Kingdom"
                     /collection_date="Jan-2017"
                     /note="genotype: GII.P31/GII.4"
     gene            6..5105
                     /gene="ORF1"
     CDS             6..5105
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="UYR40592.1"
                     /translation="MKMASNDASAAAVANSNNDIAKSSSDGMFSNMAVTFKRALGARP KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHRPPAAIS LARVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST KSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGPALTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGKQLKDVKTM SDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLR CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMANKDGC LKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP NLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP RGKPPRPSVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDL ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD STQQRDVLATALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCT SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide 6..995 /gene="ORF1" /product="p48" mat_peptide 996..2093 /gene="ORF1" /product="NTPase" mat_peptide 2094..2630 /gene="ORF1" /product="p22" mat_peptide 2631..3029 /gene="ORF1" /product="VPg" mat_peptide 3030..3572 /gene="ORF1" /product="Pro" mat_peptide 3573..5102 /gene="ORF1" /product="RdRp" gene 5086..6708 /gene="ORF2" CDS 5086..6708 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="UYR40593.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHTTGSHNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG GTTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD 
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6708..>7512 /gene="ORF3" CDS 6708..>7512 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="UYR40594.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPRRANLRDTVPTRGSSSKSS NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQNRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 cgtgaatgaa gatggcgtct aacgacgctt ccgctgccgc tgttgccaac agcaacaacg 61 acatcgcaaa atcttcaagt gacggtatgt tttctaacat ggctgtcact tttaagcggg 121 ccctcggggc gcggcctaaa cagccgcccc cgaaggaaat accacccaga cccccgcgac 181 cacccacacc agaattggtc aaaaagatcc ctcctccccc acccaacggg gaggatgaac 241 tagtggtctc ttacagcgcc aaagatggcg tttccggact gcctgagctc accactgtca 301 gacaaccgga agaaaccaac acggcgttca gtgttccccc actcaaccaa agggagagca 361 gggacgccaa ggagccacta actgggacaa ttattgaaat gtgggatgga gaaatctacc 421 attacggcct gtacgtggaa cgaggtctta tacttggtgt gcacaggcca ccggcagcca 481 ttagccttgc tagggtcgag ctagcaccgc tctctttgtt ctggagacct gtatacaccc 541 cccagtatct catctcccca gacactctta ggagattaca tggggagtca ttcccctaca 601 ctgcatttga caacaattgc tacgcctttt gttgctgggt attggaccta aacgactcat 661 ggctaagcag gagaatgatt cagagaacaa caggcttctt caggccgtac caggattgga 721 acaggaaacc cctccccact atggatgatt ccaaattaaa gaaggtagcc aacatattct 781 tgtgcacttt gtcttcacta ttcaccagac ccattaagga cataataggg aagttgaaac 841 ctcttaacat ccttaacatt ctggctacat gtgattggac cttcgcaggc atagtggaat 901 ccttaatact cttggcagaa ctctttggag ttttctggac acccccagat gtgtctgcga 961 tgatcgcccc cttgctaggt gattatgaac tgcaaggacc tgaggacctt gcagtggaac 1021 tggtcccaat agtgatgggg gggataggtt tggtgctagg atttaccaaa gagaaaatcg 1081 gaaagatgct atcatccgct gcatccactt taagagcttg taaagacctt ggtgcatacg 1141 gactggaaat cttaaaattg gtcatgaagt ggttcttccc aaagaaagag gaagcaaatg 1201 aactggctat ggtgagatcc atcgaggatg 
cagtgttaga cctcgaggca attgaaaaca 1261 accacatgac caccctactc aaagacaaag acagcttggc aacctacatg agaacccttg 1321 accttgagga ggagaaagcc agaaaactct caaccaaatc tgcttcaccc gatattgtgg 1381 gcacaatcaa ctctcttctg gcaagaatcg ctgctgcacg ctccctagtg catcgggcga 1441 aagaagagct ctccagcagg ccgagacctg tcgttgtgat gatatcgggg aggccaggga 1501 tagggaaaac tcaccttgcc agggagctgg ccaagaagat cgcggcctcc ctcacagggg 1561 accagcgtgt gggccttatc ccacgcaatg gtgtcgacca ttgggacgca tacaagggcg 1621 aaagagttgt cctatgggac gactatggaa tgagcaaccc catccacgac gccctcaggt 1681 tgcaggagct tgctgacact tgccccctca cgctaaattg tgacagaatt gagaacaaag 1741 ggaaagtctt tgacagtgat gccataatca tcaccaccaa tctggccaac ccagcaccac 1801 tggattatgt caactttgaa gcgtgctcga gacgtattga tttcctcgtg tacgcagaag 1861 cccctgaggt ggagaaggca aagcgcgact tcccaggtca acctgacatg tggaagaacg 1921 ccttcagtcc tgacttctca cacataaaac tgtcattggc tccacagggt ggttttgaca 1981 agaacggcaa caccccgcat ggaaaagggg tcatgaagac cctcaccact ggctccctca 2041 tcgcccgagc atcagggtta ctccatgaga ggctagatga atatgaactg caaggcccag 2101 ccctcaccac cttcaacttt gaccgcaaca aggtacttgc ttttagacaa cttgctgctg 2161 aaaacaagta tgggctgatg gacacaatga gagttggaaa acagctcaag gatgtcaaga 2221 ccatgtcaga cctcaaacaa gcactcaaga acatcgcgat caagaagtgc cagatagtgt 2281 acaatggtgg cacctacaca cttgaggctg atggcaaggg tagtgtgaaa gttgacaaag 2341 tgcaaagtgc cactgtgcag accaacaatg aactagccgg tgccctacac cacctaaggt 2401 gcgctagaat cagatactat gtcaagtgcg tccaggaggc actgtattcc atcatccaaa 2461 tcgctggggc tgcattcgtc accacgcgca tcgctaagcg catgaatata caaaatctct 2521 ggtccaagcc acaggtggaa gacacagaag agatggccaa caaagatggt tgcctaaaac 2581 ccaaagatga tgaggagttt gtcgtctcat ccgacgacat caaaactgag ggcaagaaag 2641 ggaagaacaa gtctggccgt ggcaagaagc acacagcctt ttcaagtaaa gggctcagtg 2701 atgaggagta cgatgagtac aagagaatca gagaagaaag gaatggtaag tactccatag 2761 aagagtacct tcaggacaga gacaggtact acgaggaggt ggccattgcc agggcaaccg 2821 aagaggactt ctgtgaagaa gaagaggcca aaatccggca gagaattttc agaccaacaa 2881 ggaaacaacg caaagaagag agggcctctc tcggcttggt 
cacaggctct gaaatcagga 2941 aaagaaaccc agaagacttc aaacccaagg gaaagctgtg ggctgatgat gacagaagtg 3001 ttgactacaa tgagaaactc aactttgagg ccccaccaag catctggtcg cggatagtca 3061 actttggttc gggctggggc ttctgggtct cccccaatct gtttataaca tcaacccatg 3121 tcatacccca aggtgcaaaa gagttcttcg gagtccctat caagcaaatc caaatacaca 3181 agtcaggtga attctgccgg ttgagattcc caaagccaat cagaactgat gtgactggca 3241 tgattctaga agaaggtgcg cccgagggga ccgtggccac actgctcatc aagagaccaa 3301 ctggagagct catgcctctg gcagccagaa tggggaccca tgcaaccatg aaaattcagg 3361 ggcgcacagt tggagggcaa atgggtatgc tcctgacagg atccaacgcc aagagtatgg 3421 acctaggcac aacaccaggc gactgcggct gcccctacat ctacaagagg gggaatgact 3481 acgtggtcat aggagtccat acggccgctg cccgtggagg aaacactgtc atatgtgcca 3541 cccaggggag tgagggagaa gccacacttg aaggaggtga cagtaaaggg acatactgtg 3601 gcgcaccaat cttgggccca gggagcgctc cgaagctcag caccaagact aagttttgga 3661 gatcatccac aacaccactc ccacctggca cctacgaacc agcctacctc ggtggcaagg 3721 accctagagt caaaggtggc ccttcattgc aacaagttat gagggaccag ctgaagccat 3781 tcacagagcc cagaggcaaa ccaccaagac caagtgtgct ggaggctgcc aagaaaacca 3841 tcatcaatgt ccttgagcaa acaattgatc caccccaaaa atggtcattt gcgcaagctt 3901 gcgcatccct tgacaaaacc acctccagcg gccacccgca ccacatgcgg aaaaacgact 3961 gttggaatgg ggagtccttc acaggaaaat tggctgatca agcctccaag gccaacctaa 4021 tgtttgagga gggaaagaac atgactccag tctacacagg tgcacttaaa gatgagttgg 4081 taaagaccga taaagtttat ggtaaggtca agaagaggct tctgtggggt tcagatctag 4141 cgaccatgat acggtgcgcc cgagcttttg gaggccttat ggatgaactc aaggcacact 4201 gtgtcacact tcctgtcaga gttggtatga acatgaatga ggatggcccc atcatctttg 4261 agaagcactc cagatataga tatcactatg atgctgatta ctcccggtgg gactcaacac 4321 agcaaaggga tgtgctagca acagcactag aaatcatggt taagttctct ccagaaccac 4381 acctggccca aatagttgca gaagacctcc tttcccctag cgtgatggat gtaggtgact 4441 ttcaaatatc aataagtgag ggtctcccct ctggggtacc ttgtacctcc cagtggaatt 4501 ccatcgccca ctggctcctc actctgtgtg cactctctga agtcacggac ctatcccctg 4561 atattattca ggccaactcc cttttctcct tctatggtga tgatgagatt 
gtaagcacag 4621 atataaagtt ggacccagag aagctgacag caaaactcaa ggagtacggg ctgaaaccaa 4681 cccgccctga caaaactgaa ggaccccttg ttatctctga agacctggat ggcttgacat 4741 tcctccggag aactgtgacc cgtgatccag ctggctggtt tggaaaatta gaacaaagtt 4801 caattctcag gcaaatgtac tggaccaggg gtcctaacca tgaagaccca tttgaaacaa 4861 tgataccaca ctcccaaaga cccatacaat tgatgtcctt gctgggcgag gctgcactcc 4921 acggcccggc attctatagc aaaattagca aattagtcat tgcagagttg aaggagggtg 4981 gcatggattt ttacgtaccc agacaagagc caatgttcag atggatgaga ttctcagatc 5041 tgagcacgtg ggagggcgat cgcaatctgg ctcccagttt tgtgaatgaa gatggcgtcg 5101 agtgacgcca acccatctga tgggtccgca gccaacctcg tcccagaggt caacaatgag 5161 gttatggctc tggagcccgt tgttggtgcc gccattgcgg cacctgtagc gggccaacaa 5221 aatgtaattg acccctggat tagaaataat tttgtacaag cccctggtgg agagttcaca 5281 gtatccccta gaaacgctcc aggtgaaata ctatggagcg cgcccctggg ccctgatcta 5341 aatccctacc tatcccattt ggccagaatg tacaatggtt atgcaggtgg ttttgaagtg 5401 caggtaattc tcgcggggaa cgcgttcacc gccgggaagg tcatatttgc agcagttcca 5461 ccaaattttc caactgaagg cttgagcccc agccaggtca ctatgtttcc ccatatagta 5521 gtggatgtta ggcaattaga acctgtgttg attcccttac ccgatgttag gaataatttc 5581 tatcattaca atcaatcaaa tgaccccacc attaagttga tagcaatgtt gtatacacca 5641 cttagggcta ataatgctgg ggatgatgtc ttcacagttt cctgccgagt tctcacgaga 5701 ccatcccccg attttgattt catattccta gtgccaccca cagttgagtc aagaactaaa 5761 ccattctctg tcccagtttt aactgttgag gagatgacca attcaagatt ccccattcct 5821 ttggaaaagt tgttcacggg tcccagcagt gcctttgttg tccaaccaca aaacggcagg 5881 tgcacgactg atggcgtgct cctaggcacc acccaactgt ctcctgtcaa catctgcacc 5941 ttcagaggag atgtcaccca taccacaggt agtcataact acacaatgaa tttggcttct 6001 caaaattgga gcaattatga cccaacagaa gaaatcccag cccctctagg aactccagat 6061 tttgtgggga agattcaggg cgtgctcacc caaaccacaa ggacagatgg ctcaacacgc 6121 ggccacaaag ccacagtgta tactgggagc gccgactttg ctccaaaact gggtagagtt 6181 caatttgaaa ctgacacaaa ccatgatttt gaagctaacc aaaacacaaa gttcacccca 6241 gttggtgtca tccaagatgg tggcaccacc caccgaaatg agccccaaca gtgggtgctc 
6301 ccaagttact caggcaggaa cactcccaat gtgcatctgg cccccgctgt agcccccact 6361 tttccgggtg agcaacttct cttcttcaga tccaccatgc ccggatgcag cgggtacccc 6421 aacatggatt tggactgtct gctcccccag gaatgggtgc agtacttcta ccaagaggcg 6481 gccccagcac aatctgatgt ggctctgcta agatttgtga atccagacac aggtagggtt 6541 ttgtttgagt gtaagcttca taaatcaggc tatgttacag tggctcacac tggccaacat 6601 gatttggtta ttccccccaa tggttatttt aggtttgatt cctgggtcaa ccagttctac 6661 acgcttgccc ccatgggaaa tggaacgggg cgtagacgtg cactataatg gctggagctt 6721 tctttgctgg attggcatct gatgtccttg gctctggact tggttccctt atcaatgctg 6781 gggctggggc cattaaccaa aaagttgagt ttgaaaataa cagaaaattg caacaagcat 6841 ccttccaatt tagcagcaat ctacaacagg cttcctttca acatgacaaa gagatgctcc 6901 aagcacaaat tgaggccacc aaaaggctac aacaggaaat gatgaaagtt aagcaggcaa 6961 tgctcctgga gggtggattc tctgagacag atgcagctcg cggggcaatc aacgccccca 7021 tgacaaaagc tttggactgg agcgggacaa ggtactgggc tcccgacgct aggactacaa 7081 catacaatgc aggccgcttt tccacccctc aaccatcggg ggcactgcca agaagagcta 7141 atctcaggga tactgtccct actcggggtt cctccagtaa gtcttctaat tcttctactg 7201 ctacttctgt gtactcaaat caaaccactt caacgagact tggttctaca gctggttctg 7261 gcaccagtgt ctcgagcttc ccgtcaactg caaggactag gagctgggtt gaggatcaaa 7321 acaggaattt gtcacctttc atgagggggg cccacaacat atcgtttgtc accccaccat 7381 ctagcagatc ctctagccaa ggcacagtct caaccgtgcc taaagagatt ttggactcct 7441 ggactggcgc tttcaacacg cgcaggcagc cactcttcgc tcacattcgt aagcgagggg 7501 agtcacgggc gt //
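
The ORF coordinates listed for this record (ORF1: 6..5105, ORF2: 5086..6708, ORF3: 6708..7512) are 1-based and inclusive, so neighbouring reading frames share a few nucleotides. The following is a minimal sketch, not part of the typing tool itself, of how the span lengths, reading frames and overlaps could be derived from that summary line. The helper names `parse_orfs`, `span_length` and `overlap` are hypothetical. Note that ORF3 is annotated as partial (`6708..>7512`), so its length here is only what is present in this sequence.

```python
import re

# ORF summary exactly as shown for OP727614 (1-based, inclusive coordinates).
SUMMARY = "ORF1: 6..5105 ORF2: 5086..6708 ORF3: 6708..7512"


def parse_orfs(summary):
    """Return {name: (start, end)} parsed from a 'NAME: start..end' summary."""
    matches = re.findall(r"(ORF\d+):\s*(\d+)\.\.(\d+)", summary)
    return {name: (int(start), int(end)) for name, start, end in matches}


def span_length(span):
    """Length in nucleotides of an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def overlap(a, b):
    """Number of nucleotides shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


orfs = parse_orfs(SUMMARY)

for name, span in orfs.items():
    # Frame is the offset of the first codon position relative to base 1.
    frame = (span[0] - 1) % 3
    print(f"{name}: {span[0]}..{span[1]}  {span_length(span)} nt  frame {frame}")

print("ORF1/ORF2 overlap:", overlap(orfs["ORF1"], orfs["ORF2"]), "nt")
print("ORF2/ORF3 overlap:", overlap(orfs["ORF2"], orfs["ORF3"]), "nt")
```

With these coordinates, ORF1 and ORF2 share 20 nt (5086..5105) and ORF2 and ORF3 share a single nucleotide (6708), which is consistent with the gene features in the record above.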