Typing tool
|
Complete norovirus genomes
OR039448 | GII.4 Sydney | ||
---|---|---|---|
GII.P31 |
ORF1: 1..4970 ORF2: 4951..6573 ORF3: 6573..7379LOCUS OR039448 7420 bp RNA linear VRL 29-MAY-2023 DEFINITION Norovirus GII isolate stool nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds. ACCESSION OR039448 VERSION OR039448.1 DBLINK BioProject: PRJNA490509 BioSample: SAMN35345295 KEYWORDS . SOURCE Norovirus GII ORGANISM Norovirus GII Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus. REFERENCE 1 (bases 1 to 7420) AUTHORS Yang,Z. TITLE Direct Submission JOURNAL Submitted (24-MAY-2023) DMB, CFSAN OARSA foodborne pathogen submission group, 8301 Muirkirk Rd, Laurel, MD 20708, USA COMMENT ##Assembly-Data-START## Assembly Method :: CLC GWB v. 10 Sequencing Technology :: Illumina ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..7420 /organism="Norovirus GII" /mol_type="genomic RNA" /isolate="stool" /isolation_source="stool" /host="Homo sapiens" /db_xref="taxon:122929" /country="USA: CA" /collection_date="Feb-2014" /note="genotype: GII.4 Sydney_2012[P31]" gene <1..4970 /gene="ORF1" CDS <1..4970 /gene="ORF1" /codon_start=3 /product="nonstructural polyprotein" /protein_id="WHU05298.1" /translation="KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDG VSGLPELTTVRQPEETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVER GLILGVHKPPAAISLAKVELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNN CYAFCCWVLDLNDSWLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTL SSLFTRPIKDIIGKLKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMI APLLGDYELQGPEDLAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAY GLEILKLVMKWFFPKKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMR TLDLEEEKARKLSTKSASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMIS GRPGIGKTHLARELAKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNP IHDALRLQELADTCPLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRR IDFLVYAEAPEVEKAKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKG VMKTLTTGSLIARASGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMD TMRVGKQLKDVKTMSDLKQALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATV QTNNELAGALHHLRCARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKP QVEDTEEMANKDGCLKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDE EYDEYKRIREERNGKYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPT RKQRKEERASLGLVTGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSR IVNFGSGWGFWVSPSLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRT DVTGMILEEGAPEGTVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTG SNAKSMDLGTTPGDCGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEG GDSKGTYCGAPILGPGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSL QQVMRDQLKPFTEPRGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTT SSGHPHHMRKNDCWNGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKV YGKVKKRLLWGSDLATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHS RYRYHYDADYSRWDSTQQRDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQ ISISEGLPSGVPCTSQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVST DIKLDPEKLTAKLKEYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLE QSSILRQMYWTRGPNHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAE LKEGGMDFYVPRQEPMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..860 /gene="ORF1" /product="p48" mat_peptide 861..1958 /gene="ORF1" /product="NTPase" mat_peptide 1959..2495 /gene="ORF1" /product="p22" mat_peptide 2496..2894 /gene="ORF1" /product="VPg" mat_peptide 2895..3437 /gene="ORF1" /product="Pro" mat_peptide 3438..4967 /gene="ORF1" /product="RdRp" gene 4951..6573 /gene="ORF2" CDS 4951..6573 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="WHU05299.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGVL TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFGTDTDHDFEANQNTKFTPVGVIQDG STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL" gene 6573..7379 /gene="ORF3" CDS 6573..7379 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="WHU05300.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKRAMLLEGGFSETDAARGAIN APMTKALDWNGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS NSSTATSVYSNQTTSTRLGSTAGSGTSVSSFPSTARTRSWVEDQSRNLSPFMRGAHNI SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 ctaaacagcc gcccccgaag gaaataccac ccagaccccc gcgaccaccc acaccagaat 61 tggtcaaaaa gatccctcct cccccaccca acggggagga tgaactagtg gtctcttaca 121 gcgccaaaga tggcgtttcc ggactgcctg agctcaccac tgtcagacaa ccggaagaaa 181 ccaacacggc gttcagtgtc cccccactca accaaaggga gagcagggac gccaaggagc 241 cactaactgg aacaatcatt gaaatgtggg atggagaaat ctaccattac ggcctgtacg 301 tggaacgagg tcttatactt ggtgtgcaca agccaccggc agccattagc cttgccaagg 361 tcgagctagc accgctctct ttgttctgga gacctgtata caccccccag tatctcatct 421 ctccagacac tcttaggaga ttacatggag agtcattccc ctacactgca tttgacaaca 481 attgctacgc cttttgttgt tgggtattag acctaaacga ctcatggcta agcaggagaa 541 tgattcagag aacaacaggc ttcttcaggc cgtaccagga ttggaacagg aaacccctcc 601 ccactatgga tgattccaaa ttaaagaagg tagccaacat attcttgtgc actttgtctt 661 cactattcac cagacccatt aaggacataa tagggaagtt gaaacctctt aacatcctta 721 acattctggc tacatgtgat tggaccttcg caggcatagt ggaatcctta atactcttgg 781 cagaactctt tggagttttc tggacacccc cagatgtgtc tgcgatgatc gcccccttgc 841 taggtgatta tgaactgcaa ggacctgagg accttgcagt ggaactggtc ccaatagtga 901 tgggggggat aggtttggtg ctaggattta ccaaagagaa aatcggaaag atgctatcat 961 ccgctgcatc cactttaaga gcttgtaaag accttggtgc atacggactg gaaatcttaa 1021 aattggtcat gaagtggttc ttcccaaaga aagaggaagc aaatgaactg gctatggtga 1081 gatccatcga ggatgcagta ctagacctcg aggcaattga aaacaaccac atgaccaccc 1141 tactcaaaga caaagacagc ttggcaacct acatgagaac ccttgacctt gaggaggaga 1201 aagccagaaa actctcaacc aaatctgctt cacccgatat tgtgggcaca atcaactctc 1261 ttttggcaag aatcgctgct gcacgctccc tagtgcaccg ggcgaaagaa gagctctcca 1321 gcaggccgag acctgtcgtt gtgatgatat cgggaagacc agggataggg aaaactcacc 1381 ttgccaggga gctggccaag aagatcgcgg cctccctcac aggggaccag cgtgtgggtc 1441 ttatcccacg caatggtgtc gaccactggg acgcatacaa gggcgaaaga gttgtcctat 1501 gggacgacta tggaatgagc aaccccatcc atgatgccct caggttgcag gagcttgctg 1561 acacttgccc cctcacgcta aattgtgaca gaattgagaa caaagggaaa gtctttgaca 1621 gtgatgccat aattatcacc accaatctgg ccaacccagc accactggat tatgtcaact 1681 ttgaagcgtg ctcgagacgc attgatttcc tcgtgtacgc agaagcccct gaggtggaga 1741 aggcaaagcg cgacttccca ggtcaacctg acatgtggaa gaacgctttc agtcctgact 1801 tctcacacat aaaactgtca ttggctccac agggtggttt tgataagaac ggcaacaccc 1861 cgcatggaaa aggggtcatg aagaccctca ccactggctc cctcatcgcc cgagcatcag 1921 ggttactcca tgagaggcta gatgaatatg aactgcaagg cccagccctc accactttca 1981 actttgaccg taacaagata cttgctttta gacagcttgc tgctgaaaac aagtatgggc 2041 tgatggacac aatgagagtt ggaaaacagc tcaaggatgt caagaccatg tcagacctca 2101 aacaagcact caagaacatc gcgatcaaga agtgccagat agtgtacaat ggtggcacct 2161 acacacttga ggctgatggc aagggtagtg tgaaagttga caaagtgcaa agcgccactg 2221 tgcagaccaa caatgaacta gccggtgccc tacaccacct aaggtgcgct agaatcagat 2281 actatgttaa gtgcgtccag gaggcactgt attccatcat ccaaatcgct ggggctgcgt 2341 tcgtcaccac gcgcatcgct aagcgcatga atatacagaa tctctggtcc aagccacagg 2401 tggaagacac agaagagatg gccaacaaag atggttgcct aaaacccaaa gatgatgaag 2461 agtttgtcgt ctcatccgac gacatcaaaa ctgagggcaa gaaagggaag aacaagtccg 2521 gccgtggcaa gaagcacaca gccttttcaa gtaaagggct cagtgatgag gagtacgatg 2581 agtacaagag aatcagagaa gaaaggaatg gtaagtactc catagaagag taccttcagg 2641 acagagacag gtactacgag gaggtggcca ttgccagggc aaccgaagag gacttctgtg 2701 aagaagaaga ggccaaaatc cggcagagaa tcttcagacc aacaaggaaa caacgcaaag 2761 aagagagggc ctctctcggc ttggtcacag gctctgaaat caggaagaga aacccagaag 2821 acttcaaacc caagggaaag ctgtgggctg atgatgacag aagtgttgac tacaatgaga 2881 aactcaactt tgaggcccca ccaagcatct ggtcgcggat agtcaacttt ggttcaggct 2941 ggggcttctg ggtctccccc agtctgttta taacatcaac ccatgtcata ccccaaggtg 3001 caaaagagtt cttcggagtc cctatcaagc aaatccagat acacaagtca ggtgaattct 3061 gccggttgag attcccaaag ccaatcagaa ctgatgtgac gggcatgatt ctagaagaag 3121 gtgcgcccga ggggaccgtg gccacactgc tcatcaagag accaactgga gagctcatgc 3181 ctctggcagc cagaatgggg acccatgcaa ccatgaaaat tcaggggcgc acagttggag 3241 ggcaaatggg tatgctcctg acaggatcca acgccaagag tatggaccta ggcacaacac 3301 caggcgactg cggctgcccc tacatctaca agagggggaa tgactacgtg gtcataggag 3361 tccatacggc cgctgcccgt ggaggaaaca ctgtcatatg tgccacccag gggagtgagg 3421 gagaagccac acttgaagga ggtgacagta aagggacata ctgtggcgca ccaatcttgg 3481 gcccagggag tgctccgaag ctcagcacca agactaagtt ttggagatca tccacaacac 3541 cactcccacc tggcacttac gaaccagcct acctcggtgg caaagaccct agagtcaaag 3601 gtggcccttc attgcaacaa gttatgaggg accagctgaa gccattcaca gaacccagag 3661 gcaaaccacc aagaccaaat gtgttggaag ctgccaagaa aaccatcatc aatgtccttg 3721 agcaaacaat tgatccaccc caaaaatggt catttgcgca agcttgcgca tcccttgaca 3781 aaaccacctc cagcggccac ccgcaccaca tgcggaaaaa cgactgttgg aatggggagt 3841 ccttcacagg aaaattggct gatcaagcct ccaaggccaa cctaatgttt gaagagggaa 3901 agaacatgac tccagtctac acaggtgcac ttaaagatga gttggtaaag accgataaag 3961 tttatggtaa ggtcaagaag aggcttctgt ggggttcaga tctggcgacc atgatacggt 4021 gcgcccgagc ttttggaggc cttatggatg aactcaaggc acactgtgtc acacttcctg 4081 tcagagttgg tatgaacatg aatgaggatg gccccatcat ctttgagaag cactccagat 4141 atagatatca ctatgatgct gattattctc ggtgggactc aacacaacaa agggatgtgc 4201 tagcagcagc actagaaatc atggttaagt tctctccaga accacacctg gcccagatag 4261 ttgcagaaga cctcctttcc cctagcgtga tggatgtagg tgactttcaa atatcaataa 4321 gtgagggtct cccctctggg gtaccttgta cctcccagtg gaattctatc gcccactggc 4381 tcctcactct gtgtgcactc tctgaagtca cggacctgtc ccctgatatc attcaggcca 4441 actccctttt ctccttctat ggtgatgatg agattgtaag cacagacata aaactggacc 4501 cagagaagct gacagcaaag ctcaaggagt acgggctgaa accaacccgc cccgacaaaa 4561 ctgaaggacc ccttgttatc tctgaagacc tggatggcct gacattcctc cggagaactg 4621 tgacccgtga tccagctggc tggtttggaa aattggaaca aagttcaatt ctcaggcaaa 4681 tgtactggac caggggtccc aaccatgaag atccatttga aacaatgata ccacactccc 4741 aaagacccat acaattgatg tccttgctgg gcgaggctgc actccacggc ccggcatttt 4801 atagcaaaat tagcaaacta gtcattgcag agttgaagga aggtggcatg gatttttacg 4861 tacccagaca agagccaatg ttcagatgga tgagattctc agatctgagc acgtgggagg 4921 gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgagtga cgccaaccca 4981 tctgatgggt ccgcagccaa cctcgtccca gaggtcaaca atgaggttat ggctctggag 5041 cccgttgttg gtgccgccat tgcggcacct gtagcgggcc aacaaaatgt aattgacccc 5101 tggattagaa ataattttgt acaagcccct ggtggagagt ttacagtgtc ccctagaaac 5161 gctccaggtg aaatactatg gagcgcgccc ttgggccctg atctaaatcc ctacctatcc 5221 catttggcca gaatgtacaa tggttatgca ggtggttttg aagtgcaggt aattctcgcg 5281 gggaacgcgt tcaccgccgg gaaggtcata tttgcagcag tcccaccaaa ttttccaact 5341 gaaggcttga gccccagcca ggtcactatg ttcccccata tagtagtaga tgttaggcaa 5401 ttagaacctg tgttgattcc cttacccgat gttaggaata atttctatca ttacaatcaa 5461 tcaaatgacc ccaccattaa gttgatagca atgttgtata caccacttag ggctaataat 5521 gctggggatg atgtcttcac agtttcttgc cgagttctca cgagaccatc ccccgatttt 5581 gatttcatat ttctagtacc acccacagtt gagtcaagaa ctaaaccatt ctctgtccca 5641 gttttaactg ttgaggagat gaccaattca agattcccca ttcctttgga aaagttgttc 5701 acgggtccca gcagtgcctt tgttgtccaa ccacaaaacg gtaggtgcac gactgatggc 5761 gtgctcctag gcaccaccca actgtctcct gtcaacatct gcaccttcag aggagatgtc 5821 acccatatca caggtagtcg taactacaca atgaatttgg cttctcaaaa ttggagcaat 5881 tatgacccaa cagaagaaat cccagcccct ctaggaactc cagattttgt ggggaagatt 5941 caaggcgtgc tcacccaaac cacaaggaca gatggctcaa cacgcggcca caaagccaca 6001 gtgtacactg ggagcgccga ctttgctcca aaactgggta gagttcaatt tggaactgac 6061 acagaccatg attttgaagc taaccaaaac acaaaattca ccccagttgg tgtcatccaa 6121 gatggtagca ccacccaccg aaatgaaccc caacagtggg tgctcccaag ttactcaggc 6181 agaaatactc ctaatgtgca tctggccccc gctgtagccc ccacttttcc gggtgagcaa 6241 cttctcttct tcagatccac catgcccgga tgcagcgggt accccaacat ggatttggac 6301 tgtctgctcc cccaggaatg ggtgcagtac ttctaccaag aggcagcccc agcacaatct 6361 gatgtggctc tgctaagatt tgtgaatcca gacacaggta gggttttgtt tgagtgtaag 6421 cttcataaat caggctatgt tacagtggct cacactggcc aacatgattt ggttatcccc 6481 cccaatggtt attttaggtt tgattcctgg gtcaaccagt tttacacgct tgcccccatg 6541 ggaaatggaa cggggcgtag acgtgcacta taatggctgg agctttcttt gctggattgg 6601 catctgatgt ccttggctct ggacttggtt cccttatcaa tgctggggct ggggccatca 6661 accaaaaagt tgagtttgaa aataacagaa aattgcaaca agcatccttc caatttagca 6721 gcaatctaca acaggcttcc tttcaacatg acaaagagat gctccaagca caaattgagg 6781 ccaccaaaag gctacaacag gaaatgatga aagttaagcg ggcaatgctc ctagagggtg 6841 ggttctctga gacagatgca gcccgcgggg caatcaacgc ccccatgaca aaagctttgg 6901 actggaacgg gacaaggtac tgggctcccg atgctaggac tacaacatac aatgcaggcc 6961 gcttttccac ccctcaacca tcgggggcac tgccaggaag agctaatctt agggatgctg 7021 tccctgctcg gggttcctcc agtaagtctt ctaattcttc tactgctact tctgtgtact 7081 caaatcaaac tacttcaacg agacttggtt ctacagctgg ttctggtacc agtgtctcga 7141 gcttcccgtc aactgcaagg actaggagct gggttgagga tcaaagtagg aatttgtcac 7201 ctttcatgag gggggcccac aacatatcgt ttgtcacccc accatctagc agatcctcta 7261 gccaaggcac agtctcaacc gtgcctaaag agattttgga ctcctggact ggcgctttca 7321 acacgcgcag gcagccactc ttcgcccaca ttcgtaagcg aggggagtca cgggcgtaat 7381 gtgaaaagac aaaattgatt atctttcttt ttctttagtg //