Typing tool

Complete norovirus genomes

MW009070 | GII.4 Sydney | GII.P4 New Orleans
---|---|---
ORF1: 1..5094 ORF2: 5075..6697 ORF3: 6697..7500

LOCUS       MW009070 7561 bp RNA linear VRL 21-SEP-2020
DEFINITION  Norovirus GII isolate 432-2 nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION   MW009070
VERSION     MW009070.1
DBLINK      BioProject: PRJNA604000
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes; Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1 (bases 1 to 7561)
  AUTHORS   Yang,Z., Silva,A.J., Wolfe,J., Hirneisen,K., Ruelle,S., Torres,A., Williams-Hill,D., Kulka,M. and Hellberg,R.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-SEP-2020) Molecular Virology Team/Devision of Molecular Biology, OARSA/CFSAN/FDA, 8301 Muirkirk Rd., Laurel, MD 20708, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method :: CLC GWB v. 11
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7561
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="432-2"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="USA"
                     /collection_date="10-Apr-2017"
                     /note="genotype: GII.P4-GII.4"
     gene            <1..5094
                     /gene="ORF1"
     CDS             <1..5094
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QNS37114.1"
                     /translation="MASNDATAAAVANSNNDTAKSSSDGVLSSMAVTFKRALGARPKQ PPPREKPQRPPRPPTPELVKNIPPPPPNGEDEIVVTYSVKDGVSGLPDLSTVRQPEES NTAFSVPPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLA KVELAPLSLYWRPVYTPQYLISPDTLKKLSGETFPYTAFDNNCYAFCCWVLDLNDSWL SRRMIQRTTGFFRPYQDWNRKPLPTMDDSKIKKVANIFLCALSSLFTRPIKDIIGKIR PLNILNILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLA VELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKK EEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKS ASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREVAK RIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPL TLNCDRIENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPDVEKAK
RDFPGQPDMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASG LLHERLDEFELQGPALTTFNFDRNKVLAFRQLAAENKYGLMDTMRVGKQLKDVKTMPE LKQALKNVSIKKCQIVYNGCTYMLESDGKGNVKVDRIQSAAVQTNNELAGALHHLRCA RIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQDLWSKPQVENTEETTSKDGCPK PKDDEEFVISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKY SIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERVSLGLVTG SEIRKRNPDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSL FITSTHVIPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTV VTLLIKRSTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCG CPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILGPG SAPTLSTKTKFWRSSTASLPPGTYEPAYLGGKDPRVKGGPSLQQVMREQLKPFTEPRG KPPKPSVLEAAKKTIINVLEQTIDPPERWSFAQACASLDKTTSSGHPHHMRKNDCWNG ESFTGKLADQASKANLMFEEGKNMTPVYTAALKDELVKTDKIYGKIKKRLLWGSDLAT MIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFERHSRYTYHYDADYSRWDST QQRAVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCTSQ WNSIAHWLLTLCALSEVTNLSPDTIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEY GLKPTRPDKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNH GDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPM FRWMRFSDLSTWEGDRNLAPSFVNEDGVE" mat_peptide <1..984 /gene="ORF1" /product="p48" mat_peptide 985..2082 /gene="ORF1" /product="NTPase" mat_peptide 2083..2619 /gene="ORF1" /product="p22" mat_peptide 2620..3018 /gene="ORF1" /product="VPg" mat_peptide 3019..3561 /gene="ORF1" /product="Pro" mat_peptide 3562..5091 /gene="ORF1" /product="RdRp" gene 5075..6697 /gene="ORF2" CDS 5075..6697 /gene="ORF2" /note="major capsid protein" /codon_start=1 /product="VP1" /protein_id="QNS37115.1" /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGAPDFVGKIQGML TQTTRADGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG 
STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV" gene 6697..7500 /gene="ORF3" CDS 6697..7500 /gene="ORF3" /note="minor structural protein; small basic protein" /codon_start=1 /product="VP2" /protein_id="QNS37116.1" /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGVLPGRTNLRDAVPARGSSKSSN SSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPFMRGAHNIS FVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRA" ORIGIN 1 atggcgtcta acgacgctac cgctgccgct gttgctaaca gcaacaacga caccgcaaaa 61 tcttcaagtg acggagtgct ttctagcatg gctgtcactt ttaaacgagc cctcggggcg 121 cggcctaaac agcccccccc gagggaaaaa ccacaaagac ccccacgacc acccacacca 181 gaactggtta aaaatattcc ccccccccca cccaacggag aggatgaaat agtggttact 241 tatagtgtca aagatggtgt ttccggcttg cctgaccttt ccaccgtcag acagccggaa 301 gaatccaaca cggccttcag tgtccctcca ctcaatcaga gggagaacag agatgctaag 361 gaaccactca ctggaacaat tctggaaatg tgggacgggg aaatctacca ttatggcctg 421 tatgtggagc gaggtcttgt actaggcgta cacaaaccgc cagctgccat cagcctcgct 481 aaggttgagt tggcaccact ctccttatac tggaggcctg tgtacactcc tcagtacctc 541 atctctccag acactctcaa gaaattgtcc ggggaaacgt tcccctacac agcatttgac 601 aacaactgct atgccttttg ttgctgggtt ctggacctaa atgactcgtg gctgagcagg 661 agaatgatcc agaggacaac tggtttcttc aggccctacc aagactggaa taggaaaccc 721 cttcccacta tggatgactc caaaataaag aaggtggcca atatatttct gtgtgccctg 781 tcctcgctat tcaccaggcc cataaaagat ataataggga aaataaggcc tcttaacatc 841 ctcaacatct tggcctcatg tgattggacc tttgcgggca tagtagagtc cctgatactc 901 ttagcagagc tctttggagt tttctggaca cccccagatg tgtctgcaat gattgccccc 961 ttacttggtg actacgagct acaaggacct gaggaccttg cagtggagct cgtccccgtg 1021 gtgatggggg gaattggttt ggtgctagga ttcaccaaag agaaaattgg gaaaatgttg 1081 tcatcagctg cgtccacctt aagagcttgc aaagaccttg gtgcatatgg gctagagatc 1141 ctaaagctag tcatgaagtg gttcttcccg aagaaggagg 
aggcaaatga gctggctatg 1201 gtgaggtcca tcgaggatgc agtcctggat ctcgaagcaa ttgaaaacaa tcatatgacc 1261 accttgctta aagataaaga cagtctggca acctacatga gaacactcga tcttgaagag 1321 gagaaagcca ggaaactctc aaccaagtct gcctcacctg acatcgtggg cacaatcaac 1381 gccctcctgg cgagaatcgc tgccgcacgt tctctggtgc atcgagcgaa ggaggagctt 1441 tccagcagac caagacctgt ggtgttgatg atatcaggta ggccaggaat agggaagacc 1501 cacctcgcta gggaagtggc taagagaatc gcagcctccc ttacaggaga ccagcgtgtt 1561 ggtctcatcc cacgcaatgg cgtcgaccac tgggatgcgt ataaggggga gagggtcgtc 1621 ctatgggacg attatggaat gagcaaccct attcacgatg ccctcaggct gcaagaactc 1681 gctgacactt gccccctcac tctgaactgt gacaggattg aaaataaggg aaaggtcttt 1741 gacagcgatg tcatcattat caccactaat ctggccaacc cagccccact ggactatgtc 1801 aactttgaag catgctcgag gcgcattgac ttcctcgtat atgcagaagc ccctgatgtc 1861 gaaaaggcga agcgtgactt cccaggccag cctgacatgt ggaagaacgc tttcagttcc 1921 gatttctcac acataaaact agcactggcc ccacagggtg gtttcgacaa gaacgggaac 1981 accccacacg gaaagggcgt catgaagact ctcaccactg gctcccttat tgcccgggca 2041 tcagggctac tccatgagag gttagatgaa tttgaactgc agggcccagc tctcaccacc 2101 ttcaatttcg atcgcaataa agtgcttgcc tttagacagc ttgctgctga aaacaaatat 2161 ggactgatgg acacaatgag ggttgggaaa cagctcaagg atgtcaaaac catgccagaa 2221 ctcaaacaag cactcaagaa tgtctcaatc aagaagtgtc aaatagtgta taatggttgc 2281 acctacatgc ttgagtctga tggcaagggc aatgtgaaag ttgacaggat ccagagcgcc 2341 gccgtgcaga ccaacaatga gctggctggt gccctgcacc acttgaggtg cgccagaatc 2401 agatactatg tcaagtgtgt ccaggaagcc ctgtattcca tcattcaaat tgctggggct 2461 gcatttgtca ccacgcgcat tgccaagcgc atgaacatac aggatctatg gtccaagcca 2521 caagtggaaa atacagagga aactaccagc aaggacgggt gcccaaaacc caaggatgat 2581 gaggagtttg tcatttcatc cgacgacatc aaaactgagg gcaagaaagg gaagaacaag 2641 actggccgtg gcaagaagca cacagcattt tcaagcaaag gcctcagtga tgaagagtac 2701 gatgaataca agaggatcag agaagaaagg aatggcaagt actctataga agagtacctc 2761 caggacaggg acaaatatta tgaggaggtg gccattgcca gagcgactga ggaagacttc 2821 tgtgaagagg aggaagccaa gatccgacaa aggatcttta ggccaacaag 
gaaacaacgc 2881 aaggaggaaa gagtctctct cggtttagtc acgggctctg aaattaggaa aagaaaccca 2941 gatgatttca aacccaaagg gaaattgtgg gctgacgatg acaggagtgt ggactacaat 3001 gagaaactca gttttgaggc cccgccaagc atttggtcaa gaatagtcaa ctttggttca 3061 ggctggggat tctgggtctc ccccagcttg ttcataacat caacccatgt tataccccag 3121 ggcgcaaagg agttctttgg agtccccatc aaacaaatac aggtacacaa gtcaggcgag 3181 ttctgtcgct tgagattccc taaaccaatc aggactgatg tgacgggcat gatcttagaa 3241 gaaggcgcac ctgagggcac cgtggttaca ctactcatca aaaggtccac cggggaactt 3301 atgcccctag cagctaggat ggggacccat gcgaccatga agatccaagg gcgcactgtt 3361 gggggccaga tgggcatgct tctgacaggg tccaacgcca agagtatgga cctgggtact 3421 acaccaggtg attgtggctg cccctacatc tacaagagag gtaatgacta tgtggtcatt 3481 ggagtccaca cggccgccgc acgtgggggg aacactgtca tatgtgccac ccaggggagt 3541 gaaggagagg ctacacttga gggtggtgac aacaagggga cttactgtgg tgcaccaatc 3601 ctaggcccag ggagtgcccc aacacttagc accaagacca aattctggag atcgtccaca 3661 gcatcactcc cacctggcac ctatgaacca gcctatcttg gtggcaagga ccctagagtc 3721 aagggtggcc cttcactaca gcaagtcatg agggaacagt tgaagccatt cacagagccc 3781 agaggtaagc caccaaaacc aagtgtgtta gaagctgcca agaaaaccat catcaatgtt 3841 cttgagcaaa caattgatcc acctgagaga tggtcgttcg cacaagcttg cgcgtctctt 3901 gacaagacca cttccagtgg tcatccgcat cacatgcgga aaaacgactg ctggaacggg 3961 gagtccttca caggcaagct ggcagaccag gcttctaagg ccaacctgat gtttgaagaa 4021 ggaaagaaca tgaccccagt ctacacagct gcgctcaagg atgagctagt taaaactgac 4081 aaaatttatg gtaagatcaa gaagaggctt ctttggggct cggacttggc gaccatgatc 4141 cggtgtgctc gggcattcgg aggcctaatg gatgaactca aagcacactg tgtcacactt 4201 cccgttagag ttggcatgaa tatgaatgag gatggcccca tcatcttcga gaggcattcc 4261 aggtacacgt accactatga tgctgattac tctcgatggg attcaacaca acagagagcc 4321 gtgttggcgg cagctctaga aatcatggta aagttctccc cagaaccaca tttggcccag 4381 gtagtcgctg aagaccttct ttctcctagc gtggtggacg tgggcgactt cacaatatca 4441 atcaacgagg gtcttccctc tggggtgccc tgcacctccc aatggaactc catcgcccac 4501 tggcttctca ctctctgcgc gctctccgaa gtcacaaacc tgtctcctga caccatacag 
4561 gctaattctc tcttttcttt ttatggtgat gatgaaattg ttagcacaga cataaaattg 4621 gacccagaga aattgacagc aaaactcaaa gaatacgggt taaaaccaac ccgccctgac 4681 aaaactgaag gaccccttgt catctctgaa gacctgaatg gcctaacttt cctgcggaga 4741 actgtgaccc gcgacccagc tggttggttt ggaaaactgg agcagagttc aatactcagg 4801 caaatgtact ggactagggg tcccaaccat ggagacccat ctgaaacaat gattccacat 4861 tcccaaagac ccatacaatt gatgtcccta ctgggggagg ccgctctcca cggcccagca 4921 ttttacagta agattagcaa attggtcatt gcagagctaa aagaaggtgg tatggatttt 4981 tacgtgccca gacaagagcc aatgttcaga tggatgagat tctcagatct gagcacgtgg 5041 gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga gtgacgccaa 5101 cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg ttatggctct 5161 ggagcccgtt gttggtgccg ctattgcggc acctgtagcg ggccaacaaa atgtaattga 5221 cccctggatt agaaacaatt ttgtacaagc ccctggtgga gagtttacag tgtcccccag 5281 aaacgctcca ggtgaaatac tatggagcgc gcccttaggc cctgatttaa atccctacct 5341 atcccacttg gccagaatgt ataatggtta tgcaggtggt tttgaagtgc aggtgattct 5401 cgcggggaac gcgttcaccg ccgggaaggt catatttgca gcagtcccac caaattttcc 5461 aactgaaggc ttgagcccca gccaggtcac tatgttcccc catatagtag tggatgttag 5521 gcaactagaa cctgtgttga ttcccttacc cgatgttagg aataatttct accattacaa 5581 tcaatcaaat gaccccacca ttaagttgat agcaatgtta tatacaccac ttagggctaa 5641 taatgctggg gatgatgtct tcacagtttc ttgccgagtt ctcacgaggc catcccccga 5701 ttttgatttc atatttctag tgccacccac agttgagtct agaaccaaac cattttccgt 5761 cccagttttg actgttgagg agatgaccaa ttcaagattc cccattcctt tggaaaagtt 5821 gttcacgggc cccagtagtg cctttgttgt tcaaccacaa aacggtaggt gcacgactga 5881 tggcgtgctc ctaggcacca cccaattgtc tcctgtcaac atctgcacct tcagagggga 5941 tgtcacccac attacaggta gtcgtaacta cacaatgaat ttggcttctc aaaattggaa 6001 caattatgac ccaacagaag aaatcccagc ccctctagga gctccagatt ttgtggggaa 6061 gattcaaggc atgctcaccc aaaccacaag ggcagatggc tcaacacgcg gccacaaggc 6121 tacggtgtac actgggagcg ccgacttcgc tccaaaactg ggcagagttc aatttgaaac 6181 tgacacagac catgattttg aagctaacca aaacacaaag ttcactccag tcggtgtcat 6241 
ccaagatggc agcaccaccc accgaaatga accccaacag tgggtgctcc caagttactc 6301 aggcagaaat actcataatg ttcatctggc ccccgctgtg gcccccactt ttccgggtga 6361 acaacttctc ttctttagat ccaccatgcc cggatgtagc gggtatccca acatggattt 6421 ggactgtttg ctcccccagg aatgggtgca gtacttctac caagaggcag ccccagcaca 6481 atctgatgtg gccctgctaa gatttgtgaa tccagacaca ggtagggttt tgtttgaatg 6541 caaacttcat aaatcaggct atgttacagt ggctcacact ggccaacatg atttggtcat 6601 cccccccaat ggttatttta ggtttgattc ctgggtcaac cagttctaca cgcttgcccc 6661 catgggaaat ggaacggggc gtagacgcgt agtataatgg ctggagcttt ctttgctggg 6721 ttagcatctg atgtcctcgg ctctggactt ggttccctta tcaatgctgg ggctggggcc 6781 atcaaccaaa aagttgagtt tgaaaataac agaaaattgc aacaagcatc cttccaattt 6841 agcagcaatt tacaacaggc ttcctttcaa catgacaaag aaatgcttca agcacaaatt 6901 gaggccacta aaaagctaca acaggaaatg atgaaagtta agcaggcaat gctcctagag 6961 ggtgggttct ctgaaacaga tgcagcccgc ggggcaatca acgcccccat gacaaaagct 7021 ttggactgga gcgggacaag gtactgggct cctgatgcta ggactacaac ttataatgca 7081 ggccgctttt ccacccctca accatcgggg gtactaccag gaaggactaa tcttagggat 7141 gctgtccctg ctcggggttc ctctaaatct tctaactctt ctactgctac ttctgtgtac 7201 tcaaatcaaa ccacttcaac gagacttggt tctacagctg gttctggcac cagtgtctcg 7261 agtctcccgt caactgcaag gaccaggagc tgggttgagg atcaaagtag gaatttgtct 7321 cctttcatga ggggggccca caacatatcg tttgtcaccc caccatctag cagatcctct 7381 agccaaggca cagtctcaac cgtgcctaaa gaggttttgg actcctggac tggcgccttc 7441 aacacgcgca ggcagcctct cttcgctcac attcgtaaac gaggggagtc acgggcgtaa 7501 tgtgaaaaga caaaattgat tatctttctt tttctttagt gtcttttaaa aaaaaaaaaa 7561 a //
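The three ORF coordinates listed for MW009070 imply that the reading frames overlap slightly. As a minimal sketch (not part of the typing tool itself), the Python below parses the location strings as they appear in the record and computes each ORF's length and the size of each overlap. The helper names `parse_location`, `orf_length` and `overlap` are hypothetical, and the parser only handles simple `start..end` locations with an optional `<` partial marker.

```python
import re

# ORF coordinates copied from the record above (1-based, inclusive).
# The "<" on ORF1 is GenBank notation for a feature that is partial at its 5' end.
ORF_LOCATIONS = {
    "ORF1": "<1..5094",
    "ORF2": "5075..6697",
    "ORF3": "6697..7500",
}


def parse_location(text):
    """Parse a simple location such as '<1..5094' into (start, end, is_5prime_partial)."""
    match = re.fullmatch(r"(<?)(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unsupported location: {text!r}")
    return int(match.group(2)), int(match.group(3)), match.group(1) == "<"


def orf_length(start, end):
    """Length in nucleotides of an inclusive 1-based interval."""
    return end - start + 1


def overlap(a, b):
    """Number of bases shared by two inclusive (start, end) intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


spans = {name: parse_location(loc) for name, loc in ORF_LOCATIONS.items()}

for name, (start, end, partial) in spans.items():
    length = orf_length(start, end)
    note = " (5' partial)" if partial else ""
    # Codon count includes the stop codon where the CDS is complete at its 3' end.
    print(f"{name}: {length} nt, {length // 3} codons{note}")

print("ORF1/ORF2 overlap:", overlap(spans["ORF1"][:2], spans["ORF2"][:2]), "nt")
print("ORF2/ORF3 overlap:", overlap(spans["ORF2"][:2], spans["ORF3"][:2]), "nt")
```

With the coordinates above, ORF1 and ORF2 share 20 nt (5075..5094), and ORF2 and ORF3 share the single base at position 6697.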