![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| MT238668 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P16 |
ORF1: 1..5093
ORF2: 5074..6696
ORF3: 6696..7502
LOCUS MT238668 7556 bp RNA linear VRL 01-MAY-2020
DEFINITION Norovirus GII isolate 782-3 nonstructural polyprotein (ORF1) gene,
partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION MT238668
VERSION MT238668.1
DBLINK BioProject: PRJNA604000
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7556)
AUTHORS Yang,Z., Williams-Hill,D., Wolfe,J., Silva,A.J., Hirneisen,K.,
Ruelle,S., Kulka,M. and Hellberg,R.
TITLE Direct Submission
JOURNAL Submitted (24-MAR-2020) Molecular Virology Team/Devision of
Molecular Biology, OARSA/CFSAN/FDA, 8301 Muirkirk Rd., Laurel, MD
20708, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: CLC Genomics Workbench v. 11
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7556
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="782-3"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="USA"
/collection_date="17-Dec-2018"
/note="genotype: GII.P16-GII.4"
gene <1..5093
/gene="ORF1"
CDS <1..5093
/gene="ORF1"
/codon_start=3
/product="nonstructural polyprotein"
/protein_id="QIQ09391.1"
/translation="ASNDATVAVACNNNNDKEKSSGEGLFTNMSFTLKKALGARPKQP
APRDEPQKPPRPPTPELVKRIPPPPPNGEGEEEPVIRYEVKSGISGLPELTTVPQPDV
ANTAFSVPPLSLRENREAKEPLTGAILEMWDGEIYHYGLYVEKGLVLGVHKPPAAISM
ARVELTPLSLYWRVVYTPQYLISPETLRRLNGEAFPYTAFDNNCYAFCCWVLDLNDSW
LSRRMVQRTTGFFRPYQEWNRKPLPTMDDSKIKKVANIFLCSLSTLFTRPIKDLIGKI
KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
AVELVPVVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
KEEANELAMVRAIEDAVLDLEAIENNHMTTLLKDKDSLATYMKTLDMEEEKARRLSTK
SASPDIVGTINALLARIAAARSLVHRAKEELSSRIRPVVVMISGKPGIGKTHLAREVA
RKVASTLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELADTCP
LTLNCDRIENKGKVFDSEVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA
KRDFPGQPDMWKDAFKPDFSHIKLQLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS
GLLHERMDEFELQGPTITTFNFDRNRITAFRQLAAENKYGLVDTMKVGNQLKGVKTME
ELKQAIRNVTIKRCRIIYGGSTYDLESDGKGKVLVEKVKNTSVQTNNELAGALHHLKH
ARIRYYVKCVQEAVYSIIQIAGAAFVTTRIARRMNIQELWSKPQLDQNESETKEEAPK
SEDDEFIISSKDIKEEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYS
IEEYLQDRDRYYEELAIAKATEEDFCEEEEIKIRQRIFRPTRKQRKEERATLGLVTGS
EIRKRNPDDFKPKGKLWADDNRSVDYNEKLDFEAPPSIWSRIVSFGSGWGFWVSPSLF
ITSTHVIPAGITEAFGVPIKQIQIHKSGEFCRFRFPRPIRPDVTGMILEEGAPEGTVA
TVLIKRPTGELMPLAARMGTHATMKIQGRMVGGQMGMLLTGSNAKGMDLGTTPGDCGC
PYIYKRGNDYIVIGVHTAAARGGNTVICATQGSEGEATLEGGYDKGTYCGAPILGPGG
APKLSTKTKFWRSSNTPLPPGTYEPAYLGGRDPRVKGGPSLQQVMRDQLKPFTEPRGK
PPRPSVLEAAKQTVINVLEQTLDPPQKWTYAQACASLDKTTSSGHPHHVRKNEFWNGE
TFTGKLADQASKANLMFEEGKHMTPVYTAALKDELVKTEKIYRKIKKRLLWGSDLSTM
IRCARSFGGLMDEMKAHCISLPVRVGMNVNEDGPIIFEKHSRYKYHYDADYSRWDSTQ
QRAVLAAALEIMVRFSAEPQLAQIVAEDLLAPSVVDVGDFKITINEGLPSGVPCTSQW
NSIAHWLLTLCALSEVTKLSPDIIQANSMFSFYGDDEIVSTDIKLDPEQLTAKLKEYG
LKPTRPDKTEGPLIISEDLNGLTFLRRTVTRDPAGWFGKLDQSSILRQMYWTRGPNHE
DPNETMIPHSQRPIQLMALLGEASLHGPSFYSRISKLVITELKEGGMDFYVPRQEPMF
RWMRFSDLSTWEGDRNLAPNFVNEDGVE"
mat_peptide <1..989
/gene="ORF1"
/product="p48"
mat_peptide 990..2087
/gene="ORF1"
/product="NTPase"
mat_peptide 2088..2618
/gene="ORF1"
/product="p22"
mat_peptide 2619..3017
/gene="ORF1"
/product="VPg"
mat_peptide 3018..3560
/gene="ORF1"
/product="Pro"
mat_peptide 3561..5090
/gene="ORF1"
/product="RdRp"
gene 5074..6696
/gene="ORF2"
CDS 5074..6696
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QIQ09392.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHIIVDVRQLEPVLIPLPD
VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGML
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRVV"
gene 6696..7502
/gene="ORF3"
CDS 6696..7502
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QIQ09393.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGPSNKSS
NSSTATSVYSNQTISTRLGSTAGSGTSVSSLPSTARTRSWVEDQNRNLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 tggcgtctaa cgacgctacc gttgccgttg cttgcaacaa caacaacgac aaggaaaaat
61 cttcaggtga aggcttattc acaaatatgt ctttcacctt aaagaaagcc ctcggggcta
121 ggcccaaaca gcctgccccg agagacgaac cacaaaagcc cccaagacca ccaacccccg
181 agttggtcaa gaggataccc cctcctccac ctaatggcga aggagaagaa gaaccagtca
241 ttaggtatga ggttaagagt gggatctctg gcctgcccga gctcacaaca gtcccccaac
301 cggacgtggc caacacagca ttcagtgttc caccactgag cttgagagaa aacagggagg
361 ccaaggaacc gctaacaggg gcaatattag agatgtggga tggagagata taccactatg
421 gcctgtacgt ggagaaaggc ttagtgttgg gtgtgcacaa accacctgca gccataagca
481 tggcaagagt ggagctgacg ccgctgtcat tgtactggcg tgtggtgtac actccccaat
541 acctcatctc ccctgaaact ctcaggaggc tcaacggaga ggcgttccct tacaccgcct
601 tcgacaacaa ctgctacgcc ttttgctgct gggtgttaga cctcaatgac tcatggctta
661 gcaggaggat ggtgcaaaga acaacgggct tcttcagacc ttaccaagag tggaacagaa
721 agcccctgcc taccatggat gactccaaaa ttaagaaggt agcaaatata ttcctatgtt
781 cattgtccac attattcacc agacccataa aagacctcat agggaaaatt aaaccattaa
841 acatattgaa catcctggca acgtgtgact ggacgtttgc cggaatagtg gagtctctga
901 tattacttgc tgaactcttc ggagttttct ggacgccccc agatgtgtct gctatgatcg
961 ctcccttact cggggactac gagttgcaag ggccagaaga cctcgccgtt gaactcgtac
1021 ctgtggtaat gggagggatt ggtttggtgt tgggattcac caaagagaaa attggcaaaa
1081 tgttgtcctc agcagcatca acactcaggg cttgcaaaga tcttggtgcc tatggcttag
1141 agatactcaa gttggtcatg aagtggttct tcccaaagaa agaggaggcc aatgagctag
1201 ccatggtgag ggccatagag gatgccgtgc tagatcttga ggcaatagaa aataaccaca
1261 tgacaaccct gttgaaagac aaagacagct tagcaacata catgaaaaca ctggacatgg
1321 aggaggagaa agccagaagg ttgtccacaa aatctgcatc ccctgacata gttgggacaa
1381 tcaacgccct gctggctcga atagcagcgg ccaggtcatt agtccacagg gccaaggaag
1441 agctatctag caggataagg ccagtagttg ttatgatatc tggcaaacca ggaataggca
1501 aaactcatct ggccagggag gtggcaagaa aggtggcatc cactctcaca ggggaccaaa
1561 gagtcggact cataccaaga aacggtgtgg accattggga tgcatacaaa ggtgagagag
1621 tcgtgctgtg ggacgactat ggcatgagta accccatcca tgatgctctt cgcatacaag
1681 aattggctga tacgtgtccc cttaccttaa attgtgacag aattgaaaat aagggaaaag
1741 tttttgacag tgaagtcata ataattacaa caaaccttgc caatccagcc ccacttgatt
1801 atgtcaactt tgaggcctgt tccaggagaa ttgatttcct ggtgtacgct gaggcaccag
1861 aagtagaaaa ggcaaaacgg gactttcctg gtcagccaga tatgtggaag gacgccttca
1921 agccggactt ttcacacatc aagctacagc ttgcacctca gggcggcttt gacaagaatg
1981 gcaacacccc acatgggaaa ggagtgatga agaccctcac taccggttct ctgattgccc
2041 gtgcatcagg cctactacat gagaggatgg atgaatttga actccaaggt cccacaatca
2101 ccaccttcaa tttcgaccga aacagaatca cagcattcag acaattggct gcagaaaaca
2161 agtatggatt ggtggatacc atgaaagttg gcaatcaatt aaaaggagtg aaaaccatgg
2221 aagaactcaa acaagcaatc agaaatgtga ccatcaagag gtgccggatc atctacggtg
2281 gctccacgta tgaccttgaa tctgatggca agggcaaagt tttggtggaa aaggtcaaga
2341 acacctctgt acagaccaac aacgagttgg ccggggccct gcaccatctc aaacacgccc
2401 gaatcaggta ctatgtcaaa tgtgtgcaag aagcagtcta ttccatcata caaattgccg
2461 gcgctgcgtt tgtcaccacg cgcattgcac gccgcatgaa catacaagaa ctctggtcga
2521 agccacaatt agatcaaaat gaatcagaga ctaaggaaga ggcccccaaa tcagaagatg
2581 acgagttcat catatcttct aaggacatca aggaggaagg aaagaagggc aaaaacaaaa
2641 ctggccgtgg caagaaacac actgcattct ccagcaaggg cttgagcgat gaggagtatg
2701 acgagtacaa gaggataaga gaagagagaa atgggaagta ctctatagag gagtatcttc
2761 aagacagaga caggtactat gaggagctcg ccattgccaa ggccacggaa gaagacttct
2821 gtgaagagga ggagataaaa atccgtcaga gaattttccg tcccaccagg aaacaaagaa
2881 aggaagagag ggccacatta ggactggtaa caggttcaga aatcagaaaa agaaaccctg
2941 atgacttcaa acccaaaggg aagctgtggg ccgatgacaa cagaagtgtt gactataatg
3001 agaaactgga ctttgaggcc cccccaagca tatggtctag gattgtgagc tttggttctg
3061 gctggggctt ctgggtatca ccaagcctgt tcataacatc aactcatgta atccccgcag
3121 gcataacaga agcatttgga gtccccatca aacaaattca gatccacaaa tcaggtgaat
3181 tttgccgatt cagattccca agaccaatta gaccagacgt gacaggaatg atcttggaag
3241 aaggtgcgcc tgaaggcacc gtggcaactg tgctcatcaa acgccccacc ggagagctca
3301 tgcctcttgc agccagaatg ggaacacacg caaccatgaa aattcaaggc cgcatggttg
3361 gcggacagat gggtatgttg ctcactggat caaatgctaa aggaatggat ttgggaacaa
3421 ctcctggtga ctgtggctgt ccttacatct acaaaagggg caatgactat atagtcattg
3481 gggtgcacac tgcagcagcc cgtggtggaa acaccgtcat ctgtgccaca cagggaagtg
3541 agggtgaggc aactcttgag ggtggatatg acaaaggaac atactgtggg gcacccattc
3601 taggccctgg gggtgcacca aagttgagca ccaaaaccaa attttggagg tcatcgaaca
3661 cgcccctccc accagggaca tatgagcctg cctacctcgg tggccgtgat ccgcgtgtta
3721 agggtgggcc ctccttgcag caggtaatga gagaccagtt gaagccattc actgaaccca
3781 ggggcaaacc tccaagacca agtgtattgg aagcagccaa acaaaccgtt atcaatgtcc
3841 tcgaacaaac cctggatcct ccacaaaaat ggacatacgc acaggcgtgt gcctcacttg
3901 acaaaaccac ttccagcggg catcctcatc acgtccgaaa gaatgaattc tggaatggtg
3961 agaccttcac cggcaaattg gcagaccaag catcaaaagc aaacctaatg tttgaggaag
4021 ggaaacacat gacaccagtg tatacagcag cactcaagga cgagctagtc aagactgaga
4081 aaatctatag aaagatcaag aagagactgc tctggggctc tgacttgtcc accatgatcc
4141 ggtgcgctag gtcatttggt gggctcatgg acgagatgaa ggcacactgc atatcactcc
4201 cagtacgagt tggcatgaat gtgaatgaag atggcccaat aatatttgag aaacattcca
4261 gatacaaata ccactatgac gcagactact ctcgttggga ttcaacacaa cagagggcag
4321 tactagcagc agccttggaa atcatggtca gattctctgc agaaccacaa ttggcacaaa
4381 tagtcgctga ggatcttctg gcccctagcg tagtagatgt aggagacttt aaaatcacta
4441 taaatgaagg gctcccatct ggtgtgccat gcacctccca atggaactcc atcgcacact
4501 ggctgctaac tctctgtgcc ttgtctgaag tcaccaaact gtcccctgac attatacaag
4561 caaattccat gttctcattt tacggtgatg acgagattgt cagcaccgac ataaaattgg
4621 accctgaaca gttaaccgcc aagttgaagg agtatggcct gaaaccaacc cgcccagaca
4681 agaccgaggg acccctgatc atcagtgaag atttgaacgg actcactttc ctccgaagga
4741 cggtgactcg tgacccagct ggctggtttg gaaaactgga ccaaagttca attttgaggc
4801 agatgtactg gactagagga ccaaatcacg aagatcccaa tgagacaatg ataccccatt
4861 ctcaaagacc catacagctc atggcactgc ttggtgaagc ctctcttcac ggaccctctt
4921 tctacagtag aatcagtaaa ttggtcataa ctgaactcaa agaaggtggg atggactttt
4981 acgtgccaag gcaggaaccc atgttcaggt ggatgaggtt ttctgacttg agcacgtggg
5041 agggcgatcg caatctggct cccaattttg tgaatgaaga tggcgtcgag tgacgccaac
5101 ccatctgatg ggtccgcagc caacctcgta ccagaggtca acaatgaggt tatggctttg
5161 gagcccgttg tcggtgccgc tattgcggca cctgtagcgg gccaacaaaa tgtaattgac
5221 ccctggatta gaaataattt tgtacaagcc cctggtgggg agtttacagt atcccccaga
5281 aacgctccag gtgaaatact atggagcgcg cccctaggcc ctgacctaaa tccctaccta
5341 tcccatttgg ccagaatgta caatggttat gcaggtggtt ttgaagtgca ggtaattctc
5401 gcggggaacg cgttcaccgc cgggaagatc atatttgcag cagtcccacc aaattttcca
5461 actgaaggct taagtcctag ccaggtcact atgttccccc atataatagt agatgttaga
5521 cagttagaac ctgtgctgat tcctttaccc gatgttagga ataatttcta tcattacaat
5581 cagtcaaatg actccactat taagttgata gcaatgttgt acacaccact tagggctaat
5641 aatgctgggg atgatgtttt cacagtttcg tgccgagttc tcacgagacc atcccccgat
5701 tttgatttca tatttttagt gccacccaca gttgagtcaa gaactaaacc attctctgtc
5761 ccagttttaa ctgttgagga gatgaccaat tcaagattcc ccatcccttt ggaaaagctg
5821 ttcacaggcc ccagcagtgc ctttgttgtt caaccacaaa acggcaggtg cacaactgat
5881 ggcgtgctcc taggcaccac ccaactttct cctgtcaaca tctgcacctt cagaggggat
5941 gtcacccaca tcacaggtag tcgcaactac acaatgaatt tggcttctca aaattggaac
6001 aactatgacc caacagaaga aatcccagcc cctctaggaa ctccagattt tgtggggaag
6061 attcaaggca tgctcaccca aaccacaagg acagatggtt caacacgcgg ccacaaagct
6121 acagtgtaca ctgggagcgc cgactttgct ccaaaactgg gtagagttca atttgaaact
6181 gacacagacc atgattttga agctaatcaa aacacaaagt tcaccccagt cggtgtcatc
6241 caagatggta gcaccaccca tcgaaacgaa ccccaacagt gggtgctccc aagttactca
6301 ggcagaaata ctcacaatgt acatctggcc cccgctgtag cccccacctt tccgggtgag
6361 caacttctct tcttcagatc caccatgccc ggatgcagcg ggtaccccaa catggatttg
6421 gactgtctgc tcccccagga atgggtgcag tacttctatc aagaggcagc cccagcacaa
6481 tctgatgtgg ctctgctaag atttgtgaat ccagacacag gtagggtttt gtttgagtgc
6541 aagcttcaca aatcaggcta tgttacagtg gctcacactg gccaacatga tttggttatc
6601 ccccccaatg gctactttag atttgattcc tgggtcaacc agttctatac gcttgccccc
6661 atgggaaatg gaacggggcg tagacgtgta gtataatggc tggagctttc tttgctggat
6721 tggcatctga tgtccttggc tctggacttg gttccctcat caatgctggg gctggggcca
6781 tcaaccaaaa agttgagttt gaaaataaca gaaaattgca acaagcatcc ttccaattta
6841 gcagcaatct gcaacaggct tcctttcaac atgataaaga gatgctccaa gcacaaattg
6901 aggccactaa aaagctacaa caggaaatga tgaaagttaa gcaggcaatg ctcctagagg
6961 gtgggttctc tgagacagat gcagcccgcg gggcaattaa cgcccccatg acaaaagctt
7021 tggactggag tgggacaagg tactgggctc ccgatgctag gactacaaca tacaatgcag
7081 gccgcttttc cacccctcaa ccatcggggg cactgccagg aagagctaat cttagggatg
7141 ctgtccctgc tcggggaccc tccaacaaat cttctaactc ttctactgcc acctctgtgt
7201 attcaaatca aactatttca acgagacttg gttctacagc tggttctgga accagtgtct
7261 cgagcctccc gtcaactgca aggactagga gctgggttga ggatcaaaat aggaatttgt
7321 cacctttcat gaggggggcc cacaacatat catttgtcac cccaccatct agcagatcct
7381 ctagccaagg cacagtctca accgtgccta aagagatttt ggactcctgg actggcgctt
7441 tcaacacgcg caggcagcct ctcttcgctc acattcgtaa gcgaggggag tcacgggtgt
7501 aatgtgaaaa gacaaaattg attatttttc tttttcttta gtgtctttta aaaaaa
//