![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| MT742777 | GII.4 Hong Kong | ||
|---|---|---|---|
| GII.P31 |
ORF1: 1..5070
ORF2: 5051..6673
ORF3: 6673..7476
LOCUS MT742777 7501 bp RNA linear VRL 30-DEC-2020
DEFINITION Norovirus GII isolate WT-NORO-0887 nonstructural polyprotein (ORF1)
gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
cds.
ACCESSION MT742777
VERSION MT742777.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7501)
AUTHORS Chan,M.C., Roy,S., Bonifacio,J., Zhang,L.-Y., Chhabra,P.,
Chan,J.C., Celma,C., Igoy,M.A., Lau,S.-L., Mohammad,K.N., Vinje,J.,
Vennema,H., Breuer,J., Koopmans,M. and deGraaf,M.
TITLE Detection of a New Norovirus GII.4 Variant, GII.4 Hong Kong, in
Asia and Europe, 2017-2019
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7501)
AUTHORS Roy,S., Williams,R.J., Tutill,H.J., Sheth,S., Celma,C., Allen,D.
and Breuer,J.
TITLE Direct Submission
JOURNAL Submitted (12-JUL-2020) Infection & Immunity, UCL, Cruciform
Building, Gower Street, London WC1E 6BT, United Kingdom
COMMENT ##Assembly-Data-START##
Assembly Method :: CLC Genomics Workbench v. 11.01
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7501
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="WT-NORO-0887"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="United Kingdom"
/collection_date="Mar-2019"
/note="genotype: GII.P31_GII.4_Hong_Kong_2019"
gene <1..5070
/gene="ORF1"
CDS <1..5070
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="QLG43138.1"
/translation="AAVAKSNNDIAKSSSDGVLSNMAVTFKRALGARPKQPPPKEIPP
RPPRPPTPELIKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVGQPDETNTAFSVPP
LNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKVELTPLS
LYWRPVYTPQYLISPDTLRRLHGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMISRT
TGFFRPYQDWNRKPLPTVDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPLNILNIL
ATCDWTFAGIVESLILMAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVVM
GGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEVNELAM
VRSIEDAVLDLEAIENNHMTALLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVGT
INALLARIAAARSLVHRAKEELSSRLRPVVVMISGKPGIGKTHLARELAKKIAASLNG
DQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRIE
NKGKVFDSDAIIITTNLTNPAPLDYVNFEACSRRIDFLVYAEAPDVEKAKRDFPGQPD
MWKNAFSPDFSHLKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLVARASGLLHERLDE
YELQGPTLTTFNFDRNKILAFRQLAAENKYGLVDTMRIGKQLKDVKTMPDLKQALKNV
SIKKCQIVYSGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARIRYYVKC
VQEALYSIIQIAGAAFVTTRITKRMNIQNLWSRPQVEDTEETANKDGCPKPKDDEEFV
VASDDVKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQD
RDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTKKQRKEERASLGLVTGSEIRKRNP
EDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHVI
PQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVATLLIKRP
TGELMPLAVRMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKRG
NDYVVIGVHTAAARGGNTVICATQGNEGEATLEGGDNKGTYCGAPILGPGSAPKLSTK
TKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKPPRPSVL
EAAKKTIVNVLEQTIDPPQKWSFAQACASLDKTTSSGYPHHVRKNDCWNGDSFTGKLA
DQASKANLMYEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDLATMIRCARAF
GGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQRDVLAA
ALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWNSIAHWL
LTLCALSEVTDLSPDIVQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGLRPTRPD
KTEGPLVISEDLDGLTFLRRTVTRDIAGWFGKLEQSSILRQMYWTKGPNHEDPSETMI
PHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFSD
LSTWEGDRNLAPSFVNEDGVE"
mat_peptide <1..960
/gene="ORF1"
/product="p48"
mat_peptide 961..2058
/gene="ORF1"
/product="NTPase"
mat_peptide 2059..2595
/gene="ORF1"
/product="p22"
mat_peptide 2596..2994
/gene="ORF1"
/product="VPg"
mat_peptide 2995..3537
/gene="ORF1"
/product="Pro"
mat_peptide 3538..5067
/gene="ORF1"
/product="RdRp"
gene 5051..6673
/gene="ORF2"
CDS 5051..6673
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QLG43139.1"
/translation="MKMASNDANPSDGSTANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNLIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLSPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKIIFAAVPPNFPTEGLSPSQVTMFPHVIVDVRQLEPVLIPLPD
VRNNFYHYNQSNDSTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFTVPVLTVEEMTNSRFPIPLEKLYTGPSAAFVVQPQNGRCTTDGVLLGTT
QLSAVNICTFRGDLTHIAGTRQFTMHLASPNWNNYDPTEEIPAPLGTPDFVGKIQGML
TQTTKTDGSTRGHKAMVSTGAADFAPKLGNIRFGTDTENDLQSGINTKFTPIGVVQDG
ENPHFSEPQQWVLPSYTGRTGHNVHLAPAVAPTYPGEQLLFFRSTMPGLSGYPNLTID
CLLPQEWVRHFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYITVAHTGPHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRVL"
gene 6673..7476
/gene="ORF3"
CDS 6673..7476
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QLG43140.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATQKLQQGMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARVTTYNAGRFSTSQPSGALPGRTNLRATLPTRGSSSVTS
ASSATSVYSNKTVSTRLGSSAGSGTSASSTPSTTRTRSWVEDQNRNLSPFMSGALNTS
FVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHVRRRGESRV"
ORIGIN
1 gccgctgttg ccaaaagcaa caacgacatt gcaaaatctt caagtgacgg tgtgttatct
61 aacatggctg tcacttttaa acgggccctc ggggcgcggc ctaaacagcc gcccccgaag
121 gaaataccac ccagaccccc acgtccaccc acaccagaat tgatcaaaaa gattcctccc
181 cccccaccca acggggagga tgaactggtg gtttcctaca gtgccaaaga tggcgtgtct
241 ggcctgcctg agctcactac tgtcggacaa ccagatgaga ccaacacggc gttcagtgtc
301 cccccgctta accagaggga gaatagggac gctaaggaac cactaactgg aacaatcatt
361 gaaatgtggg atggagaaat ataccattac ggcctgtacg tggaacgagg tcttatactt
421 ggtgtgcaca agccaccggc ggccattagc cttgccaagg tcgagctaac accactctct
481 ttgtactgga gacctgtata caccccccag tatctcatct ctccagacac tctcaggagg
541 ctgcatggag agacattccc ctacactgca tttgacaaca actgctacgc cttctgttgt
601 tgggtgttag acctaaacga ctcgtggcta agcaggagaa tgattagcag aacaacaggt
661 ttcttcaggc cataccagga ttggaacagg aaacccctcc ccactgtgga tgattccaaa
721 ttgaaaaagg tagccaacat attcttgtgc actttgtctt cactattcac cagacctatt
781 aaggacataa tagggaagct gaaacccctt aacatcctca atattctggc cacatgcgac
841 tggaccttcg caggcatagt ggagtcctta atactcatgg cagaactctt tggagttttc
901 tggacacccc cagatgtgtc tgcgatgatc gcccccttac taggtgatta tgaactgcaa
961 ggacctgagg accttgcagt agaactagtc ccagtggtaa tgggggggat aggtttggtg
1021 ctaggattta ccaaagagaa aattggaaag atgttgtcat ctgctgcatc cactttgaga
1081 gcttgcaaag accttggtgc atacggactg gaaattttaa aattggtcat gaaatggttc
1141 ttcccaaaga aagaggaagt aaatgaattg gctatggtga gatccatcga ggacgcagta
1201 ctggacctcg aggcaattga aaataaccac atgaccgcac tgctcaagga caaagacagt
1261 ctggcaacct atatgaggac cctcgacctt gaggaggaga aagccagaaa actctcaacc
1321 aaatctgctt cacccgacat tgtgggcaca atcaacgctc ttctggcaag gatcgccgct
1381 gcacgttccc tagtgcatcg ggcaaaagaa gagctctcta gcaggctgag acccgttgtt
1441 gtgatgatat cggggaagcc agggataggg aagactcacc tcgccagaga gctagccaag
1501 aagatcgcgg cctccctcaa tggggaccag cgtgtgggtc tcatcccgcg caatggcgtc
1561 gatcactggg atgcgtataa gggagaaaga gtagttcttt gggacgatta tggaatgagt
1621 aaccccatcc atgatgccct cagattgcag gaacttgctg acacctgtcc cctcacatta
1681 aattgtgata ggattgagaa caaagggaaa gtctttgaca gtgatgctat aattattacc
1741 accaacctga ccaacccagc accactggac tatgtcaatt ttgaagcgtg ttcgaggcgc
1801 attgacttcc tcgtgtacgc agaagctcct gatgtggaaa aggcaaagcg cgacttccca
1861 ggtcaacctg acatgtggaa gaacgctttc agccctgatt tctcgcacct gaaactgtca
1921 ttggctccgc agggtggttt tgataagaac ggcaacaccc cgcatggaaa aggcgtcatg
1981 aagaccctca ccactggctc cctcgtcgcc cgagcttcag gcttgctcca tgagaggcta
2041 gacgagtatg aactacaagg cccaactctc accactttca actttgaccg taacaaaatc
2101 cttgctttca ggcagcttgc cgccgaaaac aaatatgggc tggtggacac aatgaggatt
2161 ggaaaacagc tcaaggatgt caagaccatg ccagacctca aacaggcact caagaatgtc
2221 tctatcaaga agtgccagat agtgtacagt ggtggcacct acacacttga ggctgacggc
2281 aagggcagtg tgaaagttga caaagtgcaa agcgccactg tgcaaaccaa caatgaacta
2341 gctggtgccc tacaccacct aaggtgcgcc agaattaggt actatgtcaa gtgcgtccag
2401 gaggcactgt attccatcat ccaaattgct ggggctgcat ttgtcaccac gcgcatcact
2461 aagcgcatga acatacagaa tctctggtcc aggccacagg tggaagacac agaggagacg
2521 gctaacaaag atggttgccc aaaacctaaa gatgatgaag agtttgtcgt cgcatccgac
2581 gacgtcaaga ctgaaggcaa gaaagggaaa aacaagactg gccgtggtaa gaagcacaca
2641 gccttctcaa gcaaagggct cagtgatgag gagtacgatg agtacaagag aatcagagaa
2701 gaaaggaatg gcaagtattc catagaggag taccttcagg acagggacaa gtactatgag
2761 gaggtggcta tcgcaagggc aaccgaagag gacttctgtg aagaagaaga ggccaaaatc
2821 cggcagagaa tcttcagacc aacaaagaaa caacgcaaag aggagagggc ttctctcggt
2881 ttggtcacag gctctgaaat taggaagaga aacccagaag atttcaaacc caaggggaaa
2941 ctatgggctg acgatgacag aagtgttgac tacaatgaga aactcagctt tgaggcccca
3001 ccaagcatct ggtcacggat agtcaacttt ggctcaggtt ggggcttctg ggtctccccc
3061 agtctgttta taacctcaac ccatgtcata ccccaaggtg caaaagagtt ctttggagtc
3121 cccatcaaac aaatccaaat acacaaatca ggtgaattct gccgattgag gttcccaaaa
3181 ccaatcagaa ctgatgtgac gggcatgatt ctagaagaag gtgcgcccga aggaaccgtg
3241 gccacacttc tcatcaagag accaactgga gaactcatgc ctttggcagt gagaatggga
3301 acccatgcaa ccatgaagat tcaagggcgc acagttggag gacaaatggg catgctcttg
3361 acaggatcca acgctaagag tatggacctg ggcacaacac caggtgactg cggctgtccc
3421 tacatttaca agagggggaa tgattacgta gtcataggag tccacacggc cgctgcccgt
3481 ggaggaaaca ctgtcatatg tgctacccag gggaatgaag gagaggccac actcgaaggt
3541 ggcgacaata aaggaacata ctgtggtgca ccaatcttgg gcccagggag tgccccgaaa
3601 ctcagcacca agaccaagtt ttggagatca tccacaacac ctctcccacc tggcacctat
3661 gaaccagcct acctcggtgg caaggacccc agagtaaaag gtggtccctc gttgcaacaa
3721 gttatgaggg accagctaaa gccatttaca gaacccagag gtaaaccacc aaggccaagt
3781 gtgttggaag ctgccaagaa aaccattgtt aatgttcttg agcaaacaat tgacccaccc
3841 caaaaatggt cattcgcgca ggcttgcgcg tccctcgaca aaaccacttc cagcggttat
3901 ccgcaccatg tgcgaaagaa tgactgttgg aacggggatt ccttcacagg aaaattggca
3961 gaccaggcct ctaaggccaa tctaatgtat gaagagggga agaacatgac tccagtctac
4021 acaggtgcac ttaaggatga attggtaaag actgacaaaa tttatggtaa gatcaaaaag
4081 aggcttctgt ggggttcaga cctggcgacc atgatacggt gcgcccgagc ttttggaggc
4141 cttatggatg aactcaaggc acactgtgtc acccttcctg tcagagttgg catgaacatg
4201 aatgaggatg gccccatcat ctttgagaaa cactctagat acagatacca ctatgatgct
4261 gattattccc ggtgggactc aacacaacag agggatgtac tggcagcagc ccttgaaatc
4321 atggttaaat tctctccaga accacacctg gcccaggtag ttgcagaaga cctcctttct
4381 cccagtgtaa tggatgtggg tgactttcaa atatcaataa gcgagggact cccctccggg
4441 gtgccttgca cctcccaatg gaattccatt gcccattggc ttctcactct ttgtgcacta
4501 tctgaagtca cggacttatc ccctgatatt gttcaggcca attccctctt ctccttctat
4561 ggtgatgatg agattgtgag cacagacata aagttggacc cagagaagct gacggcaaag
4621 ctcaaggagt acgggctaag accaacccgc cccgacaaaa ctgaagggcc ccttgtcatc
4681 tctgaagatc tggatggcct gacattcctg cggagaactg tgacccgtga catagcaggc
4741 tggtttggca aattggaaca gagctcaatt ctcaggcaaa tgtactggac caagggcccc
4801 aaccatgaag atccatctga aacaatgata ccacactccc aaagacccat acaattgatg
4861 tctctactgg gcgaggctgc actccatggc ccggcattct atagcaaaat tagcaagttg
4921 gtcattgcag agttgaagga aggtggcatg gatttttacg tgcccagaca agagccaatg
4981 ttcagatgga tgagattctc agatctgagc acgtgggagg gcgatcgcaa tctggctccc
5041 agttttgtga atgaagatgg cgtcgaatga cgccaaccca tctgatgggt ccacagccaa
5101 cctcgtccca gaggtcaaca atgaggttat ggctttggag cctgttgtag gcgccgctat
5161 tgcggcacct gtggcgggcc aacaaaactt aattgacccc tggattagaa ataattttgt
5221 acaagcccct ggtggagagt tcacagtgtc ccccagaaac gctccaggtg aaatactatg
5281 gagcgcgcct ttgagccctg atttgaaccc ttatctttcc catttggcca gaatgtacaa
5341 tggttacgca ggtggttttg aggtgcaggt aatccttgcg gggaacgcgt tcaccgccgg
5401 aaaaatcata tttgcagcag tcccaccaaa tttcccaact gaaggcttga gccccagcca
5461 ggttactatg tttccccatg taatcgtaga tgttaggcaa ttagaacctg tgttgatccc
5521 cttacctgat gttaggaata atttctatca ttataatcaa tcaaatgatt ctactattaa
5581 attgatagca atgctgtata caccacttag ggctaataat gctggggatg atgtcttcac
5641 agtttcttgt cgggtcctta cgaggccctc ccctgatttt gatttcatat ttctggtacc
5701 accaacagtt gagtcaagaa ctaaaccttt tactgtccca gtcttaaccg ttgaggaaat
5761 gaccaattca agattcccca ttcccttgga gaagctgtat acgggtccca gtgctgcttt
5821 tgttgttcaa ccacaaaatg gcagatgcac gactgatggc gtgctcttgg gcactaccca
5881 gttgtctgct gtcaacatct gcacctttag aggagatctc acacacattg caggcactcg
5941 ccaattcaca atgcatttgg cctctccgaa ctggaacaac tatgacccaa cagaagaaat
6001 cccagccccc ctgggaaccc cagacttcgt ggggaagatt caaggcatgc tcacccaaac
6061 cacaaaaaca gatggctcga cccgcggcca caaagccatg gtgtccactg gggctgccga
6121 ctttgcccca aaattaggca acattcgatt cggcactgac acagaaaatg atctccaatc
6181 tggcataaac accaaattca ccccaatcgg cgttgtccaa gatggtgaaa acccccattt
6241 tagtgaaccc caacaatggg tgcttccaag ttacacaggt agaactggac ataatgtgca
6301 tttggcccct gctgttgccc ccacttaccc gggtgagcaa ctccttttct ttaggtccac
6361 catgcccgga ctcagcgggt accccaactt gaccatagac tgcttgctcc cccaggaatg
6421 ggtgcggcac ttctaccaag aagcagctcc agcacaatct gatgtggctt tgctaagatt
6481 tgtgaatcca gacacaggta gggttttgtt tgagtgcaag ctccataaat caggatatat
6541 tacagtagct cacactggtc ctcatgattt agttatcccc cccaatggtt attttagatt
6601 tgattcctgg gtcaaccagt tttacacact tgcccccatg ggaaatggag cggggcgcag
6661 acgtgtgtta taatggctgg agctttcttt gctggattgg catctgatgt ccttggctcc
6721 ggacttggtt ccttaatcaa tgctggggct ggagctatca atcaaaaagt tgaatttgaa
6781 aataatagaa aattgcaaca agcttctttc caatttagta gtaatctgca acaagcttcc
6841 tttcagcatg acaaggagat gctccaagca caaattgagg ccactcaaaa attgcaacaa
6901 ggtatgatga aggtcaagca ggcgatgctc ctggaaggtg gattctctga aacagatgca
6961 gcccgtgggg caatcaacgc ccccatgaca aaggccttgg attggagtgg aacaagatac
7021 tgggcacctg atgctagggt tacaacatat aatgcaggcc gcttttccac ctctcaacct
7081 tcgggggcat tgccaggaag gactaacctc agggctactc tccccactcg gggatcttct
7141 agtgtaactt ctgcttcttc tgctacttct gtgtattcaa ataaaacagt ttcaacgagg
7201 cttggttctt cagcaggttc tggcaccagc gcctcaagca ccccgtcaac cacaaggact
7261 aggagctggg ttgaggatca aaacaggaac ttgtcaccct tcatgagtgg ggctctcaac
7321 acatcattcg tcaccccacc atctagcaga tcctccagtc aaggcacagt ctcaaccgtg
7381 cctaaagaaa ttttggactc ctggactggc gctttcaaca cgcgcaggca gcctctcttc
7441 gctcatgttc gtaggcgagg ggagtcacgg gtgtaatgtg aaaagacaaa attgattatc
7501 t
//