![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| MG786781 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P31 |
ORF1: 1..5090
ORF2: 5071..6693
ORF3: 6693..7499
LOCUS MG786781 7535 bp RNA linear VRL 31-JUL-2018
DEFINITION Norovirus GII.4 strain Hu/GII.4/DBM15-156/2015/THA nonstructural
polyprotein gene, partial cds; and VP1 and VP2 genes, complete cds.
ACCESSION MG786781
VERSION MG786781.1
KEYWORDS .
SOURCE Norovirus GII.4
ORGANISM Norovirus GII.4
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7535)
AUTHORS Guntapong,R., Tacharoenmuang,R., Ruchusatsawat,K., Singchai,P.,
Upachai,S., Aukapaiboon Okada,P., Parnmen,S., Phumee,A.,
Motomura,K., Tatsumi,M., Takeda,N. and Sangkitporn,S.
TITLE Complete Genome Sequence of Human Norovirus GII.4 from the diarrhea
case in Thailand
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7535)
AUTHORS Guntapong,R., Tacharoenmuang,R., Ruchusatsawat,K., Singchai,P.,
Upachai,S., Aukapaiboon Okada,P., Parnmen,S., Phumee,A.,
Motomura,K., Tatsumi,M., Takeda,N. and Sangkitporn,S.
TITLE Direct Submission
JOURNAL Submitted (12-JAN-2018) Department of Medical Sciences, National
Institute of Health, Tivanon Road, Muang Nonthaburi, Nonthaburi
11000, Thailand
COMMENT ##Assembly-Data-START##
Assembly Method :: CLC Genomics Workbench v. 8.0.1
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7535
/organism="Norovirus GII.4"
/mol_type="genomic RNA"
/strain="Hu/GII.4/DBM15-156/2015/THA"
/isolate="DBM15-156"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:489821"
/geo_loc_name="Thailand"
/collection_date="29-Oct-2015"
/note="genotype: GII.Pe_GII.4_Sydney2012"
CDS <1..5090
/note="ORF1"
/codon_start=3
/product="nonstructural polyprotein"
/protein_id="AVC63694.1"
/translation="SNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPKQPP
PKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPEETNT
AFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLAKV
ELAPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWLSR
RMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLKPL
NILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVE
LVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEE
ANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSAS
PDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAKKI
AASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTL
NCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAKRD
FPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASGLL
HERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSDLK
QALKNIAIKKCQIVYNGGTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCARI
RYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEETANKDGCLKPK
DDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSI
EEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTGSE
IRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSLFI
TSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVAT
LLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCP
YIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPGSA
PKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRGKP
PRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNGES
FTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKVKKRLLWGSDLATMI
RCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDSTQQ
RDVLAAALEIMVKFSPEPHLAQIVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQWN
SIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEYGL
KPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHED
PFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFR
WMRFSDLSTWEGDRNLAPSFVNEDGVE"
CDS 5071..6693
/note="ORF2; major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="AVC63695.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSRNYTMNLASQNWSNYDPTEEIPAPLGTPDFVGKIQGML
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQDTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAL"
CDS 6693..7499
/note="ORF3; minor structural protein; minor capsid
protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="AVC63696.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKRLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDVAPARGSYSKPS
NSSTVTSVYSNQTISTRLGSTAGSGTSVSSSPSTARIRNWVEDQSRSLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEILDSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 cgtctaacga cgcttccgct gccgctgttg ccaacagcaa caacgacatc gcaaaatctt
61 caagtgacgg tgtgttttct aacatggctg tcacttttaa gcgggccctc ggggcgcggc
121 ctaaacagcc gcccccgaag gaaataccac ccagaccccc gcgaccaccc acaccagaat
181 tggtcaaaaa gatccctcct cccccaccca acggggagga tgaactagtg gtctcttaca
241 gcgccaaaga tggcgtttcc ggactgcctg agctcaccac tgtcagacaa ccggaagaaa
301 ccaatacggc gtttagtgtc cccccactca accaaaggga gagcagggac gccaaggagc
361 cactaactgg aacaatcatt gaaatgtggg atggagaaat ctaccattac ggcctgtatg
421 tggaacgagg tcttatactt ggtgtgcata agccaccggc agccattagc cttgccaagg
481 tcgagctagc accgctctct ttgttctgga gacctgtata caccccccag tatctcatct
541 ctccagacac tcttaggaga ttacatggag agtcattccc ctacactgca tttgataaca
601 attgctacgc cttttgttgt tgggtattag acctgaacga ctcatggcta agcaggagaa
661 tgattcagag aacaacaggc ttctttaggc cttaccagga ttggaacagg aaacccctcc
721 ccactatgga tgattccaaa ttaaagaagg tggccaacat attcttgtgc actttgtctt
781 cactattcac caggcccatt aaggacataa tagggaagtt gaaacctctt aacatcctta
841 acattctggc tacatgtgat tggaccttcg caggcatagt ggaatcctta atactcttgg
901 cagaactctt tggagttttc tggacacccc cagatgtgtc tgcgatgatc gcccccttgc
961 taggtgatta tgaactgcaa ggacctgagg accttgcagt ggaactggtc ccaatagtga
1021 tgggggggat aggtttggtg ctaggattca ccaaagagaa aataggaaag atgctatcat
1081 ccgctgcatc cactttaaga gcttgtaaag accttggtgc atacggattg gaaatcttaa
1141 aattggtcat gaaatggttc ttcccaaaga aagaggaagc aaatgaactg gctatggtga
1201 gatccatcga ggatgcagtg ctagaccttg aggcaattga aaacaaccac atgactaccc
1261 tactcaaaga caaagacagc ttggcaacct acatgagaac ccttgacctt gaggaggaga
1321 aagccagaaa gctctcaacc aaatctgctt cacccgatat tgtgggcaca atcaactctc
1381 ttctggcaag aatcgctgct gcacgttccc tagtgcatcg ggcgaaagaa gagctctcca
1441 gcaggccgag acctgtcgtt gtgatgatat cgggaagacc agggataggg aaaactcacc
1501 ttgccaggga gctggccaag aagatcgcgg cctccctcac aggggaccag cgtgtgggtc
1561 ttatcccacg caatggtgtc gaccactggg acgcatacaa gggcgaaaga gttgtcctat
1621 gggacgacta tggaatgagc aaccccatcc atgatgccct caggttgcag gagcttgctg
1681 acacttgccc cctcacgcta aattgtgaca gaattgagaa caaaggaaaa gtctttgaca
1741 gtgatgccat aattatcacc accaatctgg ccaacccagc accactggat tatgtcaact
1801 ttgaagcgtg ctcgagacgc attgatttcc tcgtgtacgc agaagcccct gaggtggaga
1861 aggcaaagcg cgacttccca ggtcaacctg acatgtggaa gaacgctttc agtcctgact
1921 tctcacacat aaaactgtca ttggctccac agggtggttt cgacaagaac ggcaacaccc
1981 cgcatggaaa aggggtcatg aagaccctca ccactggctc cctcatcgcc cgagcatcag
2041 ggttactcca tgagaggcta gatgaatatg aactgcaagg cccagccctc accactttca
2101 actttgaccg caacaagata cttgctttta gacagcttgc tgctgagaac aagtatgggc
2161 tgatggacac aatgagggtt ggaaaacagc tcaaggatgt caagaccatg tcagacctca
2221 aacaagcact caagaacatc gcgatcaaga agtgccagat agtgtacaat ggtggcacct
2281 acacacttga ggctgatggc aagggtagtg tgaaagttga caaagtgcaa agtgccactg
2341 tgcagaccaa caatgaacta gccggtgccc tacaccacct gaggtgcgct agaatcagat
2401 actatgttaa gtgcgtccag gaggcactgt attccatcat ccaaatcgct ggggctgcgt
2461 tcgtcaccac gcgcatcgct aagcgcatga atatacagaa tctctggtcc aagccacagg
2521 tggaagacac agaagagacg gccaacaaag atggttgcct aaagcccaaa gatgatgaag
2581 agtttgtcgt ctcatccgac gacatcaaaa ctgagggcaa gaaagggaag aacaagtccg
2641 gccgtggcaa gaagcacaca gccttttcaa gtaaagggct cagtgatgag gagtacgatg
2701 agtacaagag aatcagagaa gaaagaaatg gtaagtactc catagaagag taccttcagg
2761 acagagacaa gtactacgag gaggtggcca ttgccagggc aaccgaggag gacttctgtg
2821 aagaagaaga ggccaaaatc cggcagagaa ttttcagacc aacaaggaaa caacgtaaag
2881 aagagagggc ctctctcggc ttggtcacag gctctgagat caggaagaga aacccagaag
2941 acttcaaacc caagggaaag ctgtgggctg atgatgacag aagtgttgac tacaatgaga
3001 aactcaactt tgaggcccca ccaagcatct ggtcgcggat agtcaacttt ggttcaggct
3061 ggggcttctg ggtctccccc agtctgttta taacatcaac ccatgtcata ccccaaggtg
3121 caaaagagtt cttcggagtc cctatcaagc aaatccagat acacaagtca ggtgaattct
3181 gccgattgag attcccaaag ccaatcagaa ctgatgtgac gggcatgatt ctagaagaag
3241 gtgcgcccga ggggaccgtg gccacactgc tcatcaagag accaactgga gagctcatgc
3301 ctctggcagc cagaatgggg acccatgcaa ccatgaaaat tcaggggcgc acagttggag
3361 ggcaaatggg tatgctcctg acaggatcca acgccaagag tatggaccta ggcacaacac
3421 caggcgactg cggctgcccc tacatctaca agaggggaaa tgactatgtg gtcatagggg
3481 tccatacggc cgctgcccgt ggagggaaca ctgtcatatg tgccacccag gggagtgagg
3541 gagaagccac acttgaagga ggtgacagca aagggacata ctgtggcgca ccaatcttgg
3601 gcccagggag cgctccgaag ctcagtacca agactaagtt ttggagatca tccacaacac
3661 cactcccgcc tggcacctac gaaccagcct acctcggtgg caaagaccct agagtcaaag
3721 gtggcccttc attgcaacaa gttatgaggg accagctgaa gccattcaca gaacccagag
3781 gtaaaccacc aagaccaaat gtgttggaag ctgccaagaa aaccatcatc aatgtccttg
3841 agcaaacaat tgatccaccc caaaaatggt catttgcgca agcttgcgca tcccttgaca
3901 aaaccacctc cagcggccac ccgcaccaca tgcggaaaaa cgactgttgg aatggggagt
3961 ccttcacagg aaaattggct gatcaagcct ccaaggccaa tctaatgttt gaagagggaa
4021 agaacatgac tccagtctac acaggtgcac ttaaagatga gttggtgaag accgataaag
4081 tttatggtaa ggtcaagaag aggcttctgt ggggttcaga tctggcgacc atgatacggt
4141 gcgcccgagc ttttggaggc cttatggatg aactcaaggc acactgtgtc acacttcctg
4201 tcagagttgg catgaacatg aatgaggatg gccccatcat ctttgagaag cactccagat
4261 atagatacca ctatgatgct gattattccc ggtgggactc aacacaacaa agggatgtgt
4321 tagcagcagc actagaaatc atggttaagt tctctccaga gccacacctg gcccagatag
4381 ttgcagaaga cctcctttcc cctagcgtga tggatgtagg tgactttcaa atatcaataa
4441 gtgagggtct cccctctggg gtaccttgta cctcccagtg gaattccatc gcccactggc
4501 tcctcactct gtgcgcactc tctgaagtca cggacctgtc ccctgatatc attcaggcca
4561 actccctctt ctccttctat ggtgatgatg agattgtaag cacagacata aagttggacc
4621 cagagaagct gacagcaaaa ctcaaggagt acgggctgaa accaacccgc cccgacaaaa
4681 ctgaaggacc ccttgttatc tctgaagacc tggacggcct gacattcctc cggagaactg
4741 tgacccgtga tccagctggc tggtttggaa aattggaaca aagttcaatt ctcaggcaaa
4801 tgtactggac caggggtccc aaccatgaag atccatttga aacaatgata ccacactccc
4861 aaagacccat acaattgatg tccttgctgg gcgaggctgc actccacggc ccggcattct
4921 atagcaaaat tagcaaatta gtcattgcag agttgaagga aggtggcatg gacttttacg
4981 tacccagaca agagccaatg ttcagatgga tgagattctc agatctgagc acgtgggagg
5041 gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgagtga cgccaaccca
5101 tctgatgggt ccgcagccaa cctcgtccca gaggtcaaca atgaggttat ggctctggag
5161 cccgttgttg gtgccgccat tgcggcacct gtagcgggcc aacaaaatgt aattgacccc
5221 tggattagaa ataattttgt acaagcccct ggtggagagt ttacagtatc ccctagaaac
5281 gctccaggtg aaatactatg gagcgcgccc ctgggccctg atctaaatcc ctacctatcc
5341 catttggcca gaatgtacaa tggttatgca ggtggttttg aagtgcaggt aattctcgcg
5401 gggaacgcgt tcaccgccgg gaaggtcata tttgcagcag tcccaccaaa ttttccaact
5461 gagggcttga gccccagcca ggtcactatg ttcccccata tagtagtaga tgttaggcaa
5521 ttagaacctg tgttgattcc cttacccgat gttagaaata atttctatca ttacaatcaa
5581 tcaaatgatc ccaccattaa gttgatagca atgttgtaca caccacttag ggctaataat
5641 gctggggatg atgtcttcac agtttcttgc cgagttctca cgagaccatc ccccgatttt
5701 gatttcatat ttctagtgcc acccacagtt gaatcaagaa ctaaaccatt ctctgtccca
5761 gttttaactg ttgaggagat gaccaattca agattcccca tccctttgga aaagttgttc
5821 acgggtccca gcagtgcctt tgttgtccaa ccacaaaacg gtaggtgcac gactgacggc
5881 gtgctcctag gcaccaccca actgtctcct gtcaacatct gcaccttcag aggagatgtc
5941 acccatatca caggtagtcg caactacaca atgaatttgg cttctcaaaa ttggagcaat
6001 tatgacccaa cagaagaaat cccagcccct ctaggaactc cagattttgt ggggaagatt
6061 caaggcatgc tcacccaaac cacaaggaca gatggctcaa cacgcggcca caaagccaca
6121 gtgtacactg ggagcgccga ctttgctcca aaactgggta gagttcaatt tgaaactgac
6181 acagaccatg attttgaagc taaccaagac acaaagttca ccccagttgg tgtcatccaa
6241 gatggtagca ccacccaccg aaatgaaccc caacagtggg tgctcccaag ttactcaggc
6301 agaaatactc ctaatgtgca tctggccccc gctgtagccc ccacttttcc gggtgagcaa
6361 cttctcttct tcagatccac catgcccgga tgcagcgggt accccaacat ggatttggac
6421 tgtctgctcc cccaggaatg ggtgcagtac ttctaccaag aggcagcccc agcacaatct
6481 gatgtggctc tgctaagatt tgtgaatcca gacacaggta gggttttgtt tgagtgtaag
6541 cttcataaat caggctatgt cacagtggct cacactggcc agcatgattt ggttatcccc
6601 cccaatggtt attttaggtt tgattcctgg gtcaaccagt tttacacgct tgcccccatg
6661 ggaaatggaa cggggcgtag acgtgcacta taatggctgg agctttcttt gctggattgg
6721 catctgatgt ccttggctct gggcttggtt cccttatcaa tgctggggct ggggccatca
6781 accaaaaagt tgagtttgaa aataacagaa aattgcaaca agcatccttc caatttagca
6841 gcaatctaca acaggcttcc tttcagcatg acaaagagat gctccaagca caaattgagg
6901 ccaccaaaag gctacaacag gaaatgatga aagttaagca ggcaatgctc ctagagggtg
6961 ggttctctga gacagatgcg gcccgcgggg caatcaacgc ccccatgaca aaagctttgg
7021 actggagcgg gacaaggtac tgggctcccg atgctaggac tacaacatac aatgcaggcc
7081 gcttttccac ccctcagcca tcgggggcac tgccaggaag agctaatctt agggatgttg
7141 cccctgctcg gggttcctac agtaagcctt ctaattcttc cactgtcact tctgtgtact
7201 caaatcaaac tatttcaacg agacttggtt ctacagctgg ttctggtacc agtgtctcga
7261 gctccccgtc aactgcaagg attaggaact gggttgaaga tcaaagtagg agtttgtcac
7321 ccttcatgag gggggcccac aacatatcgt ttgtcacccc accatctagc agatcctcta
7381 gccaaggcac agtctcaacc gtgcctaaag agattttgga ctcctggact ggcgctttca
7441 acacgcgcag gcagccactc ttcgctcaca ttcgtaagcg aggggagtca cgggtgtaat
7501 gtgaaaagac aaaattgatt atctttcttt tcttt
//