Typing tool

Complete norovirus genomes

MT474049  GII.4 Sydney
 GII.P31

Length: 7,503 | 3 CDS

ORF1: 1..5094
ORF2: 5075..6697
ORF3: 6697..7503
LOCUS       MT474049                7503 bp    RNA     linear   VRL 15-MAY-2020
DEFINITION  Norovirus GII isolate JH38 nonstructural polyprotein (ORF1) gene,
            partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION   MT474049
VERSION     MT474049.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7503)
  AUTHORS   Hernandez,J.M., Silva,L.D., Souza Junior,E.C., Cardoso,J.F.,
            Reymao,T.K., Portela,A.C.R., Lima,C.P., Teixeira,D.M., Lucena,M.S.,
            Nunes,M.R. and Gabbay,Y.B.
  TITLE     Evolutionary and Molecular Analysis of Complete Genome Sequences of
            Norovirus from Brazil: Emerging Recombinant Strain GII.P16-GII.4
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7503)
  AUTHORS   Hernandez,J.M., Silva,L.D., Souza Junior,E.C., Cardoso,J.F.,
            Reymao,T.K., Portela,A.C.R., Lima,C.P., Teixeira,D.M., Lucena,M.S.,
            Nunes,M.R. and Gabbay,Y.B.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-MAY-2020) Virology, Evandro Chagas Institute, BR316,
            Ananindeua, PARA 67030000, Brazil
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: IDBA-UD v. 1
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7503
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /strain="Hu/GII.Pe_GII.4/JH38/2013/AM/BRA"
                     /isolate="JH38"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Brazil: Manaus, AM"
                     /lat_lon="3.1325 S 59.9896 W"
                     /collection_date="02-Apr-2013"
                     /note="genotype: GII.PE/GII.4"
     gene            <1..5094
                     /gene="ORF1"
     CDS             <1..5094
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QJT97685.1"
                     /translation="MASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPKQ
                     PPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAIDGVSGLPELTTVRQPEET
                     NTAFSVPPLNQRENRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLA
                     KVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWL
                     SRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLK
                     PLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLA
                     VELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKK
                     EEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKS
                     ASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAK
                     KIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPL
                     TLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAK
                     RDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASG
                     LLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSD
                     LKQALKNIAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCA
                     RIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMTNKDGCLK
                     PKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKY
                     SIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTG
                     SEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSL
                     FITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTV
                     ATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCG
                     CPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPG
                     SAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRG
                     KPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNG
                     ESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDLAT
                     MIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDST
                     QQRDVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQ
                     WNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEY
                     GLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNH
                     EDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPM
                     FRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..984
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     985..2082
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2083..2619
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2620..3018
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3019..3561
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3562..5091
                     /gene="ORF1"
                     /product="RdRp"
     gene            5075..6697
                     /gene="ORF2"
     CDS             5075..6697
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QJT97686.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
     gene            6697..7503
                     /gene="ORF3"
     CDS             6697..7503
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QJT97687.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS
                     NSSTATSVHSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTHRQPLFAHIRKRGESRA"
ORIGIN      
        1 atggcgtcta acgacgcttc cgctgccgct gttgccaaca gcaacaacga catcgcaaaa
       61 tcttcaagtg acggtgtgtt ttctaacatg gctgtcactt ttaagcgggc cctcggggcg
      121 cggcctaaac agccgccccc gaaggaaata ccacccagac ccccgcgacc acccacacca
      181 gaattggtca aaaagatccc tcctccccca cccaacgggg aggatgaact agtggtctct
      241 tacagcgcca tagatggcgt ttccggactg cctgagctca ccactgtcag acaaccggaa
      301 gaaaccaaca cggcgttcag tgtcccccca ctcaaccaaa gggagaacag ggacgccaag
      361 gagccactaa ctggaacaat tattgaaatg tgggatggag agatctacca ttacggcctg
      421 tacgtggaac gaggtcttat acttggtgtg cacaagccac cggcagccat cagccttgcc
      481 aaggtcgagc taacaccgct ctctttgttc tggagacctg tatacacccc ccagtatctc
      541 atctctccag acactcttag gagattacat ggagagtcat tcccctacac tgcatttgac
      601 aacaattgct acgccttttg ttgttgggta ttagacctaa acgactcatg gttaagcagg
      661 agaatgattc agagaacaac aggtttcttc aggccgtacc aggattggaa caggaaaccc
      721 ctccccacta tggatgattc caaattaaag aaggtagcca acatattctt gtgcactttg
      781 tcttcactat tcaccagacc cattaaggac ataataggga agttgaaacc tcttaacatc
      841 cttaacattc tggctacatg tgattggacc tttgcaggca tagtggaatc tttaatactc
      901 ttagcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat gatcgccccc
      961 ttgctaggtg attatgagct gcaaggacct gaggaccttg cagtggaact ggtcccaata
     1021 gtgatggggg ggataggttt ggtgctagga tttaccaaag agaaaattgg aaagatgcta
     1081 tcatccgctg catccacttt aagagcttgt aaagaccttg gtgcatacgg actggaaatc
     1141 ttaaaattgg tcatgaagtg gttcttccca aagaaagagg aagcaaatga actggctatg
     1201 gtgagatcca tcgaggatgc agtgctagac ctcgaggcaa ttgaaaacaa ccacatgacc
     1261 accctactca aagacaaaga cagcttggca acctacatga gaacccttga ccttgaggag
     1321 gagaaagcca gaaaactctc aaccaaatct gcttcacccg acattgtggg cacaatcaac
     1381 tctcttctgg caagaatcgc tgctgcacgc tccctagtgc atcgggcgaa agaagagctc
     1441 tccagcaggc cgagacctgt cgttgtgatg atatcgggaa gaccagggat agggaaaact
     1501 caccttgcca gggagctggc caagaagatc gcggcctccc tcacagggga ccagcgtgtg
     1561 ggtcttatcc cacgcaatgg tgtcgatcac tgggacgcat acaagggcga aagagttgtc
     1621 ctatgggacg actatggaat gagcaacccc atccatgatg ccctcaggtt gcaggagctt
     1681 gctgacactt gccccctcac gctaaattgt gacagaattg agaataaagg gaaagtcttt
     1741 gacagtgatg ctataattat caccaccaat ctggccaacc cagcaccact ggattatgtc
     1801 aactttgaag cgtgctcgag acgtattgac ttcctcgtgt acgcagaagc ccctgaggtg
     1861 gagaaggcaa agcgcgactt cccaggtcaa cctgacatgt ggaagaacgc tttcagtcct
     1921 gacttctcac acataaaact gtcattggct ccacagggtg gttttgacaa gaacggcaac
     1981 accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat cgcccgagca
     2041 tcagggttac tccatgagag gctagatgaa tatgaactgc aaggcccagc cctcaccact
     2101 ttcaactttg accgcaacaa gatacttgct tttagacagc ttgctgctga aaacaagtat
     2161 gggctgatgg acacaatgag agttggaaaa cagctcaagg atgtcaaaac catgtcagac
     2221 ctcaaacaag cactcaagaa catcgcgatc aagaagtgcc agatagtgta caatggtagc
     2281 acctacacac ttgaggccga tggcaagggt agtgtgaaag ttgacaaagt gcaaagtgcc
     2341 actgtgcaga ccaacaatga actagccggt gccctacacc acctaaggtg cgctagaatc
     2401 agatactatg ttaagtgcgt ccaggaggca ctgtattcca tcatccaaat cgctggggct
     2461 gcattcgtca ccacgcgcat cgccaagcgc atgaatatac agaatctctg gtccaagcca
     2521 caggtggaag acacagaaga gatgaccaac aaggatggtt gcctaaaacc caaagatgat
     2581 gaagagtttg tcgtctcatc cgacgacatc aaaactgagg gcaagaaagg gaagaacaag
     2641 tccggccgtg gcaagaagca cacagccttt tcaagcaaag ggctcagtga tgaggagtac
     2701 gatgagtaca agagaatcag agaagaaagg aatggcaagt actccataga agagtacctt
     2761 caggacagag acaggtacta cgaggaggtg gccattgcca gggcaaccga agaggacttc
     2821 tgtgaagaag aagaggccaa aatccggcag agaattttca gaccaacaag gaaacaacgc
     2881 aaagaagaga gggcctctct cggcttggtc acaggctctg aaatcaggaa gagaaaccca
     2941 gaagacttca aacccaaggg aaagctgtgg gctgatgatg acagaagtgt tgactacaat
     3001 gagaaactca actttgaggc cccaccaagc atctggtcgc ggatagtcaa ctttggttca
     3061 ggctggggct tctgggtctc ccctagtctg tttataacat caacccatgt cataccccaa
     3121 ggtgcaaaag agttcttcgg agtccctatc aagcaaatcc agatacacaa atcaggtgaa
     3181 ttctgccggc tgagattccc aaagccaatc agaactgatg tgacgggcat gattctagaa
     3241 gaaggtgcgc ccgaggggac cgtggccaca ctgctcatca agagaccaac tggagagctc
     3301 atgcctctgg cagccagaat ggggacccat gcaaccatga aaattcaggg gcgcacagtt
     3361 ggagggcaaa tgggtatgct cctgacagga tccaacgcca agagtatgga cctaggcaca
     3421 acaccaggcg attgcggctg cccctacatc tacaagaggg ggaatgacta cgtggtcata
     3481 ggagtccata cggccgctgc ccgtggagga aacactgtca tatgtgccac ccaggggagt
     3541 gagggagaag ccacacttga aggaggtgac agtaaaggga catactgtgg tgcaccaatc
     3601 ttgggcccag ggagcgctcc gaagctcagt accaagacta agttttggag atcatccaca
     3661 acaccgctcc cacctggcac ctacgaacca gcctacctcg gtggcaaaga ccccagagtc
     3721 aaaggtggcc cttcattgca acaagttatg agggaccagc taaagccatt cacagaaccc
     3781 agaggcaaac caccaagacc aaatgtgttg gaagctgcca agaaaaccat catcaatgtc
     3841 cttgagcaga caattgatcc accccaaaaa tggtcatttg cgcaagcttg cgcatccctt
     3901 gacaaaacca cctccagcgg ccacccgcac cacatgcgga aaaacgattg ttggaatggg
     3961 gagtccttca caggaaaatt ggctgatcaa gcctccaagg ccaacctaat gtttgaagag
     4021 ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttggt aaagaccgat
     4081 aaagtttatg gtaagatcaa gaagaggctt ctgtggggtt cagacctggc gaccatgata
     4141 cggtgcgccc gagcttttgg aggccttatg gatgaactca aggcgcactg tgtcacactt
     4201 cctgtcagag ttggtatgaa catgaatgag gatggcccca tcatctttga gaagcactcc
     4261 agatatagat atcactatga tgctgattat tcccggtggg actcaacaca acaaagggat
     4321 gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca cctggcccag
     4381 gtagttgcag aagacctcct ttcccctagc gtaatggatg taggtgactt tcaaatatca
     4441 ataagtgagg gtcttccctc cggggtacct tgtacctccc agtggaattc tatcgcccac
     4501 tggctcctca ctctttgtgc actctctgaa gtcacggacc tgtcccctga catcattcag
     4561 gccaactccc ttttctcctt ctatggtgat gatgagattg taagcacaga cataaagttg
     4621 gacccagaga agctgacagc aaaactcaag gagtacgggc tgaaaccaac ccgccccgac
     4681 aaaactgaag gaccccttgt tatctctgaa gacctggatg gcctgacgtt cctccggaga
     4741 actgtgaccc gtgatccagc tggctggttt ggaaaattgg aacaaagttc aattcttaga
     4801 caaatgtact ggaccagggg tcccaaccat gaagatccat ttgaaacaat gataccacac
     4861 tcccaaagac ccatacaatt gatgtccttg ctgggcgagg ctgcactcca cggcccggca
     4921 ttttatagca aaattagcaa attagtcatt gcagagttga aggaaggtgg catggatttt
     4981 tacgtgccca gacaagagcc aatgttcaga tggatgagat tctcagatct gagcacgtgg
     5041 gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga gtgacgccaa
     5101 cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg ttatggctct
     5161 ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa atgtaattga
     5221 cccctggatt agaaacaatt ttgtacaagc ccctggtgga gagtttacag tatcccctag
     5281 aaacgctcca ggtgaaatac tatggagcgc gcccctgggc cctgatctaa atccctacct
     5341 atcccatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc aggtaattct
     5401 cgcggggaac gcgttcaccg ccgggaaggt catatttgca gcagtcccac caaattttcc
     5461 aactgaaggc ttaagcccca gccaggtcac tatgttcccc catatagtag tagatgttag
     5521 gcaactagaa cctgtgttga ttcccttacc cgatgttagg aataattttt atcattataa
     5581 tcaatcaaat gaccccacca ttaagttgat agcaatgttg tatacaccac ttagggctaa
     5641 taatgctggg gatgatgtct tcacagtttc ttgccgagtt ctcacgagac catcccccga
     5701 ttttgatttc atatttctag tgccacccac agttgagtca agaactaaac cattctctgt
     5761 cccagtttta actgttgagg agatgaccaa ttcaagattc cccattcctt tggaaaagtt
     5821 gttcacgggt cccagcagtg cctttgttgt ccaaccacaa aacggcaggt gcacgactga
     5881 tggcgtgctc ctaggcacca cccaactgtc tcctgtcaac atctgcacct tcagaggaga
     5941 tgtcacccat atcacaggta gtcgtaacta cacaatgaat ttggcttctc aaaattggaa
     6001 caattatgac ccaacagaag aaatcccagc ccctctagga actccagact ttgtggggaa
     6061 gattcaaggc gtactcaccc aaaccacaag gacagatggc tcaacacgcg gccacaaagc
     6121 cacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc aatttgaaac
     6181 tgacacagac catgattttg aagctaacca aaacacaaag ttcaccccag tcggtgtcat
     6241 ccaagatggt agcaccaccc accgaaatga accccaacag tgggtgctcc caagttactc
     6301 aggcagaaat actcataatg tgcatctggc ccccgctgta gcccccactt ttccgggtga
     6361 gcaacttctc ttcttcagat ccaccatgcc cggatgcagc gggtacccca acatggattt
     6421 ggactgtcta ctcccccagg aatgggtgca gtacttctac caagaggcag ccccggcaca
     6481 atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggttt tgtttgagtg
     6541 taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg atttggttat
     6601 tccccccaat ggttatttta gatttgactc ctgggtcaac cagttctaca cgcttgcccc
     6661 catgggaaat ggaacggggc gtagacgtgc agtataatgg ctggagcttt ctttgctgga
     6721 ttggcatctg atgtccttgg ctctggactt ggttccctta tcaatgctgg ggctggggcc
     6781 atcaaccaaa aagttgagtt tgaaaataac agaaaattgc aacaagcatc cttccaattt
     6841 agcagcaatc tacaacaggc ttcctttcaa catgacaaag agatgctcca agcacaaatt
     6901 gaggccacca aaaagctaca acaggaaatg atgaaagtta agcaggcaat gctcctagag
     6961 ggtgggttct ctgagacaga tgcagcccgc ggggcaatca acgcccccat gacaaaagct
     7021 ttggactgga gcgggacaag gtactgggct cccgatgcta ggaccacaac atacaatgca
     7081 ggccgctttt ccacccctca accatcgggg gcactgccag gaagagctaa tctcagggat
     7141 gctgtccctg ctcggggttc ctccagtaaa tcttctaact cttctactgc tacttctgtg
     7201 cactcaaatc aaaccacttc aacgagactt ggttctacag ctggttctgg taccagtgtc
     7261 tcgagcctcc cgtcaactgc aaggactagg agctgggtag aggatcaaag taggaatttg
     7321 tcacctttca tgaggggggc ccacaacata tcgtttgtca ccccaccatc tagcagatcc
     7381 tctagccaag gcacagtctc aaccgtgcct aaagaggttt tggactcctg gactggcgct
     7441 ttcaacacgc acaggcagcc actcttcgct cacattcgta agcgagggga gtcacgggcg
     7501 taa
//