Typing tool

Complete norovirus genomes

MT474033  GII.4 Sydney
 GII.P31

Length: 7,503 | 3 CDS

ORF1: 1..5094
ORF2: 5075..6697
ORF3: 6697..7503
LOCUS       MT474033                7503 bp    RNA     linear   VRL 15-MAY-2020
DEFINITION  Norovirus GII isolate JH12 nonstructural polyprotein (ORF1) gene,
            partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION   MT474033
VERSION     MT474033.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7503)
  AUTHORS   Hernandez,J.M., Silva,L.D., Souza Junior,E.C., Cardoso,J.F.,
            Reymao,T.K., Portela,A.C.R., Lima,C.P., Teixeira,D.M., Lucena,M.S.,
            Nunes,M.R. and Gabbay,Y.B.
  TITLE     Evolutionary and Molecular Analysis of Complete Genome Sequences of
            Norovirus from Brazil: Emerging Recombinant Strain GII.P16-GII.4
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7503)
  AUTHORS   Hernandez,J.M., Silva,L.D., Souza Junior,E.C., Cardoso,J.F.,
            Reymao,T.K., Portela,A.C.R., Lima,C.P., Teixeira,D.M., Lucena,M.S.,
            Nunes,M.R. and Gabbay,Y.B.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-MAY-2020) Virology, Evandro Chagas Institute, BR316,
            Ananindeua, PARA 67030000, Brazil
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: IDBA-UD v. 1
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7503
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /strain="Hu/GII.Pe_GII.4/JH12/2013/PA/BRA"
                     /isolate="JH12"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Brazil: Belem, PA"
                     /lat_lon="1.448822 S 48.473166 W"
                     /collection_date="19-Jul-2013"
                     /note="genotype: GII.PE/GII.4"
     gene            <1..5094
                     /gene="ORF1"
     CDS             <1..5094
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QJT97646.1"
                     /translation="MASNDAVVVAVGNSDNDIAKSSSDGVFSNMAVTFKRALGARPKQ
                     PPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSTKGGVSGLPELTTVRQPEET
                     NTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLA
                     KVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWL
                     SRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLK
                     PLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLA
                     VELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKK
                     EEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKS
                     ASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAK
                     KIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPL
                     TLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAK
                     RDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASG
                     LLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSD
                     LKQALKNIAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCA
                     RIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMTNKDGCLK
                     PKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKY
                     SIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTG
                     SEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSL
                     FITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTV
                     ATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCG
                     CPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILGPG
                     SAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRG
                     KPPRPNVLEAAKETIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNG
                     ESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDLAT
                     MIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDST
                     QQRDVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQ
                     WNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEY
                     GLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNH
                     EDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPM
                     FRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..984
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     985..2082
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2083..2619
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2620..3018
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3019..3561
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3562..5091
                     /gene="ORF1"
                     /product="RdRp"
     gene            5075..6697
                     /gene="ORF2"
     CDS             5075..6697
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QJT97647.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
     gene            6697..7503
                     /gene="ORF3"
     CDS             6697..7503
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QJT97648.1"
                     /translation="MAGAFFAGLASDILGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN      
        1 atggcgtcta acgacgctgt cgttgttgct gttggcaaca gcgacaacga catcgcaaaa
       61 tcttcaagtg acggtgtgtt ttctaacatg gctgtcactt ttaagcgggc cctcggggcg
      121 cggcctaaac agccgccccc gaaggagata ccacccagac ccccgcgacc acccacacca
      181 gaattggtca aaaagatccc tcctccccca cccaacgggg aggatgaact agtggtctct
      241 tacagcacca aaggtggcgt ttccggactg cctgagctca ccactgtcag acaaccggaa
      301 gaaaccaaca cggcgttcag tgtcccccca ctcaaccaaa gagagagcag ggacgccaag
      361 gagccactaa ctggaacaat tattgaaatg tgggatgggg aaatctacca ttacggcctg
      421 tatgtggaac gaggtcttat acttggtgtg cacaagccac cggcagccat cagccttgcc
      481 aaggttgagc taacaccgct ctctttgttc tggagacctg tgtacacccc ccagtatctc
      541 atctctccag acactcttag gagattacat ggagagtcat tcccctacac tgcatttgac
      601 aacaattgct acgccttttg ttgttgggta ttagacctaa acgactcatg gttaagcagg
      661 agaatgattc agagaacgac aggtttcttc aggccgtacc aggattggaa caggaaaccc
      721 ctccccacta tggatgattc caaattaaag aaggtagcca acatattctt gtgcactttg
      781 tcttcactat tcaccagacc cattaaggat ataataggga agttgaaacc tcttaacatc
      841 cttaacattc tggctacatg tgattggacc ttcgcaggca tagtggaatc tttaatactc
      901 ttagcagaac tctttggagt cttctggaca cccccagatg tgtctgcgat gatcgccccc
      961 ttgctaggtg attatgaact gcaaggacct gaggaccttg cagtggaact ggtcccaata
     1021 gtgatggggg ggataggttt ggtgctagga tttaccaaag agaaaattgg aaagatgcta
     1081 tcatccgctg catccacttt aagagcttgt aaagaccttg gtgcatacgg actggaaatc
     1141 ttaaaattgg tcatgaagtg gttcttccca aagaaagagg aagcaaatga actggctatg
     1201 gtgagatcca tcgaggacgc agtgctagac ctcgaggcaa ttgaaaacaa ccacatgacc
     1261 accctactca aagacaaaga cagcttggca acctacatga gaacccttga ccttgaggag
     1321 gagaaagcca gaaaactctc aaccaaatct gcttcacccg atattgtggg cacaatcaac
     1381 tctcttctgg caagaatcgc tgctgcacgc tccctagtgc atcgggcgaa agaagagctc
     1441 tccagcaggc cgagacctgt cgttgtgatg atatcgggaa gaccagggat agggaaaact
     1501 caccttgcca gggagctggc caagaagatc gcggcctccc tcacagggga ccagcgtgtg
     1561 ggtcttatcc cacgcaatgg tgtcgatcac tgggacgcat acaagggcga aagagttgtc
     1621 ctatgggacg actatggaat gagcaacccc atccatgatg ccctcaggtt gcaggagctt
     1681 gctgacactt gccccctcac gctaaattgt gacagaattg agaacaaagg gaaagtcttt
     1741 gacagtgatg ccataattat caccaccaat ctggccaacc cagcaccact ggattatgtc
     1801 aactttgaag cgtgctcgag acgtattgac ttcctcgtgt acgcagaagc ccctgaggta
     1861 gagaaggcaa agcgcgactt cccaggtcaa cctgacatgt ggaagaacgc tttcagtcct
     1921 gacttctcac acataaaact gtcattggct ccacagggtg gttttgacaa gaacggcaac
     1981 accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat cgcccgagca
     2041 tcagggttac tccatgagag gctagatgaa tatgaactgc aaggcccagc cctcaccact
     2101 ttcaactttg accgcaacaa gatacttgct tttagacagc ttgctgctga aaacaagtat
     2161 gggctgatgg acacaatgag agttggaaaa cagctcaagg atgtcaagac catgtcagac
     2221 ctcaaacagg cactcaagaa catcgcgatc aagaagtgcc agatagtgta caatggtagc
     2281 acctacacac ttgaggccga tggcaagggt agtgtgaaag ttgacaaagt gcaaagtgcc
     2341 actgtgcaga ccaacaatga actagccggt gccctacacc acctaaggtg cgctagaatc
     2401 agatactatg ttaagtgcgt ccaggaggca ctgtattcca tcatccaaat cgctggggct
     2461 gcattcgtca ccacgcgcat cgctaagcgc atgaatatac agaacctctg gtccaaacca
     2521 caggtggaag acacagaaga gatgaccaac aaggatggtt gcctaaaacc caaagatgat
     2581 gaagagtttg tcgtctcatc cgacgacatc aaaactgagg gcaagaaagg gaagaacaag
     2641 tccggccgtg gcaagaagca cacagccttt tcaagcaaag ggctcagtga tgaggaatac
     2701 gatgagtaca agagaataag agaagaaagg aatggtaagt attccataga agagtacctt
     2761 caggacagag acaggtacta cgaggaggtg gccattgcca gggcaaccga agaggacttc
     2821 tgtgaggaag aagaggccaa aatccggcag agaatcttca gaccaacaag gaaacaacgc
     2881 aaagaagaga gggcctctct cggcttggtc acaggctctg aaatcaggaa gagaaaccca
     2941 gaagacttca aacccaaggg aaagctgtgg gctgatgatg acagaagtgt tgactacaat
     3001 gagaaactca attttgaggc cccaccaagc atctggtcgc ggatagtcaa ctttggttca
     3061 ggctggggct tctgggtctc ccccagtctg tttataacat caacccatgt cataccccaa
     3121 ggtgcaaaag agttcttcgg agtccctatc aagcaaatcc agatacacaa atcaggtgaa
     3181 ttctgccgat tgagattccc aaagccaatc agaactgatg tgacgggcat gattctagaa
     3241 gaaggtgcgc ccgaggggac agtggccaca ctgctcatca agagaccaac tggagagctc
     3301 atgcctctgg cagccagaat ggggacccat gcaaccatga aaattcaggg gcgcactgtt
     3361 ggagggcaaa tgggtatgct cctgacaggg tccaacgcca agagtatgga cctaggcaca
     3421 acaccaggcg actgcggctg cccctacatc tacaagaggg ggaatgacta cgtggtcata
     3481 ggagtccata cggccgctgc ccgtggagga aacactgtca tatgtgccac ccaggggagt
     3541 gagggagaag ccacacttga aggaggtgac agtaaaggga catactgtgg cgcaccaatc
     3601 ttgggcccag ggagcgctcc gaagctcagt accaagacta agttttggag atcatccaca
     3661 acaccactcc cacctggcac ttacgaacca gcctacctcg gtggtaaaga ccccagagtt
     3721 aaaggtggcc cttcattgca acaagttatg agggaccagc taaagccatt cacagaaccc
     3781 agaggcaaac caccaagacc aaatgtgttg gaagctgcca aggaaaccat catcaatgtc
     3841 cttgagcaaa caattgatcc accccaaaaa tggtcatttg cgcaagcttg cgcatccctt
     3901 gacaaaacca cctccagcgg ccacccgcac cacatgcgga aaaacgattg ttggaatggg
     3961 gagtccttca cgggaaaatt ggctgatcaa gcctccaagg ccaacctaat gtttgaagag
     4021 ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttggt gaagaccgat
     4081 aaagtttatg gtaagatcaa gaagaggctt ctgtggggtt cagacctggc gaccatgata
     4141 cggtgcgccc gagcttttgg aggccttatg gatgaactta aggcgcactg tgtcacactt
     4201 cctgtcagag ttggtatgaa catgaatgag gacggcccca taatctttga gaagcactcc
     4261 agatatagat accactatga cgctgattat tcccggtggg actcaacaca acaaagggat
     4321 gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca cctggcccag
     4381 gtagttgcag aagacctcct ttcccctagc gtgatggatg taggtgactt tcaaatatca
     4441 ataagtgagg gtcttccctc tggggtacct tgtacctccc agtggaattc catcgcccac
     4501 tggctcctca ctctttgtgc actctctgaa gtcacggacc tgtcccctga catcattcag
     4561 gccaactccc ttttctcctt ctatggtgat gatgagattg taagcacaga cataaagttg
     4621 gacccagaga agctgacagc aaaactcaag gagtacgggc tgaaaccaac ccgccccgac
     4681 aaaactgagg gaccccttgt tatctctgaa gacctggatg gcctgacatt cctccggaga
     4741 actgtgaccc gtgatccggc tggctggttt ggaaaattgg aacaaagttc aattctcaga
     4801 caaatgtact ggaccagggg tcccaaccat gaagatccat ttgaaacaat gataccacac
     4861 tcccaaagac ccatacaatt gatgtccttg ctgggcgagg ctgcactcca cggcccggca
     4921 ttttatagca aaatcagcaa attagtcatt gcagagttga aggaaggtgg catggatttt
     4981 tacgtgccca gacaagagcc aatgttcaga tggatgagat tctcagatct gagcacgtgg
     5041 gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga gtgacgccaa
     5101 cccatctgat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg ttatggctct
     5161 ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa atgtaattga
     5221 cccctggatt agaaataatt ttgtacaagc ccctggtgga gagtttacag tatcccctag
     5281 aaacgctcca ggtgaaatac tatggagcgc gcccttgggc cctgatctaa atccctacct
     5341 atcccatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc aggtaattct
     5401 cgcggggaac gcgttcaccg ccgggaaggt catatttgca gcagtcccac caaattttcc
     5461 aactgaaggc ttgagcccca gccaggtcac tatgttcccc catatagtag tagatgttag
     5521 gcaactagaa cctgtgttga ttcctttacc cgatgttagg aataatttct atcattataa
     5581 tcaatcaaat gaccccacca ttaagttgat agcaatgttg tatacaccac ttagggctaa
     5641 taatgctggg gatgatgtct tcacagtttc ttgccgggtt ctcacgagac catcccccga
     5701 ttttgatttc atatttttag tgccacccac agttgaatca agaactaaac cattctctgt
     5761 cccagtttta actgttgagg agatgaccaa ttcaagattc cctattcctt tggaaaagtt
     5821 gttcacgggt cccagcagtg cctttgttgt ccaaccacaa aacggcaggt gcacgactga
     5881 tggcgtgctc ctaggcacca cccaactgtc tccagtcaac atctgcacct tcagaggaga
     5941 tgtcacccat atcacaggta gtcataacta cacaatgaat ttggcttctc aaaattggaa
     6001 caattatgac ccaacagaag aaatcccagc ccctctagga actccagact ttgtggggaa
     6061 gattcaaggc gtgctcaccc aaaccacaag gacagatggc tcaacacgcg gccacaaagc
     6121 cacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc aatttgaaac
     6181 tgacacaaac catgattttg aagccaacca aaacacaaag ttcaccccag tcggtgtcat
     6241 ccaagatggt agcaccaccc accgaaatga accccaacag tgggtgctcc caagttactc
     6301 aggcagaaac actcataatg tgcatctggc ccccgctgta gcccccactt ttccgggtga
     6361 gcaacttctc ttcttcagat ccaccatgcc cggatgcagc gggtacccca acatggattt
     6421 ggactgtctg ctcccccagg aatgggtgca gtacttctac caagaggcag ccccagcaca
     6481 atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggttt tgtttgagtg
     6541 taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg atttggttat
     6601 tccccccaat ggttatttta gatttgattc ctgggtcaac cagttctaca cgcttgcccc
     6661 catgggaaat ggaacggggc gtagacgtgc agtataatgg ctggagcttt ctttgctgga
     6721 ttggcatctg atatcctagg ctctggactt ggttccctta tcaatgctgg ggctggggcc
     6781 atcaaccaaa aagttgagtt tgaaaataac agaaaattgc aacaagcatc cttccaattt
     6841 agcagcaatc tacaacaggc ttcctttcaa catgacaaag agatgctcca agcacaaatt
     6901 gaggccacca aaaagctaca acaggaaatg atgaaagtta agcaggcaat gctcctagag
     6961 ggtgggttct ctgagacaga tgcagcccgc ggggcaatca acgcccccat gacaaaagct
     7021 ttggactgga gcgggacaag gtactgggct cccgatgcta ggactacaac atacaatgca
     7081 ggccgctttt ccacccctca accatcgggg gcactgccag gaagagctaa tcttagggat
     7141 gctgtccctg ctcggggttc ctccagtaaa tcttccaact cttctactgc tacttctgtg
     7201 tactcaaatc aaaccacttc aacgagactt ggttctacag ctggttctgg caccagtgtc
     7261 tcgagcctcc cgtcaactgc aaggaccagg agctgggttg aggatcaaag taggaatttg
     7321 tcacctttca tgaggggggc ccacaacata tcgtttgtca ccccaccatc tagcagatcc
     7381 tctagccaag gcacagtctc aaccgtgcct aaagaggttt tggactcctg gactggcgct
     7441 ttcaacacgc gcaggcagcc actcttcgct cacattcgta agcgagggga gtcacgggcg
     7501 taa
//