Typing tool

Complete norovirus genomes

MT474042  GII.4 Sydney
 GII.P31

Length: 7,503 | 3 CDS

ORF1: 1..5094
ORF2: 5075..6697
ORF3: 6697..7503
LOCUS       MT474042                7503 bp    RNA     linear   VRL 15-MAY-2020
DEFINITION  Norovirus GII isolate JH22 nonstructural polyprotein (ORF1) gene,
            partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete cds.
ACCESSION   MT474042
VERSION     MT474042.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7503)
  AUTHORS   Hernandez,J.M., Silva,L.D., Souza Junior,E.C., Cardoso,J.F.,
            Reymao,T.K., Portela,A.C.R., Lima,C.P., Teixeira,D.M., Lucena,M.S.,
            Nunes,M.R. and Gabbay,Y.B.
  TITLE     Evolutionary and Molecular Analysis of Complete Genome Sequences of
            Norovirus from Brazil: Emerging Recombinant Strain GII.P16-GII.4
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7503)
  AUTHORS   Hernandez,J.M., Silva,L.D., Souza Junior,E.C., Cardoso,J.F.,
            Reymao,T.K., Portela,A.C.R., Lima,C.P., Teixeira,D.M., Lucena,M.S.,
            Nunes,M.R. and Gabbay,Y.B.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-MAY-2020) Virology, Evandro Chagas Institute, BR316,
            Ananindeua, PARA 67030000, Brazil
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: IDBA-UD v. 1
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7503
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /strain="Hu/GII.Pe_GII.4/JH22/2015/PA/BRA"
                     /isolate="JH22"
                     /isolation_source="stool"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Brazil: Belem, PA"
                     /lat_lon="1.448822 S 48.473166 W"
                     /collection_date="06-Mar-2015"
                     /note="genotype: GII.PE/GII.4"
     gene            <1..5094
                     /gene="ORF1"
     CDS             <1..5094
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="QJT97667.1"
                     /translation="MASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPKQ
                     PPPKEIPPRPPRPPTPELVRKIPPPPPNGEDEPVVSYSTKGGVSGLPELTTVRQPEET
                     NTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISLA
                     KVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSWL
                     SRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKLK
                     PLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLA
                     VELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKK
                     EEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKS
                     ASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELAK
                     KIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPL
                     TLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKAK
                     RDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARASG
                     LLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMSD
                     LKQTLKNIAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRCA
                     RIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMTNKDGCLK
                     PKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKY
                     SIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLVTG
                     SEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSPSL
                     FITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTV
                     ATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCG
                     CPYIYKRGNDYVVIGVHTAAARGGNTVICATQGGEGEATLEGGDSKGTYCGAPILGPG
                     SAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEPRG
                     KPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCWNG
                     ESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDLAT
                     MIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWDST
                     QQRDVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCTSQ
                     WNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEY
                     GLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSIIRQMYWTRGPNH
                     EDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPM
                     FRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..984
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     985..2082
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2083..2619
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2620..3018
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3019..3561
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3562..5091
                     /gene="ORF1"
                     /product="RdRp"
     gene            5075..6697
                     /gene="ORF2"
     CDS             5075..6697
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QJT97668.1"
                     /translation="MKMASSDANPSNGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSHNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTNHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
     gene            6697..7503
                     /gene="ORF3"
     CDS             6697..7503
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QJT97669.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN      
        1 atggcgtcta acgacgcttc cgctgccgct gttgccaaca gcaacaacga catcgcaaaa
       61 tcttcaagtg acggtgtgtt ttctaacatg gctgtcactt ttaagcgggc cctcggggcg
      121 cggcctaaac agccgccccc gaaggaaata ccacccagac ccccgcgacc acccacacca
      181 gaattggtca gaaagatccc tcctccccca cccaacgggg aggatgaacc agtggtctct
      241 tacagcacca aaggtggcgt ttccggactg cctgagctca ccactgtcag acaaccggaa
      301 gaaaccaaca cggcgttcag tgtcccccca ctcaaccaaa gggagagcag ggacgccaag
      361 gagccactaa ctggaacaat cattgagatg tgggatggag aaatctacca ttacggcctg
      421 tatgtggaac gaggtcttat acttggtgtg cacaagccac cggcagccat cagccttgcc
      481 aaggtcgagc taacaccgct ctctttgttc tggagacctg tatacacccc ccagtatctc
      541 atctctccag acactcttag gagattacat ggagagtcat tcccctacac tgcatttgac
      601 aacaactgct acgccttttg ttgttgggta ttagacctaa acgactcatg gttaagcagg
      661 agaatgattc agagaacgac aggtttcttc aggccgtacc aggattggaa caggaaaccc
      721 ctccccacta tggatgattc caaattaaag aaggtagcca acatattctt gtgcactttg
      781 tcttcactat tcaccagacc cattaaggat ataataggga agttgaaacc tcttaacatc
      841 cttaacattc tggctacatg tgattggacc ttcgcaggca tagtggaatc tttaatactc
      901 ttagcagaac tctttggagt tttctggaca cccccagatg tgtctgcgat gatcgccccc
      961 ttactaggtg attatgaact gcaaggacct gaggaccttg cagtggaact ggtcccaata
     1021 gtgatggggg ggataggttt ggtgctagga tttaccaaag agaaaattgg aaagatgctg
     1081 tcatccgctg catccacttt aagagcttgc aaagaccttg gtgcatacgg actggaaatc
     1141 ttaaaattgg tcatgaagtg gttcttccca aagaaagagg aagcaaatga actggctatg
     1201 gtgagatcca tcgaggacgc agtgctagac ctcgaggcaa ttgaaaacaa ccacatgacc
     1261 accctactca aagacaaaga cagcttggca acctacatga gaacccttga ccttgaggag
     1321 gagaaagcca gaaaactctc aaccaaatct gcttcacccg atattgtggg cacaatcaac
     1381 tctcttctgg caagaatcgc tgctgcacgc tccctagtgc atcgggcgaa agaagagctc
     1441 tccagcaggc cgagacctgt cgttgtgatg atatcgggaa gaccagggat agggaaaact
     1501 caccttgcca gggagctggc caagaagatc gcggcctccc tcacagggga ccagcgtgtg
     1561 ggccttatcc cacgcaatgg tgtcgatcac tgggacgcat acaagggcga aagagttgtc
     1621 ctatgggacg actatggaat gagcaacccc atccatgatg ccctcaggtt gcaggagctt
     1681 gctgacactt gccccctcac gctaaattgt gacagaattg agaacaaagg gaaagtcttt
     1741 gacagtgatg ccataattat caccaccaat ctggccaatc cagcaccact ggattatgtc
     1801 aactttgaag cgtgctcgag acgtattgac ttcctcgtgt acgcagaagc ccctgaggtg
     1861 gagaaggcaa agcgcgactt cccaggccaa cctgacatgt ggaagaacgc tttcagtcct
     1921 gacttctcac acataaaact gtcattggct ccacagggtg gttttgacaa gaacggcaac
     1981 accccgcatg gaaaaggggt catgaagacc ctcaccactg gctccctcat cgcccgagca
     2041 tcagggttac tccatgagag gctagatgaa tatgaactgc aaggcccagc cctcaccact
     2101 ttcaactttg accgcaacaa gatacttgct tttagacagc ttgctgctga aaacaagtat
     2161 gggctgatgg acacaatgag agttggaaaa cagctcaagg atgtcaagac catgtcagac
     2221 ctcaaacaga cactcaagaa catcgcgatc aagaagtgcc agatagtgta caatggtagc
     2281 acctacacac ttgaggccga tggcaagggt agtgtgaaag ttgacaaagt gcaaagtgcc
     2341 actgtgcaga ccaacaatga actagccggt gccctacacc acctaaggtg cgctagaatc
     2401 aggtactatg ttaagtgcgt ccaggaggca ctgtattcca tcatccaaat cgctggggct
     2461 gcattcgtca ccacgcgcat cgctaagcgc atgaatatac agaatctctg gtccaaacca
     2521 caggtggaag acacagaaga gatgaccaac aaagatggtt gcctaaaacc caaagatgat
     2581 gaagagtttg tcgtctcatc cgacgacatc aaaactgagg gcaagaaagg gaagaacaag
     2641 tccggccgtg gcaagaagca cacagccttt tcaagcaaag ggctcagtga tgaggaatac
     2701 gatgagtaca agagaattag agaagaaagg aatggtaagt attccataga agagtacctt
     2761 caggacagag acaggtacta cgaggaggtg gccattgcca gggcaaccga agaggacttc
     2821 tgtgaggaag aagaggccaa aatccggcag agaatcttca gaccaacaag gaaacaacgc
     2881 aaagaagaga gggcctctct cggcttggtc acaggctctg aaatcaggaa gagaaaccca
     2941 gaagacttca aacccaaggg aaaactgtgg gctgatgatg acagaagtgt tgactacaat
     3001 gagaaactca attttgaggc cccaccaagc atctggtcgc ggatagtcaa ctttggttca
     3061 ggctggggct tctgggtctc ccccagtctg tttataacat caacccatgt cataccccaa
     3121 ggtgcaaaag agttcttcgg agtccctatc aagcaaatcc agatacacaa atcaggtgaa
     3181 ttctgccgat tgagattccc aaagccaatc agaactgatg tgacgggcat gattctagaa
     3241 gaaggtgcgc ccgaggggac cgtggccaca ctgctcatta agagaccaac tggagagctc
     3301 atgcctctgg cagccagaat ggggacccat gcaaccatga aaattcaggg gcgcaccgtt
     3361 ggagggcaaa tgggtatgct cctgacagga tccaacgcca agagtatgga cctaggcaca
     3421 acaccaggcg actgcggctg cccctacatc tacaagaggg ggaatgacta cgtggtcata
     3481 ggagtccata cggccgctgc ccgtggagga aacactgtca tatgtgccac ccaggggggt
     3541 gagggagaag ccacacttga aggaggtgac agtaaaggga catactgtgg cgcaccaatc
     3601 ttgggcccag ggagcgctcc gaagctcagt accaagacta agttttggag atcatcgaca
     3661 acaccactcc cacctggcac ctacgaacca gcctacctcg gtggtaaaga ccccagagtt
     3721 aaaggtggcc cttcattgca acaagttatg agggaccagc taaaaccatt cacagaaccc
     3781 agaggcaaac caccaagacc aaatgtgttg gaagctgcca agaaaaccat catcaatgtc
     3841 cttgagcaaa caattgatcc accccaaaaa tggtcatttg cgcaagcttg cgcatccctt
     3901 gacaaaacca cctccagcgg ccacccgcac catatgcgga aaaacgattg ttggaatggg
     3961 gagtccttca cgggaaaatt ggctgatcaa gcctccaagg ccaacctaat gtttgaagag
     4021 ggaaagaaca tgactccagt ctacacaggt gcacttaaag atgagttggt aaagaccgat
     4081 aaagtttatg gtaagatcaa aaagaggctt ctgtggggtt cagacctggc gaccatgata
     4141 cggtgcgccc gagcttttgg aggccttatg gatgaactca aggcgcactg tgtcacactt
     4201 cctgtcagag ttggtatgaa catgaatgag gatggcccca tcatctttga gaagcactcc
     4261 agatatagat accactatga tgctgattat tcccggtggg actcaacaca acaaagggat
     4321 gtgctagcag cagcactaga aatcatggtt aagttctctc cagaaccaca cctggcccag
     4381 gtagttgcag aagaccttct ttcccctagc gtgatggatg taggtgactt tcaaatatca
     4441 ataagtgagg gtcttccctc tggggtacct tgtacctccc agtggaattc catcgcccac
     4501 tggctcctca ctctttgtgc actctctgaa gtcacggacc tgtctcctga catcattcag
     4561 gccaactccc ttttctcctt ctatggtgat gatgagattg taagcacaga cataaagttg
     4621 gacccagaga agctgacagc aaaactcaag gagtacgggc taaaaccaac tcgccccgac
     4681 aaaactgagg gaccccttgt tatctctgaa gacctggatg gcctgacatt cctccggaga
     4741 actgtgaccc gtgatccggc tggctggttt ggaaaattgg aacaaagttc aattattaga
     4801 caaatgtact ggaccagagg tcccaaccat gaagatccat ttgaaacaat gataccacac
     4861 tcccaaagac ccatacaatt gatgtccttg ctgggcgagg ctgcactcca cggcccagca
     4921 ttttatagca aaatcagcaa attagtcatt gcagagttga aggaaggtgg catggatttt
     4981 tacgtgccca gacaagagcc aatgttcaga tggatgagat tctcagatct gagcacgtgg
     5041 gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga gtgacgccaa
     5101 cccatctaat gggtccgcag ccaacctcgt cccagaggtc aacaatgagg ttatggctct
     5161 ggagcccgtt gttggtgccg ccattgcggc acctgtagcg ggccaacaaa atgtaattga
     5221 cccctggatt agaaataatt ttgtacaagc ccctggcgga gagtttacag tatcccctag
     5281 aaacgctcca ggtgaaatac tatggagcgc gcccttgggc cctgatctaa atccctacct
     5341 atcccatttg gccagaatgt acaatggtta tgcaggtggt tttgaagtgc aggtaatact
     5401 cgcggggaac gcgttcaccg ccgggaaggt catatttgca gcagtcccac caaattttcc
     5461 aactgaaggc ttgagcccca gccaggtcac tatgttcccc catatagtag tagatgtcag
     5521 gcaactagaa cctgtgttga ttcccttacc cgatgttagg aataatttct atcattataa
     5581 tcaatcaaat gaccccacta ttaagttgat agcaatgttg tatacaccac ttagggctaa
     5641 taatgctggg gatgatgtct ttacagtttc ttgccgggtt ctcacgagac catcccccga
     5701 ttttgatttc atatttctag tgccacccac agttgaatca agaaccaaac cattctctgt
     5761 cccagtttta actgttgagg agatgaccaa ttcaaggttc cccattcctc tggaaaagtt
     5821 gttcacgggt cccagcagtg cctttgttgt ccaaccacaa aacggcaggt gcacgactga
     5881 tggcgtgctc ctaggcacca cccaactgtc tcctgtcaat atctgtacct tcagaggaga
     5941 tgtcacccat atcacaggta gtcataatta cacaatgaat ttggcttctc aaaattggaa
     6001 caattatgac ccaacagaag aaatcccagc ccctctagga actccagact ttgtggggaa
     6061 gattcaaggc gtgctcaccc aaaccacaag gacagatggc tcaacacgcg gccacaaagc
     6121 cacagtgtac actgggagcg ccgactttgc tccaaaactg ggtagagttc agtttgaaac
     6181 tgacacaaac catgattttg aagccaacca aaatacaaag ttcaccccag tcggtgtcat
     6241 ccaagatggt agcaccaccc atagaaatga accccaacag tgggtgctcc caagttactc
     6301 aggcagaaac actcataatg tgcatctggc ccccgctgta gcccccactt ttccgggtga
     6361 gcaacttctc ttcttcagat ccaccatgcc cggatgcagc gggtacccta acatggattt
     6421 ggactgtctg ctcccccagg aatgggtaca gtacttctac caagaggcag ccccagcaca
     6481 atctgatgtg gctctgctaa gatttgtgaa tccagacaca ggtagggtct tgtttgagtg
     6541 taagcttcat aaatcaggct atgttacagt ggctcacact ggccaacatg atttggttat
     6601 tccccccaat ggttatttta gatttgattc ctgggtcaac cagttttaca cgcttgcccc
     6661 catgggaaat ggaacggggc gtagacgtgc agtataatgg ctggagcttt ctttgctgga
     6721 ttggcatctg atgtcctggg ctctggactt ggctccctta tcaatgctgg ggctggggcc
     6781 atcaaccaaa aagttgagtt tgaaaataac agaaaattgc aacaagcatc cttccaattt
     6841 agcagcaatc tacaacaggc ttcctttcaa cacgacaaag agatgctcca agcacaaatt
     6901 gaggccacca aaaagctaca acaggaaatg atgaaagtta agcaggcaat gctcctagag
     6961 ggtgggttct ctgagacaga tgcagcccgc ggggcaatca acgcccccat gacaaaagct
     7021 ttggactgga gcgggacaag gtactgggct cccgatgcta ggaccacaac atacaatgca
     7081 ggccgctttt ccacccctca accatcgggg gcactgccag gaagagctaa tcttagggat
     7141 gctgtccctg ctcggggttc ctccagtaaa tcttctaact cttctactgc tacttctgtg
     7201 tactcaaatc aaaccacttc aacgagactt ggttctacag ctggttctgg caccagtgtc
     7261 tcgagcctcc cgtcaactgc aaggaccagg agctgggttg aggatcaaag taggaattta
     7321 tcacctttca tgaggggggc ccacaacata tcgtttgtca ccccaccatc tagcagatcc
     7381 tctagccaag gcacagtctc aaccgtgcct aaagaggttt tggactcctg gactggcgct
     7441 ttcaacacgc gcaggcagcc actcttcgct cacattcgta agcgagggga gtcacgggcg
     7501 taa
//