Typing tool

Complete norovirus genomes

MN307994  GII.4 Sydney
 GII.P31

Length: 7,510 | 3 CDS

ORF1: 1..5101
ORF2: 5082..6704
ORF3: 6704..7510
LOCUS       MN307994                7510 bp    RNA     linear   VRL 29-JUN-2021
DEFINITION  Norovirus GII isolate 016_GII.Pe_GII.4_Paraiso do Tocantins
            nonstructural polyprotein (ORF1) gene, partial cds; and VP1 (ORF2)
            and VP2 (ORF3) genes, complete cds.
ACCESSION   MN307994
VERSION     MN307994.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7510)
  AUTHORS   Tinker,R.J., da Costa,A.C., Tahmasebi,R., Milagres,F.A.P., Dos
            Santos Morais,V., Pandey,R.P., Jose-Abrego,A., Brustulin,R.,
            Rodrigues Teles,M.D.A., Cunha,M.S., Araujo,E.L.L., Gomez,M.M.,
            Deng,X., Delwart,E., Sabino,E.C., Leal,E. and Luchs,A.
  TITLE     Norovirus strains in patients with acute gastroenteritis in rural
            and low-income urban areas in northern Brazil
  JOURNAL   Arch Virol 166 (3), 905-913 (2021)
   PUBMED   33462673
REFERENCE   2  (bases 1 to 7510)
  AUTHORS   Tinker,R.J., da-Costa,A.C., Leal,E.S., Tahmasebi,R.,
            Milagres,F.A.P., Pandey,R.P., Jose-Abrego,A., Brustulin,R.,
            Teles,Md.A.R., Sayao-Lobato,M.C.A.B., Chagas,R.T.,
            Santos-Abrao,Md.F.N., Alves-Soares,C.V.D., Cunha,M.S., Deng,X.,
            Delwart,E., Sabino,E.C. and Luchs,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-AUG-2019) Institute of Tropical Medicine, University
            of Sao Paulo, Av Dr Eneas de Carvalho Aguiar 470, Sao Paulo, Sao
            Paulo 05403000, Brazil
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: Geneious v. R9
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7510
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="016_GII.Pe_GII.4_Paraiso do Tocantins"
                     /isolation_source="feces"
                     /host="Homo sapiens"
                     /db_xref="taxon:122929"
                     /country="Brazil"
                     /collection_date="05-Jul-2013"
                     /note="genotype: GII.Pe_GII.4"
     gene            <1..5101
                     /gene="ORF1"
     CDS             <1..5101
                     /gene="ORF1"
                     /codon_start=2
                     /product="nonstructural polyprotein"
                     /protein_id="QED42370.1"
                     /translation="KMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARPK
                     QPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPEE
                     TNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAISL
                     AKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDSW
                     LSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGKL
                     KPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDL
                     AVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPK
                     KEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTK
                     SASPDIVGTINSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLARELA
                     KKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCP
                     LTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEKA
                     KRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARAS
                     GLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTMS
                     DLKQALKNIAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLRC
                     ARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMTNKDGCL
                     KPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNGK
                     YSIEEYLQDRDRYSSMEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
                     TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
                     SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
                     TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
                     CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
                     PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
                     RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
                     NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDL
                     ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
                     STQQRDVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
                     SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
                     EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
                     NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSQLSKLVIAELKEGGMDFYVPRQE
                     PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     <1..988
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     989..2086
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2087..2623
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2624..3025
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3026..3568
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3569..5098
                     /gene="ORF1"
                     /product="RdRp"
     gene            5082..6704
                     /gene="ORF2"
     CDS             5082..6704
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="QED42371.1"
                     /translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
                     QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
                     FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
                     VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
                     TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
                     QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
                     TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
                     STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
                     CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
                     IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
     gene            6704..7510
                     /gene="ORF3"
     CDS             6704..7510
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="QED42372.1"
                     /translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
                     SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
                     APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVPARGSSSKSS
                     NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPFMRGAHNI
                     SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN      
        1 gaagatggcg tctaacgacg cttccgctgc cgctgttgcc aacagcaaca acgacatcgc
       61 aaaatcttca agtgacggtg tgttttctaa catggctgtc acttttaagc gggccctcgg
      121 ggcgcggcct aaacagccgc ccccgaagga gataccaccc agacccccgc gaccacccac
      181 accagaattg gtcaaaaaga tccctcctcc cccacccaac ggggaggatg aactagtggt
      241 ctcttacagc gccaaagatg gcgtttccgg actgcctgag ctcaccactg tcagacaacc
      301 ggaagaaacc aacacggcgt tcagtgtccc cccactcaac caaagggaga gcagggacgc
      361 caaggagcca ctaactggaa caattattga aatgtgggat ggagaaatct accattacgg
      421 cctgtacgtg gaacgaggtc ttatacttgg tgtgcacaag ccaccggcag ccatcagcct
      481 tgccaaggtc gagctaacac cgctctcttt gttctggaga cctgtataca ccccccagta
      541 tctcatctct ccagacactc ttaggagatt acatggagag tcattcccct acactgcatt
      601 tgacaacaat tgctacgcct tttgttgttg ggtattagac ctaaacgact catggctaag
      661 caggagaatg attcagagaa caacaggttt cttcaggccg taccaggatt ggaacaggaa
      721 acccctcccc actatggatg attccaaatt aaagaaggta gccaacatat tcttgtgcac
      781 tttgtcttca ctattcacca gacccattaa ggacataata gggaagttga aacctcttaa
      841 catccttaac attctggcta catgtgattg gaccttcgca ggcatagtgg aatccttaat
      901 actcttagca gaactctttg gagttttctg gacaccccca gatgtgtctg cgatgatcgc
      961 ccccttgcta ggtgattatg aactgcaagg acctgaggac cttgcagtgg aactggtccc
     1021 aatagtgatg ggggggatag gtttggtgct aggatttacc aaagagaaaa ttggaaagat
     1081 gctatcatcc gctgcatcca ctttaagagc ttgtaaagac cttggtgcat acggactgga
     1141 aatcttaaaa ttggtcatga agtggttctt cccaaagaaa gaggaagcaa atgaactggc
     1201 tatggtgaga tccatcgagg acgcagtgct agacctcgag gcaattgaaa acaaccacat
     1261 gaccacccta ctcaaagaca aagacagctt ggcaacctac atgagaaccc ttgaccttga
     1321 ggaggagaaa gccagaaaac tctcaaccaa atctgcttca cccgatattg tgggcacaat
     1381 caactctctt ttggcaagaa tcgctgctgc acgctcccta gtgcatcggg cgaaagaaga
     1441 gctctccagc aggccgagac ctgtcgttgt gatgatatcg ggaagaccag ggatagggaa
     1501 aactcacctt gccagggagc tggccaagaa gatcgcggcc tccctcacag gggaccagcg
     1561 tgtgggtctt atcccacgca atggtgtcga tcactgggac gcatacaagg gcgaaagagt
     1621 tgtcctatgg gacgactatg gaatgagcaa ccccatccat gatgccctca ggttgcagga
     1681 gcttgctgac acttgccccc tcacgctaaa ttgtgacaga attgagaaca aagggaaagt
     1741 ctttgacagt gatgccataa ttatcaccac caatctggcc aacccagcac cactggatta
     1801 tgtcaacttt gaagcgtgct cgagacgtat tgacttcctc gtgtacgcag aagcccctga
     1861 ggtggagaag gcaaagcgcg acttcccagg tcaacctgat atgtggaaga acgctttcag
     1921 tcctgacttc tcacacataa aactgtcatt ggctccacag ggtggttttg acaagaacgg
     1981 caacaccccg catggaaaag gggtcatgaa gaccctcacc actggctccc tcatcgcccg
     2041 agcatcaggg ttgctccatg agaggctaga tgaatatgaa ctgcaaggcc cagccctcac
     2101 cactttcaac tttgaccgca acaagatact tgcttttaga cagcttgctg ctgaaaacaa
     2161 gtatgggctg atggacacaa tgagagttgg aaaacagctc aaggatgtca agaccatgtc
     2221 agacctcaaa caagcactca agaacatcgc gatcaagaag tgccagatag tgtacaatgg
     2281 tagcacctac acacttgagg ccgatggcaa gggtagtgtg aaagttgaca aagtgcaaag
     2341 tgccactgtg cagaccaaca atgaactagc cggtgcccta caccacctaa ggtgcgctag
     2401 aatcagatac tatgttaagt gcgtccagga ggcactgtat tccatcatcc aaatcgctgg
     2461 ggctgcattc gtcaccacgc gcatcgctaa gcgcatgaat atacagaatc tctggtccaa
     2521 gccacaggtg gaagacacag aagagatgac caacaaagat ggttgcctaa aacccaaaga
     2581 tgatgaagag tttgtcgtct catccgacga catcaaaact gagggcaaga aagggaagaa
     2641 caagtccggc cgtggcaaga agcacacagc cttttcaagc aaagggctca gtgatgagga
     2701 gtacgatgag tacaagagaa tcagagaaga aaggaatggt aagtactcca tagaagagta
     2761 ccttcaggac agagacaggt actcttctat ggaggtggcc atcgccaggg caaccgaaga
     2821 ggacttctgt gaagaagaag aggccaaaat ccggcagaga attttcagac caacaaggaa
     2881 acaacgcaaa gaagagaggg cctctctcgg cttggtcaca ggctctgaaa tcaggaagag
     2941 aaacccagaa gacttcaaac ccaagggaaa gctgtgggct gatgatgaca gaagtgttga
     3001 ctacaatgag aaactcaact ttgaggcccc accaagcatc tggtcgcgga tagtcaactt
     3061 tggttcaggc tggggcttct gggtctcccc cagtctgttt ataacatcaa cccatgtcat
     3121 accccaaggt gcaaaagagt tcttcggagt ccctatcaag caaatccaga tacacaaatc
     3181 aggtgaattc tgccggttga gattcccaaa gccaatcaga actgatgtga cgggcatgat
     3241 tctagaagaa ggtgcgcccg aggggaccgt ggccacactg ctcatcaaga gaccaactgg
     3301 agagctcatg cctctggcag ccagaatggg aacccatgca accatgaaaa ttcaggggcg
     3361 cacagttgga gggcaaatgg gtatgctcct gacaggatcc aacgccaaga gtatggacct
     3421 aggcacaaca ccaggcgact gcggctgccc ctacatctac aagaggggga atgactacgt
     3481 ggtcatagga gtccatacgg ccgctgcccg tggaggaaac actgtcatat gtgccaccca
     3541 ggggagtgag ggagaagcca cacttgaagg aggtgacagt aaagggacat actgtggcgc
     3601 accaatcttg ggcccaggga gcgctccgaa gctcagtacc aagactaagt tttggagatc
     3661 atccacaaca ccactcccac ctggcaccta cgaaccagcc tacctcggtg gcaaagaccc
     3721 cagagtcaaa ggtggccctt cattgcaaca agttatgagg gaccagctaa agccattcac
     3781 agaacccaga ggcaaaccac caagaccaaa tgtgttggaa gctgccaaga aaaccatcat
     3841 caatgtcctt gagcaaacaa ttgatccacc ccaaaaatgg tcatttgcgc aagcttgcgc
     3901 atcccttgac aaaaccacct ccagcggcca cccgcaccac atgcggaaaa acgattgttg
     3961 gaatggggag tccttcacag gaaaattggc tgatcaagcc tccaaggcca acctaatgtt
     4021 tgaagaggga aagaacatga ctccagtcta cacaggtgca cttaaagatg agttggtaaa
     4081 gaccgataaa gtttatggta agatcaagaa gaggcttctg tggggttcag atctggcgac
     4141 catgatacgg tgcgcccgag cttttggagg ccttatggat gaactcaagg cgcactgtgt
     4201 cacacttcct gtcagagttg gtatgaacat gaatgaggat ggccccatca tctttgagaa
     4261 gcattccaga tatagatatc actatgatgc tgattattcc cggtgggact caacacaaca
     4321 aagggatgtg ctagcagcag cactagaaat catggttaag ttctctccag aaccacacct
     4381 ggcccaggta gttgcagaag acctcctttc ccctagcgta atggatgtag gtgactttca
     4441 aatatcaata agtgagggtc ttccctctgg ggtaccttgc acctcccagt ggaattccat
     4501 cgcccactgg ctcctcactc tttgtgcact ctctgaagtc acggacctgt cccctgacat
     4561 cattcaggcc aactcccttt tctccttcta tggtgatgat gagattgtaa gcacagacat
     4621 aaagttggac ccagagaagc tgacagcaaa actcaaggag tacgggctga aaccaacccg
     4681 ccccgacaaa actgaaggac cccttgttat ctctgaagac ctggatggcc tgacattcct
     4741 ccggagaact gtgacccgtg atccagctgg ctggtttgga aaattggaac aaagttcaat
     4801 tctcagacaa atgtactgga ccaggggtcc caaccatgaa gatccatttg aaacaatgat
     4861 accacactcc caaagaccca tacaattgat gtccttgctg ggcgaggctg cactccacgg
     4921 cccggcattc tacagccaac tcagcaaact agtcattgca gagttgaagg aaggtggcat
     4981 ggatttttac gtgcccagac aagagccaat gttcagatgg atgagattct cagatctgag
     5041 cacatgggag ggcgatcgca atctggctcc cagttttgtg aatgaagatg gcgtcgagtg
     5101 acgccaaccc atctgatggg tccgcagcca acctcgtccc agaggtcaac aatgaggtta
     5161 tggctctgga gcccgttgtt ggtgccgcca ttgcggcacc tgtagcgggc caacaaaatg
     5221 taattgaccc ctggattaga aacaattttg tacaagcccc tggtggagag tttacagtat
     5281 cccctagaaa cgctccaggt gaaatactat ggagcgcgcc cttgggccct gatctaaatc
     5341 cctacctatc ccatttggcc agaatgtaca atggttatgc aggtggtttt gaagtgcagg
     5401 tgattctcgc ggggaacgcg ttcaccgccg ggaaggtcat atttgcagca gtcccaccaa
     5461 attttccaac tgaaggcttg agccccagcc aggtcactat gttcccccat atagtagtag
     5521 atgttaggca actagaacct gtgttgattc ccttacccga tgttaggaat aatttctatc
     5581 attataatca atcaaatgac cccaccatta agttgatagc aatgttgtat acaccactta
     5641 gggctaataa tgctggggat gatgtcttca cagtttcttg ccgagttctc acgagaccat
     5701 cccccgattt tgatttcata tttctagtgc cacccacagt tgagtcaaga actaaaccat
     5761 tctctgtccc agttttaact gttgaggaga tgaccaattc aagattcccc attcctttgg
     5821 aaaagttgtt cacgggtccc agcagtgcct ttgttgtcca accacaaaac ggcaggtgca
     5881 cgactgatgg cgtgctccta ggcaccaccc aactgtctcc tgtcaacatt tgcaccttca
     5941 gaggagatgt cacccatatc acaggtagtc gcaactacac aatgaatttg gcttctcaaa
     6001 attggaacaa ttatgaccca acagaagaaa tcccagcccc tctaggaact ccagactttg
     6061 tggggaagat tcaaggcgtg ctcacccaaa ccacaaggac agatggctca acacgcggcc
     6121 acaaagccac agtgtacact gggagcgccg actttgctcc aaaactgggt agagttcaat
     6181 ttgaaactga cacagaccat gattttgagg ctaaccaaaa cacaaagttc accccagtcg
     6241 gtgtcatcca agatggtagc accacccacc gaaatgaacc ccaacagtgg gtgctcccaa
     6301 gttactcagg cagaaatacc cataatgtgc atctggcccc cgctgtagcc cccacttttc
     6361 cgggtgagca acttctcttc ttcagatcca ccatgcccgg atgcagcggg taccccaaca
     6421 tggatttgga ctgtctgctc ccccaggaat gggtgcagta cttctaccaa gaggcagccc
     6481 cagcacaatc tgatgtggct ctgctaagat ttgtgaatcc agacacaggt agggttttgt
     6541 ttgagtgtaa gcttcataaa tcaggctatg ttacagtggc tcacactggc caacatgatt
     6601 tggttattcc ccccaatggt tattttagat ttgattcctg ggtcaaccag ttctacacgc
     6661 ttgcccccat gggaaatgga acggggcgta gacgtgcagt ataatggctg gagctttctt
     6721 tgctggattg gcatctgatg tccttggctc tggacttggt tcccttatca atgctggggc
     6781 tggggccatc aaccaaaaag ttgagtttga aaataacaga aaattgcaac aagcatcctt
     6841 ccaatttagc agcaatctac aacaggcttc ctttcaacat gacaaagaga tgctccaagc
     6901 acaaattgag gccaccaaaa agctacaaca ggaaatgatg aaagttaagc aggcaatgct
     6961 cctagagggt gggttctctg agacagacgc agcccgcggg gcaatcaacg cccccatgac
     7021 aaaagctttg gactggagcg ggacaaggta ctgggctccc gatgctagga ctacaacata
     7081 caatgcaggc cgcttttcca cccctcaacc atcgggggca ctgccaggaa gagctaatct
     7141 tagggatgct gtccctgctc ggggttcctc cagtaaatct tctaactctt ctactgctac
     7201 ttctgtgtac tcaaatcaaa ctacttcaac gagacttggt tctacagctg gttctggtac
     7261 cagtgtctcg agcctcccgt caactgcaag gactaggagc tgggttgagg atcaaagtag
     7321 gaatttgtca cctttcatga ggggggccca caacatatcg tttgtcaccc caccatctag
     7381 cagatcctct agccaaggca cagtctcaac cgtgcctaaa gaggttttgg actcctggac
     7441 tggcgctttc aacacgcgca ggcagccact cttcgctcac attcgtaagc gaggggagtc
     7501 acgggcgtaa
//