Typing tool

Complete norovirus genomes

OR397766  GII.1
 GII.P1

Length: 7,513 | 3 CDS

ORF1: 5..5104
ORF2: 5085..6692
ORF3: 6692..7471
LOCUS       OR397766                7513 bp    RNA     linear   VRL 28-NOV-2023
DEFINITION  Norovirus GII isolate NHP_#2_ch2_d3, complete genome.
ACCESSION   OR397766
VERSION     OR397766.1
KEYWORDS    .
SOURCE      Norovirus GII
  ORGANISM  Norovirus GII
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Picornavirales; Caliciviridae; Norovirus; Norwalk virus.
REFERENCE   1  (bases 1 to 7513)
  AUTHORS   Rimkute,I., Chaimongkol,N., Woods,K.D., Nagata,B.M., Darko,S.,
            Gudbole,S., Henry,A.R., Sosnovtsev,S.V., Olia,A.S., Verardi,R.,
            Bok,K., Todd,J.-P., Woodward,R., Kwong,P.D., Douek,D.C.,
            Alves,D.A., Green,K.Y. and Roederer,M.
  TITLE     A non-human primate model for human norovirus infection
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7513)
  AUTHORS   Rimkute,I., Chaimongkol,N., Woods,K.D., Nagata,B.M., Darko,S.,
            Gudbole,S., Henry,A.R., Sosnovtsev,S.V., Olia,A.S., Verardi,R.,
            Bok,K., Todd,J.-P., Woodward,R., Kwong,P.D., Douek,D.C.,
            Alves,D.A., Green,K.Y. and Roederer,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-AUG-2023) VRC, NIH, 40 Convent Drive, Bethesda, MD
            20892, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics Workbench v. v.21.0.3
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..7513
                     /organism="Norovirus GII"
                     /mol_type="genomic RNA"
                     /isolate="NHP_#2_ch2_d3"
                     /isolation_source="stool"
                     /host="Rhesus macaque"
                     /db_xref="taxon:122929"
                     /country="USA"
                     /collection_date="May-2022"
                     /note="genotype: GII.1"
     gene            5..5104
                     /gene="ORF1"
     CDS             5..5104
                     /gene="ORF1"
                     /codon_start=1
                     /product="nonstructural polyprotein"
                     /protein_id="WLE90371.1"
                     /translation="MKMASNDASAAAAANSNNDTVKSSSDGVLSSMAVTFKRALGARP
                     KQPPPREIPQRPPRPPTPELIKKVPPPPPNGEDEPVVSYSVKDGVSGLPDLSTVRQPP
                     ENNTAFSVPPLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVEQGLVLGVHKPPAAIS
                     LAKVELTPLSLYWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
                     WLSRRMIHRTTGFFRPYQDWNRKPLPTMDDSKLEKVANIFLCALSSLFTRPIKDIIGK
                     LRPLNILNILASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
                     LAVELVPVVMGGVGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
                     KKEEASELAMVRSIEDAVLDLEAIENNHMTALLKDKDSLAAYMRTLDLEEEKARKLST
                     KSASPDIVGTINALLARIAAARSLVHRAKEELSSRPRPVVVMISGKPGIGKTHLAREL
                     AKKIAATLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRIQELADTC
                     PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYADAPDVEK
                     AKRDFPGQPDMWKNAFSPDFSHIKLMLAPQGGFDKNGNTPHGKGVMKTLTVGSLIARA
                     SGLLHERLDEYELQGPALTTYNFDRNKVLAFRQLAAENKYGLMDTMRVGGQLKGVRTM
                     SELKQALKNISVKRCQIVYSGCTYTLESDGKGSVRVDRVQNTTVQTNNELAGALHHLR
                     CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQDLWSKPQLEDTGEAVSKEGC
                     PKPKDDEEFVVSSDDIKVEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
                     KYSIEEYLQDRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
                     TGSEIRKRNPDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSP
                     SLFITSTHVIPQGTQEFFGVSIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
                     TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
                     CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGNEGEAILEGGDDKGTYCGAPILG
                     PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKSGPSLQQVMRDQLKPFTEP
                     RGKQPKPSVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHIRKNDCW
                     NGDSFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLWGSDL
                     ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYKYHYDADYSRWD
                     STQQRAVLAAALEIMVKFSPEPHLAQVVAEDLLSPSVMDVGDFKISINEGLPSGVPCT
                     SQWNSITHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
                     EYGLKPTRPDKTEGPLIISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
                     NHEDPSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVISELKEGGMDFYVPRQE
                     PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
     mat_peptide     5..994
                     /gene="ORF1"
                     /product="p48"
     mat_peptide     995..2092
                     /gene="ORF1"
                     /product="NTPase"
     mat_peptide     2093..2629
                     /gene="ORF1"
                     /product="p22"
     mat_peptide     2630..3028
                     /gene="ORF1"
                     /product="VPg"
     mat_peptide     3029..3571
                     /gene="ORF1"
                     /product="Pro"
     mat_peptide     3572..5101
                     /gene="ORF1"
                     /product="RdRp"
     gene            5085..6692
                     /gene="ORF2"
     CDS             5085..6692
                     /gene="ORF2"
                     /note="major capsid protein"
                     /codon_start=1
                     /product="VP1"
                     /protein_id="WLE90372.1"
                     /translation="MKMASNDAAPSNDGAAGLVPEVNNETMALEPVAGASIAAPLTGQ
                     NNVIDPWIRMNFVQAPNGEFTVSPRNSPGEILLNLELGPELNPFLAHLSRMYNGYAGG
                     VEVQVLLAGNAFTAGKLVFSAIPPHFPLENLSPGQITMFPHVIIDVRTLEPVLLPLPD
                     VRNNFFHYNQQPEPRMRLVAMLYTPLRSNGSGDDVFTVSCRVLTRPSPDFDFNYLVPP
                     TVESKTKPFTLPILTIGELSNSRFPVPIDELYTSPNEGVIVQPQNGRSTLDGELLGTT
                     QLVPSNICALRGRINAQVPDDHHQWNLQVTNTNGTPFDPTEDVPAPLGTPDFLANIYG
                     VTSQRNPNNTCRAHDGVLATWSPKFTPKLGSVILGTWEESDLDLNQPTRFTPVGLFNT
                     DHFDQWALPSYSGRLTLNMNLAPSVSPLFPGEQLLFFRSHIPLKGGTSDGAIDCLLPQ
                     EWIQHFYQESAPSPTDVALIRYTNPDTGRVLFEAKLHRQGFITVANSGSRPIVVPPNG
                     YFRFDSWVNQFYSLAPMGTGNGRRRVQ"
     gene            6692..7471
                     /gene="ORF3"
     CDS             6692..7471
                     /gene="ORF3"
                     /note="minor structural protein; small basic protein"
                     /codon_start=1
                     /product="VP2"
                     /protein_id="WLE90373.1"
                     /translation="MAGAFIAGLAGDIVTNSVGSLVNAGANAINQKVDFENNKQLQQA
                     SFNHDKEMLQAQIQATKQLQADMIALRQGVLTAGGFSPTDAARGAVNAPMTQVLDWNG
                     TRYWAPGATKTTAFSGGFTSSSHARTVDLPKKTAAAPATMPVSRPSSSASTASTRSTL
                     VSGSSNLPSSARSSSSVFSQSTSPSSRTSEWVRSQNRALEPYMRGALQTAYVTPPSSR
                     ASSNGTVSTVPKEVLDSWTSVFNTHRQPLFAHLRRRGESQV"
ORIGIN      
        1 gtgaatgaag atggcgtcta acgacgcttc cgctgccgct gctgctaaca gcaacaacga
       61 caccgtaaaa tcttcaagtg acggagtact ctctagtatg gctgtcacct ttaaacgggc
      121 cctcggggcg cggcctaagc agccgccccc gagggaaata ccacaaagac ccccacgacc
      181 acccactcca gagttgatca aaaaggtccc ccctcccccg cccaatgggg aggatgaacc
      241 agtggtctcc tatagcgtca aagatggcgt ttccgggctg cctgacctct ccaccgtcag
      301 acagccacct gagaacaaca cggcgtttag cgttcctcca ctcaaccaga gggagaacag
      361 ggatgccaag gagccactga ctggaacaat cctggaaatg tgggatgggg agatttacca
      421 ttatggcttg tatgtggaac aaggcctggt gcttggcgtg cacaaaccgc cagcggccat
      481 cagcctcgct aaagttgaat taacaccact ctctttatac tggagacctg tgtacacccc
      541 ccagtatctc atctccccag acactcttag gaggctccat ggggaatcgt tcccttacac
      601 agcttttgac aacaactgct atgccttctg ttgctgggtt ttagacctga atgactcatg
      661 gttgagtaga agaatgatac ataggacaac tggcttcttc agaccttacc aagattggaa
      721 tagaaagccc cttcccacca tggatgactc caaactggag aaggtggcca atattttcct
      781 gtgtgctttg tcttcactat tcactagacc catcaaagac ataataggga agctaagacc
      841 ccttaacatc ctcaatattt tagcatcatg tgattggact tttgcaggca tagtggaatc
      901 cctgatcctc ttggcagagc tctttggagt tttctggaca cccccagatg tgtctgcgat
      961 gatcgccccc ttactgggtg actacgagct gcaggggcct gaggacctcg cggtagaact
     1021 agtgccagta gtaatgggag gggtaggttt ggtgttagga ttcaccaaag aaaagattgg
     1081 aaagatgctg tcatccgctg cgtccacatt gagagcttgt aaagaccttg gtgcgtacgg
     1141 gctggagatc ttaaaattag tcatgaaatg gttcttcccg aagaaagagg aggcgagcga
     1201 attggccatg gtgaggtcca tcgaggacgc ggtgttggac ctcgaagcaa ttgagaacaa
     1261 ccatatgacc gctttactta aagataagga cagcctggcg gcttacatga gaactctcga
     1321 ccttgaggaa gagaaagcca gaaagctctc aaccaaatct gcttcacccg acatcgtggg
     1381 cacaatcaac gctctcttgg cgaggatcgc agccgctcgc tccctcgtgc atcgggcaaa
     1441 ggaagagctc tccagtaggc cgagacccgt tgttgtgatg atatcgggaa aaccagggat
     1501 agggaagacc catcttgcca gagaactggc caaaaagatc gcagctactc tcacagggga
     1561 tcagagagtg ggtctcattc cacgcaatgg cgttgaccac tgggacgcat acaaggggga
     1621 gagggtcgtc ctttgggatg actacgggat gagcaacccc atccatgatg ctctcagaat
     1681 acaggagctt gctgacactt gccccctaac attaaactgt gatagaattg aaaacaaagg
     1741 gaaagttttt gacagtgatg ccataatcat caccaccaac ttggccaacc ctgcaccact
     1801 agactatgtc aattttgagg catgctcgag gcgcattgat ttcctcgtgt atgccgacgc
     1861 ccctgatgtc gaaaaggcga agcgcgactt cccaggccaa cctgacatgt ggaaaaatgc
     1921 ttttagtcct gacttctcac acataaaatt gatgctagcc ccacagggtg gttttgataa
     1981 gaacggcaac accccacatg ggaagggcgt catgaaaacc ctcaccgttg gttccctcat
     2041 cgcccgtgca tcaggactcc tccatgagag actggatgag tacgaactac agggcccagc
     2101 tctcacaacc tacaactttg accgaaacaa agtgcttgct ttcaggcagc ttgctgctga
     2161 aaacaagtac ggtttaatgg acacaatgag agtcggaggg cagctcaagg gtgtcagaac
     2221 catgtcagag ctcaaacaag cactcaaaaa catctcagtt aaaaggtgcc agatagtgta
     2281 cagtggttgc acttacacac ttgaatctga tggtaagggc agtgtgaggg ttgacagagt
     2341 tcagaacacc actgtgcaga ccaacaacga gttagccggc gccctgcacc atctcaggtg
     2401 cgccaggatt aggtactatg tcaagtgtgt tcaagaggcc ctgtattcca tcatccaaat
     2461 tgcaggggct gcgtttgtca ccacgcgcat tgccaaacgc atgaacatac aggacctttg
     2521 gtccaagcca cagctggagg acacaggaga agctgtcagc aaagaagggt gcccaaaacc
     2581 caaggatgat gaggagttcg ttgtttcatc tgacgacatc aaggtcgagg gcaagaaagg
     2641 gaaaaacaag actggtcgcg gcaagaaaca cacagccttc tctagtaaag gtctcagtga
     2701 tgaggagtac gacgagtaca aaagaatcag ggaggaaaga aacggcaagt actctataga
     2761 agagtatctc caggacagag acaagtatta cgaggaggtt gccattgcta gggcgactga
     2821 ggaggacttc tgtgaagaag aagaagccaa aattcgacag aggatcttca gaccaacaag
     2881 gaaacaacgt aaagaggaga gggcttctct cggtctagtc acaggctctg agattaggaa
     2941 gagaaaccca gatgacttca aacccaaggg aaaactgtgg gccgatgacg acagaagtgt
     3001 tgattacaat gagaagctca gttttgaagc cccaccaagc atctggtcaa gaatagtcaa
     3061 ctttggttcg ggctggggtt tttgggtctc tcctagtctg ttcataacat caactcatgt
     3121 tattccccag ggcacacagg agttctttgg agtgtctatc aaacaaatcc aaatacacaa
     3181 atcaggtgag ttttgccgct tgaggttccc aaagccaatc aggactgacg tgacaggtat
     3241 gatcttagag gaaggtgccc ccgaagggac cgtggccaca ctgctcatca aaagaccaac
     3301 tggtgagctc atgcctttag cagccaggat gggaactcac gcaaccatga agatccaagg
     3361 tcgcactgtt gggggccaaa tgggtatgct cctaacagga tccaatgcta aaagtatgga
     3421 cctgggcact acacccggtg actgtggttg tccctacatc tataagagag ggaatgacta
     3481 tgtggtcatc ggggttcaca cagctgctgc ccgtgggggg aacactgtca tatgtgccac
     3541 ccagggtaat gaaggtgagg ccatacttga gggcggagat gacaaaggca cctactgtgg
     3601 tgccccaatc ctgggtccag gaagtgcccc aaagctcagc accaagacca aattttggag
     3661 gtcatccaca acgccactcc cgcctggcac ctacgaacca gcctacctcg gtggcaagga
     3721 ccccagagtc aagagtggcc cctcattaca acaagtcatg agagaccagt tgaaaccatt
     3781 cacagaacca agaggtaaac aaccaaaacc aagtgtgttg gaggctgcca agaaaaccat
     3841 catcaatgtt cttgaacaaa caatagaccc gcctcaaaaa tggtcatttg cgcaagcttg
     3901 tgcgtctctt gacaagacca cctccagtgg ccacccacac cacatacgga agaatgactg
     3961 ctggaatggg gactctttca caggcaagtt ggcagatcag gcctcgaagg ccaacttgat
     4021 gtttgaggag gggaagaaca tgaccccagt ctacacaggt gccctcaagg atgagttagt
     4081 caaaactgac aagatatatg gtaagatcaa gaagaggtta ctctggggct cggacctggc
     4141 aaccatgatc cggtgtgcac gagcgttcgg aggtctgatg gacgaactta aagcccactg
     4201 tgtcacactc cctgtcagag ttggtatgaa catgaatgag gatggcccca ttatctttga
     4261 gaaacactcc aggtataaat atcattatga tgcagattat tctcgatggg actcaacaca
     4321 gcagagagcc gtactagctg cagccctaga gatcatggtc aaattctccc cagagccaca
     4381 cttggcccag gtagttgcag aagaccttct ttcccccagt gtgatggatg tgggtgactt
     4441 caagatatca atcaacgagg gtcttccctc tggggtgccc tgcacctcgc aatggaactc
     4501 catcacccac tggctcctca ctctttgtgc actctctgaa gtcacggacc tgtcccctga
     4561 catcattcaa gccaattcct tattctcttt ctatggtgat gatgaaattg tgagtacaga
     4621 cataaaattg gacccagaaa aactgacagc aaaactgaaa gagtatgggc tgaagccaac
     4681 ccgccctgac aagactgagg gacctctgat catttctgag gatctggatg gtttaacctt
     4741 tctgcggaga actgtaaccc gtgatccagc tggttggttt ggtaaattag aacagagctc
     4801 aatacttagg cagatgtact ggactagggg ccccaaccat gaggatccat ctgaaacaat
     4861 gataccacat tcccaaaggc ccatacagtt gatgtctctg ctaggtgaag ctgcattgca
     4921 cggtccagca ttctacagca aaatcagtaa actagtcatt tcagagttga aggaaggtgg
     4981 catggacttt tacgtgccca ggcaagagcc gatgttcaga tggatgagat tctcagacct
     5041 gagcacgtgg gagggcgatc gcaatctggc tcccagtttt gtgaatgaag atggcgtcga
     5101 atgacgccgc cccatctaat gatggtgcag ccggtctcgt accagaggtc aacaacgaga
     5161 cgatggccct cgaaccggtg gctggggctt ctatagccgc ccctctaacc ggtcaaaata
     5221 atgtgataga cccctggatt agaatgaact ttgtccaagc cccaaatgga gaattcacag
     5281 tgtctccccg caattctcct ggtgaaatct tgctaaattt ggaattaggc cctgaattaa
     5341 atccattctt agcacacctt tcaagaatgt ataatggtta tgccggcggg gttgaagtgc
     5401 aggtactact cgctgggaac gcgttcacag cgggaaaact ggtgttttca gcaatccccc
     5461 cgcacttccc tcttgagaat ctgagtcctg gacaaattac aatgttccct catgtgatta
     5521 ttgatgttag aacattagaa cctgtgcttt tgccccttcc agatgttaga aataatttct
     5581 ttcattacaa tcagcagccc gagccccgta tgagacttgt agctatgttg tatactcctc
     5641 ttagatctaa tggttctggt gatgatgtgt tcacagtttc ttgcagggtt ctcacccgcc
     5701 cttctccaga ttttgatttt aattatttgg ttcccccaac tgtggagtct aaaactaaac
     5761 cattcaccct gccaatccta actattggag aattgtcaaa ttctagattc ccagttccaa
     5821 tagatgaatt gtacaccagc cccaatgaag gagtgatcgt gcagccccaa aatggcagat
     5881 caacacttga tggtgaattg ttgggcacca cgcaactcgt gccctcaaac atctgtgcgc
     5941 tacgagggcg cattaacgcc caggtgccag atgatcacca tcaatggaac ctacaggtaa
     6001 caaacacaaa tgggactcct ttcgacccca ccgaagacgt ccctgcacca ctgggcacac
     6061 cggatttcct ggcgaatatc tatggagtca ccagccagag aaaccccaac aacacttgcc
     6121 gtgcccatga tggggttttg gcaacttgga gccccaaatt tacacccaag ttaggatctg
     6181 tgattttggg cacttgggaa gaaagtgatc ttgatctcaa tcagcccaca aggttcacac
     6241 ctgttggtct gtttaacact gaccactttg atcagtgggc cttgcctagt tattctggaa
     6301 gattaaccct aaacatgaat ttggcaccct ctgtttcccc cctctttcca ggtgaacagc
     6361 tacttttctt caggtcccat ataccactca aaggaggtac ctctgatggt gccattgatt
     6421 gtctactccc ccaggaatgg attcagcatt tttatcagga gtcagcccca tcgcccacgg
     6481 acgtggctct aattagatac accaatcctg acacaggccg cgttttgttt gaagctaaac
     6541 tgcacaggca aggattcatc acagtggcaa actctggttc taggcctatt gttgtccctc
     6601 cgaatggcta ttttaggttt gattcttggg ttaatcaatt ctattctctc gcccccatgg
     6661 gaactgggaa cgggcgcaga agagtgcagt aatggctgga gcttttatag cagggcttgc
     6721 tggtgacata gtcaccaata gtgttggctc acttgtgaac gctggagcta atgcaataaa
     6781 ccaaaaagtg gactttgaaa acaacaagca actacagcag gcttctttca atcatgacaa
     6841 agagatgctg caagctcaaa tccaagccac caaacagcta caggctgaca tgattgctct
     6901 cagacaaggg gtgttgaccg caggcggctt ctcccctact gatgcagcaa gaggggcagt
     6961 taatgcgcct atgacgcagg tcttggactg gaatgggact aggtattggg cccccggagc
     7021 cacaaaaacc accgctttct ccggtgggtt cactagctct tcccatgcca ggactgttga
     7081 tctgcccaag aaaacagcag ctgcaccagc cacgatgcct gtttctagac ccagttcttc
     7141 tgcttctaca gcctctactc gctcaacatt ggttagcggg tcttctaacc ttccttcttc
     7201 agctaggagt tcttctagtg tcttttccca atccacttcc ccttcctccc ggactagtga
     7261 gtgggtgcgt agtcaaaata gggcactgga gccttacatg aggggggcgc tacaaacagc
     7321 ttacgtgacg cctccttcta gtagagcatc tagtaatggc acagtctcaa ccgtgccgaa
     7381 agaagttttg gactcctgga catctgtgtt taacactcac agacagccgc tcttcgctca
     7441 tctccgtcgg agaggggagt cacaagttta gtgaaaagat gatctcttct ttctttgaaa
     7501 atttctgtct ttt
//