Typing tool
Complete norovirus genomes

| Accession | Genotype (capsid) | P-type (polymerase) |
|---|---|---|
| PV297973 | GII.4 Sydney | GII.P31 |

ORF1: 1..5100
ORF2: 5081..6703
ORF3: 6703..7509
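
The ORF coordinates above can be sanity-checked with a little arithmetic. The following is a minimal sketch, not part of the typing tool: the names `ORFS`, `orf_length`, `reading_frame`, `overlap`, and `summarize` are hypothetical helpers introduced here for illustration. It assumes the coordinates are 1-based and inclusive, as in the GenBank feature table.

```python
# Illustrative helpers (hypothetical names); coordinates are copied from the
# summary above and treated as 1-based, inclusive positions.
ORFS = {
    "ORF1": (1, 5100),
    "ORF2": (5081, 6703),
    "ORF3": (6703, 7509),
}


def orf_length(start, end):
    """Length in nucleotides of an inclusive 1-based interval."""
    return end - start + 1


def reading_frame(start):
    """Reading frame (0, 1 or 2) relative to genome position 1."""
    return (start - 1) % 3


def overlap(a, b):
    """Number of nucleotides shared by two inclusive intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


def summarize(orfs):
    """Per-ORF length, codon count (stop included), and frame."""
    rows = {}
    for name, (start, end) in orfs.items():
        nt = orf_length(start, end)
        rows[name] = {
            "nt": nt,
            "codons": nt // 3,
            "multiple_of_3": nt % 3 == 0,
            "frame": reading_frame(start),
        }
    return rows


if __name__ == "__main__":
    for name, row in summarize(ORFS).items():
        print(name, row)
    print("ORF1/ORF2 overlap:", overlap(ORFS["ORF1"], ORFS["ORF2"]), "nt")
    print("ORF2/ORF3 overlap:", overlap(ORFS["ORF2"], ORFS["ORF3"]), "nt")
```

Under these assumptions, each ORF length is a multiple of three, ORF1 and ORF2 share 20 nt, and ORF2 and ORF3 share a single nucleotide (position 6703). ORF2 sits in a different frame from ORF1 and ORF3.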
LOCUS PV297973 7509 bp RNA linear VRL 28-JUL-2025
DEFINITION Norovirus GII isolate BCM16-1-AP nonstructural polyprotein (ORF1),
VP1 (ORF2), and VP2 (ORF3) genes, complete cds.
ACCESSION PV297973
VERSION PV297973.1
DBLINK BioProject: PRJNA1195114
BioSample: SAMN45209626
Sequence Read Archive: SRR31712873
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7509)
AUTHORS Bhamidipati,S.V., Surathu,A., Ramani,S., Atmar,R.L., Estes,M.K.,
Muzny,D.M., Cregeen,S.J. and Doddapaneni,H.
TITLE Direct Submission
JOURNAL Submitted (14-MAR-2025) Molecular Virology and Microbiology, Baylor
College of Medicine, Alkek Center for Metagenomics and Microbiome
Research, 1 Baylor Plaza, Houston, TX 77030, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: VirMap v. 1
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7509
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="BCM16-1-AP"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="USA: Houston"
/collection_date="08-May-2016"
/note="genotype: GII.4"
gene 1..5100
/gene="ORF1"
CDS 1..5100
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="XQD65074.1"
/translation="MKMASNDASAAAVANSNNDIAKSSSDGVFSNMAVTFKRALGARP
KQPPPKEIPPRPPRPPTPELVKKIPPPPPNGEDELVVSYSAKDGVSGLPELTTVRQPE
ETNTAFSVPPLNQRESRDAKEPLTGTIIEMWDGEIYHYGLYVERGLILGVHKPPAAIS
LAKVELTPLSLFWRPVYTPQYLISPDTLRRLHGESFPYTAFDNNCYAFCCWVLDLNDS
WLSRRMIQRTTGFFRPYQDWNRKPLPTMDDSKLKKVANIFLCTLSSLFTRPIKDIIGK
LKPLNILNILATCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPED
LAVELVPIVMGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFP
KKEEANELAMVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLST
KSASPDIVGTVNSLLARIAAARSLVHRAKEELSSRPRPVVVMISGRPGIGKTHLAREL
AKKIAASLTGDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTC
PLTLNCDRIENKGKVFDSDAIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPEVEK
AKRDFPGQPDMWKNAFSPDFSHIKLSLAPQGGFDKNGNTPHGKGVMKTLTTGSLIARA
SGLLHERLDEYELQGPALTTFNFDRNKILAFRQLAAENKYGLMDTMRVGKQLKDVKTM
SDLKQALKNIAIKKCQIVYNGSTYTLEADGKGSVKVDKVQSATVQTNNELAGALHHLR
CARIRYYVKCVQEALYSIIQIAGAAFVTTRIAKRMNIQNLWSKPQVEDTEEMTNKDGC
LKPKDDEEFVVSSDDIKTEGKKGKNKSGRGKKHTAFSSKGLSDEEYDEYKRIREERNG
KYSIEEYLQDRDRYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERASLGLV
TGSEIRKRNPEDFKPKGKLWADDDRSVDYNEKLNFEAPPSIWSRIVNFGSGWGFWVSP
SLFITSTHVIPQGAKEFFGVPIKQIQIHKSGEFCRLRFPKPIRTDVTGMILEEGAPEG
TVATLLIKRPTGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGD
CGCPYIYKRGNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDSKGTYCGAPILG
PGSAPKLSTKTKFWRSSTTPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRDQLKPFTEP
RGKPPRPNVLEAAKKTIINVLEQTIDPPQKWSFAQACASLDKTTSSGHPHHMRKNDCW
NGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKVYGKIKKRLLWGSDL
ATMIRCARAFGGLMDELKAHCVTLPVRVGMNMNEDGPIIFEKHSRYRYHYDADYSRWD
STQQRDVLAAALEIMVKFSSEPHLAQVVAEDLLSPSVMDVGDFQISISEGLPSGVPCT
SQWNSIAHWLLTLCALSEVTDLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLK
EYGLKPTRPDKTEGPLVISEDLDGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGP
NHEDPFETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQE
PMFRWMRFSDLSTWEGDRNLAPSFVNEDGVE"
mat_peptide 1..990
/gene="ORF1"
/product="p48"
mat_peptide 991..2088
/gene="ORF1"
/product="NTPase"
mat_peptide 2089..2625
/gene="ORF1"
/product="p22"
mat_peptide 2626..3024
/gene="ORF1"
/product="VPg"
mat_peptide 3025..3567
/gene="ORF1"
/product="Pro"
mat_peptide 3568..5097
/gene="ORF1"
/product="RdRp"
gene 5081..6703
/gene="ORF2"
CDS 5081..6703
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="XQD65075.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHTTGSRNYTMNLASQNWNNYDPTEEIPAPLGTPDFVGKIQGVL
TQTTRTDGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTPNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGTGRRRAV"
gene 6703..7509
/gene="ORF3"
CDS 6703..7509
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="XQD65076.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMKVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANFRDAVPARGSSSKSS
NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRDLSPFMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEVLDSWTGAFNTRRQPLFAHIRKRGESRA"
ORIGIN
1 atgaagatgg cgtctaacga cgcttccgct gccgctgttg ccaacagcaa caacgacatc
61 gcaaaatctt caagtgacgg tgtgttttct aacatggctg tcacttttaa gcgggccctc
121 ggggcgcggc ctaaacagcc gcccccgaag gagataccac ccagaccccc gcgaccaccc
181 acaccggaat tggtcaaaaa gatccctcct cccccaccca acggggagga tgaactagtg
241 gtctcttaca gcgccaaaga tggcgtttcc ggattgcctg agctcaccac tgtcagacaa
301 ccggaagaaa ccaacacggc gttcagtgtc cccccactca accaaaggga gagcagggac
361 gccaaggagc cactaactgg aacaattatt gaaatgtggg atggagaaat ctaccattac
421 ggcctgtacg tggaacgagg tcttatactc ggtgtgcaca agccaccggc agccatcagc
481 cttgccaagg tcgagctaac accgctctct ttgttctgga gacctgtata caccccccag
541 tatctcatct ctccagacac tcttaggaga ttacatggag agtcattccc ctacactgca
601 tttgacaaca attgctacgc cttttgttgt tgggtattag acctaaacga ctcatggcta
661 agcaggagaa tgattcagag aacaacaggt ttcttcaggc cgtaccagga ttggaacagg
721 aaacccctcc ccactatgga tgattccaaa ttaaagaagg tagccaacat attcttgtgc
781 actttgtctt cactattcac cagacccatt aaggacataa tagggaagtt gaaacccctc
841 aacatcctta acattctggc tacatgtgat tggaccttcg caggcatagt ggaatcttta
901 atactcttag cagaactctt tggagttttc tggacacccc cagatgtgtc tgcgatgatc
961 gcccccttgc taggtgatta tgaactgcaa ggacctgagg accttgcagt ggagctagtc
1021 ccaatagtga tgggggggat aggtttggtg ctaggattta ccaaagagaa aattggaaag
1081 atgctatcat ccgctgcatc cactttaaga gcttgtaaag accttggtgc atacggactg
1141 gaaatcttaa aattggtcat gaagtggttc ttcccaaaga aagaggaagc aaatgaactg
1201 gctatggtga gatccatcga ggatgcagtg ctagacctcg aggcaattga aaacaaccac
1261 atgaccaccc tactcaaaga caaagacagc ttggcaacct acatgagaac ccttgacctt
1321 gaggaggaga aagccagaaa actctcaacc aaatctgctt cacccgatat tgtgggcaca
1381 gtcaactctc ttctggcaag aatcgctgct gcacgctccc tagtgcatcg ggcgaaagaa
1441 gagctctcca gcaggccgag acctgtcgtt gtgatgatat cgggaagacc agggataggg
1501 aaaactcacc ttgccaggga gctagccaag aagatcgcgg cctccctcac aggggaccag
1561 cgtgtgggtc ttatcccacg caatggtgtc gatcactggg acgcatacaa gggcgaaaga
1621 gttgtcctat gggacgacta tggaatgagc aaccccatcc atgatgccct caggttgcag
1681 gagcttgctg acacttgccc cctcacgcta aattgtgaca gaattgagaa caaagggaaa
1741 gtctttgaca gtgatgctat aattatcacc accaatctgg ccaacccagc accactggat
1801 tatgtcaact ttgaagcgtg ctcgagacgt attgacttcc tcgtgtacgc agaagcccct
1861 gaggtggaga aggcaaagcg cgacttccca ggtcaacctg acatgtggaa gaacgctttc
1921 agtcctgact tctcacacat aaaactgtca ttggctccac agggtggttt tgacaagaac
1981 ggcaacaccc cgcatggaaa aggggtcatg aagaccctca ccactggctc cctcatcgcc
2041 cgagcatcag ggttactcca tgagaggcta gatgaatatg aactgcaagg cccagccctc
2101 accactttca actttgaccg aaacaagata cttgctttta gacagcttgc tgctgaaaac
2161 aagtatgggc tgatggacac aatgagagtt ggaaaacagc tcaaggatgt caagaccatg
2221 tcagacctca aacaagcact caagaacatc gcgatcaaga aatgccagat agtgtacaat
2281 ggtagcacct acacacttga ggccgatggc aagggtagtg tgaaagttga caaagtgcaa
2341 agtgccactg tgcagaccaa caatgaacta gccggtgccc tacaccacct aaggtgcgct
2401 agaatcagat actatgttaa gtgcgtccag gaggcactgt attccatcat ccaaatcgct
2461 ggggctgcat tcgtcaccac gcgcatcgct aagcgcatga atatacagaa tctctggtcc
2521 aagccacagg tggaagacac agaagagatg accaacaaag atggttgcct aaaacccaaa
2581 gatgatgaag agtttgtcgt ctcatccgac gacatcaaaa ctgagggcaa gaaagggaag
2641 aacaagtccg gccgtggcaa gaaacacaca gccttttcaa gcaaagggct cagtgatgag
2701 gagtacgatg agtacaagag aatcagagaa gaaaggaatg gtaagtactc catagaagag
2761 taccttcagg acagagacag gtactacgag gaggtggcca ttgccagggc aaccgaagag
2821 gacttctgtg aagaagaaga ggccaaaatc cggcagagaa tttttagacc aacaaggaaa
2881 caacgcaaag aagagagggc ctctctcggc ttggtcacag gctctgaaat caggaagaga
2941 aacccagaag acttcaaacc caagggaaag ctgtgggctg atgatgacag aagtgttgac
3001 tacaatgaga aactcaactt tgaggcccca ccaagcatct ggtcgcggat agtcaacttt
3061 ggttcaggct ggggcttctg ggtctccccc agtctgttta taacatcaac ccatgtcata
3121 ccccaaggtg caaaagagtt cttcggagtc cctatcaagc aaatccagat acacaaatca
3181 ggtgaattct gccggttgag attcccaaag ccaatcagaa ctgatgtgac gggcatgatt
3241 ctagaagaag gtgcgcccga ggggaccgtg gccacactgc tcatcaagag accaactgga
3301 gagctcatgc ctctggcagc cagaatgggg acccatgcaa ccatgaaaat tcaggggcgc
3361 acagttggag ggcaaatggg tatgctcctg acaggatcca acgccaagag tatggaccta
3421 ggcacaacac caggcgactg cggctgcccc tacatctaca agagggggaa tgactacgtg
3481 gtcataggag tccatacggc cgctgcccgt ggaggaaaca ccgtcatatg tgccacccag
3541 gggagtgagg gagaagccac acttgaagga ggtgacagta aagggacata ctgtggcgca
3601 ccaatcttgg gcccagggag cgctccgaaa ctcagcacca agactaagtt ttggagatca
3661 tccacaacac cactcccacc tggcacctac gaaccagcct acctcggtgg caaagacccc
3721 agagtcaaag gtggcccttc attgcaacaa gttatgaggg accagctaaa gccattcaca
3781 gaacccagag gcaaaccacc aagaccaaac gtgttggaag ctgccaagaa aaccatcatc
3841 aatgtccttg agcaaacaat tgatccaccc caaaaatggt catttgcgca agcttgcgca
3901 tcccttgaca aaaccacctc cagcggccac ccgcaccaca tgcggaaaaa cgattgttgg
3961 aatggggagt ccttcacagg aaaattggct gatcaagcct ccaaggccaa cctaatgttt
4021 gaagagggaa agaacatgac cccagtctac acaggtgcac ttaaagatga gttggtaaag
4081 accgataaag tttatggtaa gatcaagaag aggcttctgt ggggttcaga tctggcgacc
4141 atgatacggt gcgctcgagc ttttggaggc cttatggatg aactcaaggc gcactgtgtc
4201 acacttcctg tcagagttgg tatgaacatg aatgaggatg gccccatcat ctttgagaag
4261 cactccagat atagatatca ctatgatgct gattattccc ggtgggactc aacacaacaa
4321 agggatgtgc tagcagcagc actagaaatc atggttaagt tctcttcaga accacacctg
4381 gcccaggtag ttgcagaaga cctcctttcc cctagcgtaa tggatgtagg tgactttcaa
4441 atatcaataa gtgagggtct tccctctggg gtaccttgta cctcccagtg gaattccatc
4501 gcccactggc tcctcactct ttgtgcactc tctgaagtca cggacctgtc ccctgacatc
4561 attcaggcca actccctttt ctccttctac ggtgatgatg agattgtaag cacagacata
4621 aagttggacc cagagaagct gacagcaaaa ctcaaggagt acgggctgaa accaacccgc
4681 cccgacaaaa ctgaaggacc ccttgttatc tctgaagacc tggatggcct gacattcctc
4741 cggagaactg tgacccgtga tccagctggc tggtttggaa aattggaaca aagttcaatt
4801 ctcagacaaa tgtactggac caggggtccc aaccatgaag atccatttga aacaatgata
4861 ccacactccc aaagacccat acaattgatg tccttgctgg gcgaggctgc actccatggc
4921 ccggcatttt atagcaaaat tagcaagtta gtcattgcag agttgaagga aggtggcatg
4981 gatttttacg tgcccagaca agagccaatg ttcagatgga tgagattctc agatctgagc
5041 acgtgggagg gcgatcgcaa tctggctccc agttttgtga atgaagatgg cgtcgagtga
5101 cgccaaccca tctgatgggt ccgcagccaa cctcgtccca gaggtcaaca atgaggttat
5161 ggctctggag cccgttgttg gtgccgccat tgcggcacct gtagcgggcc aacaaaatgt
5221 aattgacccc tggattagaa acaattttgt acaagcccct ggtggagagt ttacagtgtc
5281 ccctagaaac gctccaggtg aaatactatg gagcgcgccc ttgggccctg atctaaatcc
5341 ctacctatcc catttggcca gaatgtacaa tggttatgca ggtggttttg aagtgcaggt
5401 aattctcgcg gggaacgcgt tcaccgccgg gaaggtcata tttgcagcag tcccaccaaa
5461 ttttccaact gaaggcttga gccccagcca ggtcactatg ttcccccata tagtagtaga
5521 tgttaggcaa ctagaacctg tgttgattcc cttacccgat gttaggaata atttctatca
5581 ttataatcag tcaaatgacc ccaccattaa gttgatagca atgttgtaca caccacttag
5641 ggctaataat gctggggatg atgtcttcac agtttcttgc cgagttctca cgagaccatc
5701 ccccgatttt gatttcatat ttctagtgcc acccacagtt gagtcaagaa ctaaaccatt
5761 ctctgtccca gttttaactg ttgaggagat gaccaattca agattcccca ttcctttgga
5821 aaagttgttc acgggtccca gcagtgcctt tgttgtccaa ccacaaaacg gcaggtgcac
5881 gactgatggc gtgctcctag gcaccaccca actgtctcct gtcaacatct gcaccttcag
5941 aggagatgtc acccatacca caggtagtcg taactacaca atgaatttgg cttctcaaaa
6001 ttggaacaat tatgacccaa cagaagaaat cccagcccct ctaggaactc cagactttgt
6061 ggggaagatt caaggcgtgc tcacccaaac cacaaggaca gatggctcaa cacgcggcca
6121 caaagccaca gtgtacactg ggagcgccga ctttgctcca aaactgggta gagttcaatt
6181 tgaaactgac acagaccatg attttgaagc taaccaaaac acaaagttca ccccagtcgg
6241 tgtcatccaa gatggtagca ccacccaccg aaatgaaccc caacagtggg tgctcccaag
6301 ttactcaggc agaaatactc ctaatgtgca tctggccccc gctgtagccc ccacttttcc
6361 gggtgagcaa cttctcttct tcagatccac catgcccgga tgcagcggtt accccaacat
6421 ggatttggac tgtctgctcc cccaggaatg ggtgcagtac ttctaccaag aggcagcccc
6481 agcacaatct gatgtggctc tgctaagatt tgtgaatcca gacacaggta gggttttgtt
6541 tgagtgtaag cttcataaat caggctatgt tacagtggct cacactggcc aacatgattt
6601 ggttattccc cccaatggtt attttagatt tgattcctgg gtcaaccagt tctacacgct
6661 tgcccccatg ggaaatggaa cggggcgtag acgtgcagta taatggctgg agctttcttt
6721 gctggattgg catctgatgt ccttggctct ggacttggtt cccttatcaa tgctggggct
6781 ggggccatca accaaaaagt tgagtttgaa aataacagaa aattgcaaca agcatccttc
6841 caatttagca gcaatctaca acaggcttcc tttcaacatg acaaagagat gctccaagca
6901 caaattgagg ccaccaaaaa gctacaacag gaaatgatga aagttaagca ggcaatgctc
6961 ctagaaggtg ggttctctga gacagatgca gcccgcgggg caatcaacgc ccccatgaca
7021 aaagctttgg actggagcgg gacaaggtac tgggctcccg atgctaggac tacaacatac
7081 aatgcaggcc gcttttccac ccctcaacca tcgggggcac tgccaggaag agctaatttt
7141 agggatgctg tccctgctcg gggttcctcc agtaaatctt ctaactcttc tactgctact
7201 tctgtgtact caaatcaaac tacttcaacg agacttggtt ctacagctgg ttctggtacc
7261 agtgtctcga gcctcccgtc aactgcaagg actaggagct gggttgagga tcaaagtagg
7321 gatttgtcac ctttcatgag gggggcccat aacatatcgt ttgtcacccc accatctagc
7381 agatcctcta gccaaggcac agtctcaacc gtgcctaaag aggttttgga ctcctggact
7441 ggcgctttca acacgcgcag gcagccactc ttcgctcaca ttcgtaagcg aggggagtca
7501 cgggcgtaa
//
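
The `/translation` qualifiers in the record can be spot-checked against the ORIGIN block. The sketch below assumes the standard genetic code and that the RNA genome is written with `t` rather than `u`, as in the record. The names `origin_to_sequence` and `translate` are hypothetical helpers, not functions from any library or from the typing tool. Only two short excerpts of the ORIGIN block are embedded, to keep the example self-contained.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def origin_to_sequence(lines):
    """Join numbered ORIGIN lines into one uppercase string.

    Position numbers and spaces are dropped. Pass only the numbered
    sequence lines, not the 'ORIGIN' header or the closing '//'.
    """
    letters = "".join(ch for line in lines for ch in line if ch.isalpha())
    return letters.upper().replace("U", "T")


def translate(seq, start=1):
    """Translate from a 1-based start until the first stop codon or the end.

    Codons containing ambiguous bases are reported as 'X'.
    """
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ORIGIN line of the record (start of ORF1).
HEAD = ["1 atgaagatgg cgtctaacga cgcttccgct gccgctgttg ccaacagcaa caacgacatc"]
# Last two ORIGIN lines of the record (end of ORF3; position 7441 is in frame).
TAIL = [
    "7441 ggcgctttca acacgcgcag gcagccactc ttcgctcaca ttcgtaagcg aggggagtca",
    "7501 cgggcgtaa",
]

if __name__ == "__main__":
    print(translate(origin_to_sequence(HEAD)))  # MKMASNDASAAAVANSNNDI
    print(translate(origin_to_sequence(TAIL)))  # GAFNTRRQPLFAHIRKRGESRA
```

The two outputs match the first 20 residues of the ORF1 `/translation` and the last 22 residues of the ORF3 (VP2) `/translation` in the record. For whole-record work, an established GenBank parser would be a more robust choice than this hand-rolled sketch.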