![]() |
||||
|
Typing tool
|
Complete norovirus genomes
| MK907799 | GII.4 Sydney | ||
|---|---|---|---|
| GII.P4 New Orleans |
ORF1: 1..4899
ORF2: 4880..6502
ORF3: 6502..7308
LOCUS MK907799 7390 bp RNA linear VRL 02-NOV-2019
DEFINITION Norovirus GII isolate G19_035 nonstructural polyprotein (ORF1)
gene, partial cds; and VP1 (ORF2) and VP2 (ORF3) genes, complete
cds.
ACCESSION MK907799
VERSION MK907799.1
KEYWORDS .
SOURCE Norovirus GII
ORGANISM Norovirus GII
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Picornavirales; Caliciviridae; Norovirus; Norovirus norwalkense.
REFERENCE 1 (bases 1 to 7390)
AUTHORS Strubbia,S., Phan,M.V., Schaeffer,J., Koopmans,M., Cotten,M. and Le
Guyader,S.
TITLE Optimisation of agnostic metagenomic approaches to characterise
human enteric viruses in sewage
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 7390)
AUTHORS Le Guyader,S. and Strubbia,S.
TITLE Direct Submission
JOURNAL Submitted (09-MAY-2019) LSEM, IFREMER, Rue de l'Ile d'Yeu, NANTES
44311, France
COMMENT ##Assembly-Data-START##
Assembly Method :: SPAdes v. 3.12.0
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..7390
/organism="Norovirus GII"
/mol_type="genomic RNA"
/isolate="G19_035"
/isolation_source="stool"
/host="Homo sapiens"
/db_xref="taxon:122929"
/geo_loc_name="France: Nantes"
/collection_date="01-Oct-2008"
/note="genotype: GII.4-GII.P4_New Orleans 2009"
gene <1..4899
/gene="ORF1"
CDS <1..4899
/gene="ORF1"
/codon_start=1
/product="nonstructural polyprotein"
/protein_id="QCO93096.1"
/translation="IPPPPPNGEDEIVVSYSVKDGVSGLPDLSTVRQPEESNTAFSVP
PLNQRENRDAKEPLTGTILEMWDGEIYHYGLYVERGLVLGVHKPPAAISLAKVELAPL
SLYWRPVYTPQYLISPDTLKKLSGETFPYTAFDNNCYAFCCWVLDLNDSWLSRRMIQR
TTGFFRPYQDWNRKPLPTMDDSKIKKVANIFLCALSSLFTRPIKDIIGKIRPLNILNI
LASCDWTFAGIVESLILLAELFGVFWTPPDVSAMIAPLLGDYELQGPEDLAVELVPVV
MGGIGLVLGFTKEKIGKMLSSAASTLRACKDLGAYGLEILKLVMKWFFPKKEEANELA
IVRSIEDAVLDLEAIENNHMTTLLKDKDSLATYMRTLDLEEEKARKLSTKSASPDIVG
TINALLARIAAARSLVHRAKEELSSRPRPVVLMISGRPGIGKTHLAREVAKRIAASLT
GDQRVGLIPRNGVDHWDAYKGERVVLWDDYGMSNPIHDALRLQELADTCPLTLNCDRI
ENKGKVFDSDVIIITTNLANPAPLDYVNFEACSRRIDFLVYAEAPDVEKAKRDFPGQP
DMWKNAFSSDFSHIKLALAPQGGFDKNGNTPHGKGVMKSLTTGSLIARASGLLHERLD
EFELQGPALTTFNFDRNKVLAFRQLAAENKYGLLDTMRVGKQLKDVKTMPELKQALKN
VSIKKCQIVYSGCTYMLDSDGKGNVKVDRIQSATVQTNNELVGALHHLRCARIRYYVK
CVQEALYSIIQIAGAAFVTTRIAKRMNIQDLWSKPQVENTEETTSKDGCPRPKDDEEF
VISSDDIKTEGKKGKNKTGRGKKHTAFSSKGLSDEEYDEYKRIREERNGKYSIEEYLQ
DRDKYYEEVAIARATEEDFCEEEEAKIRQRIFRPTRKQRKEERVSLGLVTGSEIRKRN
PDDFKPKGKLWADDDRSVDYNEKLSFEAPPSIWSRIVNFGSGWGFWVSPSLFITSTHV
IPQGAKEFFGVPIKQIQVHKSGEFCRLRFPKPIRTDVTGMILEEGAPEGTVVTLLIKR
STGELMPLAARMGTHATMKIQGRTVGGQMGMLLTGSNAKSMDLGTTPGDCGCPYIYKR
GNDYVVIGVHTAAARGGNTVICATQGSEGEATLEGGDNKGTYCGAPILGPGSAPTLST
KTKFWRSSTASLPPGTYEPAYLGGKDPRVKGGPSLQQVMREQLKPFTEPRGKPPKPSV
LEAAKKTIINVLEQTIDPPEKWSFAQACASLDKTTSSGHPHHMRKNDCWNGESFTGKL
ADQASKANLMFEEGKNMTPVYTAALKDELVKTDKIYGKIKKRLLWGSDLATMIRCARA
FGGLMDELKAHCVTLPIRVGMNMNEDGPIIFERHSRYTYHYDADYSRWDSTQQRAVLA
AALEIMVKFSPEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCTSQWNSIAHW
LLTLCALSEVTNLSPDTIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLREYGLKPTRP
DKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHGDPSETM
IPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWMRFS
DLSTWEGDRNLAPSFVNEDGVE"
mat_peptide <1..789
/gene="ORF1"
/product="p48"
mat_peptide 790..1887
/gene="ORF1"
/product="NTPase"
mat_peptide 1888..2424
/gene="ORF1"
/product="p22"
mat_peptide 2425..2823
/gene="ORF1"
/product="VPg"
mat_peptide 2824..3366
/gene="ORF1"
/product="Pro"
mat_peptide 3367..4896
/gene="ORF1"
/product="RdRp"
gene 4880..6502
/gene="ORF2"
CDS 4880..6502
/gene="ORF2"
/note="major capsid protein"
/codon_start=1
/product="VP1"
/protein_id="QCO93094.1"
/translation="MKMASSDANPSDGSAANLVPEVNNEVMALEPVVGAAIAAPVAGQ
QNVIDPWIRNNFVQAPGGEFTVSPRNAPGEILWSAPLGPDLNPYLSHLARMYNGYAGG
FEVQVILAGNAFTAGKVIFAAVPPNFPTEGLSPSQVTMFPHIVVDVRQLEPVLIPLPD
VRNNFYHYNQSNDPTIKLIAMLYTPLRANNAGDDVFTVSCRVLTRPSPDFDFIFLVPP
TVESRTKPFSVPVLTVEEMTNSRFPIPLEKLFTGPSSAFVVQPQNGRCTTDGVLLGTT
QLSPVNICTFRGDVTHITGSRNYTMNLASQNWNNYDPTEEIPAPLGAPDFVGKIQGML
TQTTRADGSTRGHKATVYTGSADFAPKLGRVQFETDTDHDFEANQNTKFTPVGVIQDG
STTHRNEPQQWVLPSYSGRNTHNVHLAPAVAPTFPGEQLLFFRSTMPGCSGYPNMDLD
CLLPQEWVQYFYQEAAPAQSDVALLRFVNPDTGRVLFECKLHKSGYVTVAHTGQHDLV
IPPNGYFRFDSWVNQFYTLAPMGNGAGRRRVV"
gene 6502..7308
/gene="ORF3"
CDS 6502..7308
/gene="ORF3"
/note="minor structural protein; small basic protein"
/codon_start=1
/product="VP2"
/protein_id="QCO93095.1"
/translation="MAGAFFAGLASDVLGSGLGSLINAGAGAINQKVEFENNRKLQQA
SFQFSSNLQQASFQHDKEMLQAQIEATKKLQQEMMRVKQAMLLEGGFSETDAARGAIN
APMTKALDWSGTRYWAPDARTTTYNAGRFSTPQPSGALPGRANLRDAVLARGSSSKSY
NSSTATSVYSNQTTSTRLGSTAGSGTSVSSLPSTARTRSWVEDQSRNLSPLMRGAHNI
SFVTPPSSRSSSQGTVSTVPKEVLNSWTGAFNTRRQPLFAHIRKRGESRV"
ORIGIN
1 attccccccc ccccacccaa cggagaggat gaaatagtgg tctcttatag tgtcaaagat
61 ggtgtttccg gcttgcctga cctttccacc gtcaggcagc cggaagaatc taacacggcc
121 ttcagtgtcc ctccactcaa ccagagggag aatagagatg ctaaggaacc actcactgga
181 acaattctgg aaatgtggga cggggaaatc taccattatg gcctgtatgt ggagcgaggt
241 cttgtactag gcgtgcacaa accgccagct gccattagcc tcgctaaggt tgagttagca
301 ccactctcat tgtactggag acctgtgtac actcctcagt acctcatctc tccagacact
361 ctcaagaaat tgtccggaga aacgttcccc tacacagcct ttgacaacaa ctgctatgcc
421 ttttgttgct gggtcctgga cctaaatgac tcgtggctga gcaggagaat gatccagaga
481 acaactggtt tcttcaggcc ctaccaagac tggaatagga aaccccttcc cactatggat
541 gactccaaaa taaagaaggt ggccaacata tttctgtgtg ctctgtcctc gctattcacc
601 aggcccataa aagatataat agggaagata aggcctctta acatcctcaa catcttagcc
661 tcatgtgatt ggacctttgc gggtatagtg gagtccctga tactcttggc agaactcttt
721 ggagttttct ggacaccccc agatgtgtct gcgatgattg cccccttact tggtgattac
781 gagctacaag ggcctgagga ccttgcagtg gagctcgtcc ccgtggtgat ggggggaatt
841 ggtttggtgc taggattcac caaagagaag attgggaaga tgttgtcatc agctgcgtcc
901 accttaagag cttgcaaaga ccttggtgca tatgggctag agatcctaaa gttagtcatg
961 aagtggttct tcccgaagaa ggaggaggcg aatgagctgg ctatagtgag gtccatcgag
1021 gatgcagtcc tggatctcga agcaattgaa aacaatcata tgaccacctt gcttaaagat
1081 aaagacagtc tggcaaccta catgagaaca cttgaccttg aagaggagaa agccaggaaa
1141 ctctcaacca aatctgcctc acccgacatc gtgggcacaa tcaacgctct cctggcgaga
1201 atcgctgccg cacgttctct ggtgcatcga gcgaaggagg agctttccag cagaccaaga
1261 cctgtggtgt tgatgatatc aggcaggcca ggaataggga agactcacct cgctagggaa
1321 gtggctaaga gaatcgcagc ctcccttaca ggagaccaac gtgttggtct catcccacgc
1381 aatggcgtcg accattggga tgcgtacaag ggggagaggg tcgtcctatg ggacgattat
1441 ggaatgagca accctattca cgatgccctc aggctgcaag aactcgctga cacttgcccc
1501 ctcactctga actgtgacag gattgaaaat aaaggaaagg tctttgacag cgatgtcatc
1561 attatcacca ccaatctggc caacccagcc ccactggact atgtcaactt tgaagcatgc
1621 tcgaggcgca ttgacttcct cgtgtatgca gaagcccctg atgtcgaaaa ggcgaagcgt
1681 gacttcccag gccagcctga catgtggaag aacgctttca gttctgattt ctcacacata
1741 aaactagcac tggccccaca gggtggtttc gacaagaacg ggaacacccc acatggaaag
1801 ggcgtcatga agtctctcac cactggctcc cttattgccc gggcatcagg gctgctccat
1861 gagaggttag atgaatttga actgcagggc ccagctctca ccaccttcaa tttcgatcgc
1921 aataaagtgc tagcctttag acagcttgct gctgaaaata aatatggatt gttggacaca
1981 atgagggttg ggaaacagct caaggacgtc aaaaccatgc cagaactcaa acaagcactc
2041 aagaatgtct caatcaagaa gtgtcaaata gtgtatagtg gttgcaccta catgcttgat
2101 tctgatggca agggcaatgt gaaagttgac aggatccaaa gcgccaccgt gcagaccaac
2161 aatgagctgg ttggtgccct gcaccacttg aggtgcgcca gaatcagata ctatgtcaag
2221 tgtgtccagg aagccctgta ttccatcatt caaattgctg gggctgcatt tgtcaccacg
2281 cgcattgcca agcgcatgaa catacaagac ctatggtcca agccacaagt ggaaaacaca
2341 gaggagacta ccagcaagga cgggtgccca agacccaagg atgatgagga gtttgtcatt
2401 tcgtccgacg acatcaaaac tgagggaaaa aaagggaaga acaagactgg ccgtggcaag
2461 aagcacacag cattttcaag caaaggcctc agtgatgaag agtacgatga atacaagagg
2521 atcagagaag aaaggaatgg caagtactct atagaagagt accttcagga cagggacaaa
2581 tactatgaag aggtggccat tgccagagcg actgaggaag acttctgtga agaggaggaa
2641 gccaagatcc gacaaaggat ctttaggcca acaaggaaac aacgcaagga ggaaagagtc
2701 tctctcggtt tggtcacggg ttctgaaatt aggaaaagaa acccagatga cttcaaaccc
2761 aaggggaaat tgtgggctga cgatgacagg agtgtggact acaatgagaa actcagtttt
2821 gaggccccgc caagcatttg gtcaagaata gtcaactttg gttcaggctg gggattctgg
2881 gtctccccca gcttgttcat aacatcaacc catgttatac cccagggcgc aaaggagttc
2941 tttggagtcc ccatcaaaca aatacaggta cacaagtcag gcgagttctg tcgcttgaga
3001 ttccctaaac caattaggac tgatgtgacg ggtatgatct tagaagaagg cgcacctgag
3061 ggcaccgtgg tcacactact catcaaaagg tccactgggg aacttatgcc cctagcagct
3121 aggatgggga cccatgcgac catgaagatc caagggcgca ctgttggggg ccagatgggc
3181 atgcttctga caggatccaa cgccaagagt atggacctgg gtaccacacc aggtgattgt
3241 ggctgcccct acatctacaa gagaggtaat gactatgtgg tcattggagt ccacacggct
3301 gccgcacgtg gggggaacac tgtcatatgt gccacccaag ggagtgaagg agaggctaca
3361 cttgagggtg gtgacaacaa ggggacatac tgtggtgcac caatcctagg cccagggagt
3421 gccccaacac ttagcaccaa gaccaaattc tggaggtcgt ccacagcatc actcccacct
3481 ggcacctatg aaccagccta tcttggtggc aaggacccta gagttaaggg tggcccttca
3541 ctgcagcaag tcatgaggga acagttgaag ccattcacag agcccagggg taagccacca
3601 aagccaagtg tgttagaagc tgccaagaaa accatcatta atgtccttga gcaaacaatt
3661 gatccacctg agaaatggtc gttcgcacaa gcttgcgcgt ctcttgacaa gaccacttcc
3721 agtggtcatc cgcaccacat gcggaaaaac gactgctgga acggggagtc cttcacaggc
3781 aagctggcag accaggcttc taaggccaac ctgatgtttg aagaagggaa gaacatgacc
3841 ccagtctaca cagctgcgct caaggatgag ttagttaaaa ctgacaaaat ttatggtaag
3901 atcaagaaga ggcttctttg gggctcggac ttggcgacca tgatccggtg tgctcgagca
3961 ttcggaggcc taatggatga actcaaagcg cactgtgtca cacttcccat tagagttggc
4021 atgaatatga atgaggatgg ccccatcatc ttcgagaggc attccaggta cacgtaccac
4081 tatgatgctg attactctcg atgggattca acacaacaga gagccgtgtt ggcagcagcc
4141 ctagaaatca tggtaaaatt ctccccagaa ccacatttgg ctcaggtagt tgctgaagac
4201 cttctttctc ctagcgtggt ggacgtgggc gacttcacaa tatcaatcaa cgagggcctt
4261 ccctctgggg tgccttgcac ctcccaatgg aactccatcg cccactggct tctcactctc
4321 tgtgcgctct ccgaagtcac aaacctgtct cctgatacca tacaggctaa ttctctcttc
4381 tctttttatg gtgatgatga aattgttagc acagacataa aattggaccc agagaaattg
4441 acagcaaagc tcagagaata tgggttaaag ccaacccgcc ctgacaaaac tgaagggccc
4501 cttgtcatct ctgaagacct gaatggtcta actttcctgc ggagaactgt gacccgcgac
4561 ccagctggtt ggtttggaaa actggagcag agttcaatac tcaggcaaat gtactggact
4621 aggggtccca accatggaga cccatctgaa actatgattc cacactccca aaggcccata
4681 caattgatgt ccctactggg ggaggccgct ctccacggcc cagcatttta cagtaaaatt
4741 agcaaattgg tcattgcaga gctaaaagaa ggtggtatgg atttttacgt gcccagacaa
4801 gagccaatgt tcagatggat gagattctca gatctgagca cgtgggaggg cgatcgcaat
4861 ctggctccca gttttgtgaa tgaagatggc gtcgagtgac gccaacccat ctgatgggtc
4921 cgcagccaac ctcgtcccag aggtcaacaa tgaggttatg gctctggagc ccgttgttgg
4981 tgccgccatt gcggcacctg tagcgggcca acaaaatgta attgacccct ggattagaaa
5041 taattttgta caagcccctg gtggagagtt tacagtgtcc cccagaaacg ctccaggtga
5101 aatactatgg agcgcgccct taggccctga tctaaatccc tacctatccc atttggccag
5161 aatgtataat ggttatgcag gtggttttga agtgcaggta attctcgcgg ggaacgcgtt
5221 caccgccggg aaagtcatat ttgcagcagt cccaccaaat ttcccaactg aaggcttgag
5281 ccccagccag gtcactatgt tcccccatat agtagtagat gttaggcaac tagaacctgt
5341 gttgattccc ttacccgatg ttaggaataa tttctaccat tacaatcaat caaatgaccc
5401 caccattaag ttgatagcaa tgttgtacac accacttagg gctaataatg ctggggacga
5461 tgtcttcaca gtttcttgcc gagttctcac gagaccatcc cccgattttg atttcatatt
5521 tctagtgcca cccacagttg agtctagaac taaaccattc tctgtcccag ttttaactgt
5581 tgaggagatg accaattcaa gattccccat ccctctggaa aagttgttca cgggtcccag
5641 cagtgccttt gttgttcaac cacaaaatgg taggtgcacg actgatggcg tgctcctagg
5701 caccacccaa ttgtctcctg tcaacatctg caccttcaga ggggatgtca cccacattac
5761 aggtagtcgt aactacacaa tgaatttggc ttctcaaaat tggaacaatt atgacccaac
5821 agaagaaatc ccagcccctc taggagctcc agattttgtg gggaagattc aaggcatgct
5881 cacccaaacc acaagggcag atggctcaac acgcggccac aaagccacgg tgtacactgg
5941 gagcgccgac tttgctccaa aactgggcag agttcaattt gaaactgaca cagaccatga
6001 ttttgaagct aaccaaaaca caaagttcac cccagtcggt gtcatccaag atggcagcac
6061 cacccaccga aatgaacccc aacagtgggt gctcccaagt tactcaggca gaaatactca
6121 caatgttcat ctggcccccg ctgtagcccc tacttttccg ggtgagcaac ttctcttctt
6181 taggtccact atgcccggat gtagcgggta ccccaacatg gatttggact gtctgctccc
6241 ccaggaatgg gtgcagtact tctaccaaga ggcagcccca gcacaatctg atgtggctct
6301 gctaagattt gtgaatccag acacaggtag ggttttgttt gagtgtaaac ttcataaatc
6361 aggctatgtt acagtggctc acactggcca acatgatttg gttatccccc ccaatggtta
6421 ttttaggttt gattcctggg tcaaccagtt ctacacgctt gcccccatgg gaaatggagc
6481 ggggcgtaga cgtgtagtat aatggctgga gctttctttg ctggattagc atctgatgtc
6541 cttggctctg gacttggctc ccttatcaat gctggggctg gggccatcaa ccaaaaagtt
6601 gagtttgaaa ataacagaaa attgcaacaa gcatccttcc aatttagcag caatctacaa
6661 caggcttcct ttcaacatga caaagaaatg ctccaagcac aaattgaggc caccaaaaag
6721 ctacaacagg aaatgatgag agttaagcag gcaatgctcc tagagggtgg gttctctgag
6781 acagatgcag cccgcggggc aatcaacgcc cccatgacaa aagctttgga ctggagcggg
6841 acaaggtact gggctcctga tgctaggacc acaacataca atgctggccg cttttccacc
6901 cctcaaccat cgggggcact gccaggaaga gctaatctta gggatgctgt ccttgctcgg
6961 ggttcctcta gcaaatctta taactcttct actgctactt ctgtgtactc aaatcaaacc
7021 acttcaacga gacttggttc tacagctggt tctggtacca gtgtctcgag tctcccgtca
7081 actgcaagga ctaggagctg ggttgaggat caaagtagga atttgtcccc tctcatgagg
7141 ggggcccaca acatatcgtt tgtcacccca ccatctagca gatcctctag ccaaggcaca
7201 gtctcaaccg tgcctaaaga ggttttgaac tcctggactg gcgctttcaa cacgcgcagg
7261 cagcctctct tcgctcatat tcgtaaacga ggggagtcac gggtgtaatg tgaaaagaca
7321 aaattgatta tctttctttc tctttagtgt cttttaaaaa aaaaaaaaaa aaaaaaaaaa
7381 aaaaaaaaaa
//