Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol.

Archive ouverte

Gallien, Sébastien | Perrodou, Emmanuel | Carapito, Christine | Deshayes, Caroline | Reyrat, Jean-Marc | van Dorsselaer, Alain | Poch, Olivier | Schaeffer, Christine | Lecompte, Odile

Edité par CCSD ; Cold Spring Harbor Laboratory Press -

International audience. The progress in sequencing technologies irrigates biology with an ever-increasing number of genome sequences. In most cases, the gene repertoire is predicted in silico and conceptually translated into proteins. As recently highlighted, the predicted genes exhibit frequent errors, particularly in start codons, with a serious impact on subsequent biological studies. A new "ortho-proteogenomic" approach is presented here for the annotation refinement of multiple genomes at once. It combines comparative genomics with an original proteomic protocol that allows the characterization of both N-terminal and internal peptides in a single experiment. This strategy was applied to the Mycobacterium genus with Mycobacterium smegmatis as the reference, and identified 946 distinct proteins, including 443 characterized N termini. These experimental data allowed the correction of 19% of the characterized start codons, the identification of 29 proteins missed during the annotation process, and the curation, thanks to comparative genomics, of 4328 sequences of 16 other Mycobacterium proteomes.

Consulter en ligne

Suggestions

Du même auteur

ICDS database: interrupted CoDing sequences in prokaryotic genomes.

Archive ouverte | Perrodou, Emmanuel | CCSD

International audience. Unrecognized frameshifts, in-frame stop codons and sequencing errors lead to Interrupted CoDing Sequence (ICDS) that can seriously affect all subsequent steps of functional characterization, ...

Interrupted coding sequences in Mycobacterium smegmatis: authentic mutations or sequencing errors?

Archive ouverte | Deshayes, Caroline | CCSD

BACKGROUND: In silico analysis has shown that all bacterial genomes contain a low percentage of ORFs with undetected frameshifts and in-frame stop codons. These interrupted coding sequences (ICDSs) may really be present in the org...

Detecting the molecular scars of evolution in the Mycobacterium tuberculosis complex by analyzing interrupted coding sequences.

Archive ouverte | Deshayes, Caroline | CCSD

BACKGROUND: Computer-assisted analyses have shown that all bacterial genomes contain a small percentage of open reading frames with a frameshift or in-frame stop codon We report here a comparative analysis of these interrupted cod...

Chargement des enrichissements...