OpenProt: a more comprehensive guide to explore eukaryotic coding potential and proteomes

Open archive

Brunet, Marie, A | Brunelle, Mylène | Lucier, Jean-François | Delcourt, Vivian | Levesque, Maxime | Grenier, Frédéric | Samandi, Sondos | Leblanc, Sébastien | Aguilar, Jean-David | Dufour, Pascal | Jacques, Jean-Francois | Fournier, Isabelle | Ouangraoua, Aïda | Scott, Michelle, S | Boisvert, François-Michel | Roucou, Xavier

Published by CCSD; Oxford University Press

International audience. Advances in proteomics and sequencing have highlighted many non-annotated open reading frames (ORFs) in eukaryotic genomes. Genome annotations, cornerstones of today's research, mostly rely on protein prior knowledge and on ab initio prediction algorithms. Such algorithms notably enforce an arbitrary criterion of one coding sequence (CDS) per transcript, leading to a substantial underestimation of the coding potential of eukaryotes. Here, we present OpenProt, the first database fully endorsing a polycistronic model of eukaryotic genomes to date. OpenProt contains all possible ORFs longer than 30 codons across 10 species, and cumulates supporting evidence such as protein conservation, translation and expression. OpenProt annotates all known proteins (RefProts), novel predicted isoforms (Isoforms) and novel predicted proteins from alternative ORFs (AltProts). It incorporates cutting-edge algorithms to evaluate protein orthology and re-interrogate publicly available ribosome profiling and mass spectrometry datasets, supporting the annotation of thousands of predicted ORFs. The constantly growing database currently cumulates evidence from 87 ribosome profiling and 114 mass spectrometry studies from several species, tissues and cell lines. All data is freely available and downloadable from a web platform (www.openprot.org) supporting a genome browser and advanced queries for each species. Thus, OpenProt enables a more comprehensive landscape of eukaryotic genomes' coding potential.
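The polycistronic model described in the abstract can be illustrated with a minimal sketch that enumerates every ORF above a length threshold on a single transcript, instead of keeping one CDS per transcript. This is not OpenProt's pipeline. The function name `find_orfs`, the ATG-only start codon, the three canonical stop codons, the choice of the first in-frame ATG after each stop, and the reading of "longer than 30 codons" as a strict comparison that counts the start codon but not the stop are all assumptions made for illustration.

```python
# Illustrative sketch only: names and ORF conventions here are assumptions,
# not OpenProt's actual implementation.
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODON = "ATG"


def find_orfs(transcript, min_codons=30):
    """Return (start, end, frame) for each ATG-to-stop ORF longer than min_codons.

    Coordinates are 0-based and end-exclusive; the span includes the stop codon.
    Only the three forward frames are scanned, since a transcript is stranded.
    """
    seq = transcript.upper().replace("U", "T")
    orfs = []
    for frame in range(3):
        start = None  # first in-frame ATG seen since the previous stop codon
        for pos in range(frame, len(seq) - 2, 3):
            codon = seq[pos:pos + 3]
            if start is None and codon == START_CODON:
                start = pos
            elif start is not None and codon in STOP_CODONS:
                n_codons = (pos - start) // 3  # codons before the stop
                if n_codons > min_codons:
                    orfs.append((start, pos + 3, frame))
                start = None  # resume the search after this stop
    return sorted(orfs)


if __name__ == "__main__":
    # Toy transcript carrying two ORFs in different reading frames.
    first = "ATG" + "GCC" * 31 + "TAA"
    second = "ATG" + "AAA" * 40 + "TGA"
    for start, end, frame in find_orfs("C" + first + "CC" + second):
        print(f"frame {frame}: {start}-{end} ({(end - start) // 3 - 1} codons)")
```

Keeping every qualifying ORF per frame, rather than only the longest one on the transcript, is the point of the sketch: a single transcript can then yield several candidate proteins.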

Suggestions

By the same author

Deep transcriptome annotation suggests that small and large proteins encoded in the same genes often cooperate

Open archive | Samandi, Sondos | CCSD

Recent studies in eukaryotes have demonstrated the translation of alternative open reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be c...

Openprot 2021: deeper functional annotation of the coding potential of eukaryotic genomes

Open archive | Brunet, Marie A. | CCSD

International audience. OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORF...

Conservation of physiological dysregulation signatures of aging across primates

Open archive | Dansereau, Gabriel | CCSD

International audience. Two major goals in the current biology of aging are to identify general mechanisms underlying the aging process and to explain species differences in aging. Recent research in humans suggests...
