Openprot 2021: deeper functional annotation of the coding potential of eukaryotic genomes

Archive ouverte

Brunet, Marie A. | Lucier, Jean-Francois | Levesque, Maxime | Leblanc, Sebastien | Jacques, Jean-Francois | Al-Saedi, Hassan R. H. | Avino, Mariano | Fournier, Isabelle | Salzet, Michel | Ouangraoua, Aida | Scott, Michelle S. | Boisvert, Francois-Michel | Roucou, Xavier

Edité par CCSD ; Oxford University Press -

International audience. OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORFs) while mining experimental data for supporting evidence using cutting-edge algorithms. This update presents the major improvements since the initial release of OpenProt. All species support recent NCBI RefSeq and Ensembl annotations, with changes in annotations being reported in OpenProt. Using the 131 ribosome profiling datasets re-analysed by OpenProt to date, non-AUG initiation starts are reported alongside a confidence score of the initiating codon. From the 177 mass spectrometry datasets re-analysed by OpenProt to date, the unicity of the detected peptides is controlled at each implementation. Furthermore, to guide the users, detectability statistics and protein relationships (isoforms) are now reported for each protein. Finally, to foster access to deeper ORF annotation independently of one's bioinformatics skills or computational resources, OpenProt now offers a data analysis platform. Users can submit their dataset for analysis and receive the results from the analysis by OpenProt. All data on OpenProt are freely available and downloadable for each species, the release-based format ensuring a continuous access to the data. Thus, OpenProt enables a more comprehensive annotation of eukaryotic genomes and fosters functional proteomic discoveries.

Suggestions

Du même auteur

OpenProt: a more comprehensive guide to explore eukaryotic coding potential and proteomes

Archive ouverte | Brunet, Marie, A | CCSD

International audience. Advances in proteomics and sequencing have highlighted many non-annotated open reading frames (ORFs) in eukaryotic genomes. Genome annotations, cornerstones of today's research, mostly rely o...

The Protein Coded by a Short Open Reading Frame, Not by the Annotated Coding Sequence, Is the Main Gene Product of the Dual-Coding Gene MIEF1

Archive ouverte | Delcourt, Vivian | CCSD

International audience. Proteogenomics and ribosome profiling concurrently show that genes may code for both a large and one or more small proteins translated from annotated coding sequences (CDSs) and unannotated a...

Spatially-Resolved Top-down Proteomics Bridged to MALDI MS Imaging Reveals the Molecular Physiome of Brain Regions

Archive ouverte | Delcourt, Vivian | CCSD

International audience. Tissue spatially-resolved proteomics was performed on 3 brain regions, leading to the characterization of 123 reference proteins. Moreover, 8 alternative proteins from alternative open readin...

Chargement des enrichissements...