Standardized benchmarking in the quest for orthologs

Open archive

Altenhoff, Adrian M | Boeckmann, Brigitte | Capella-Gutiérrez, Salvador | Dalquen, Daniel A | Deluca, Todd | Forslund, Kristoffer | Huerta-Cepas, Jaime | Linard, Benjamin | Pereira, Cécile | Pryszcz, Leszek P | Schreiber, Fabian | Sousa da Silva, Alan | Szklarczyk, Damian | Train, Clément-Marie | Bork, Peer | Lecompte, Odile | von Mering, Christian | Xenarios, Ioannis | Sjölander, Kimmen | Jensen, Lars Juhl | Martin, Maria J | Muffato, Matthieu | Gabaldon, Toni | Lewis, Suzanna E | Thomas, Paul D | Sonnhammer, Erik | Dessimoz, Christophe

Published by CCSD; Nature Publishing Group

International audience. Achieving high accuracy in orthology inference is essential for many comparative, evolutionary and functional genomic analyses, yet the true evolutionary history of genes is generally unknown and orthologs are used for very different applications across phyla, requiring different precision–recall trade-offs. As a result, it is difficult to assess the performance of orthology inference methods. Here, we present a community effort to establish standards and an automated web-based service to facilitate orthology benchmarking. Using this service, we characterize 15 well-established inference methods and resources on a battery of 20 different benchmarks. Standardized benchmarking provides a way for users to identify the most effective methods for the problem at hand, sets a minimum requirement for new tools and resources, and guides the development of more accurate orthology inference methods.

Suggestions

By the same author

The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements

Open archive | Altenhoff, Adrian M. | CCSD

International audience. The Orthologous Matrix (OMA) project is a method and associated database inferring evolutionary relationships amongst currently 1706 complete proteomes (i.e. the protein sequence associated f...

The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored

Open archive | Szklarczyk, Damian | CCSD

International audience. An essential prerequisite for any systems-level understanding of cellular functions is to correctly uncover and annotate all functional interactions among proteins in the cell. Toward this go...

eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges

Open archive | Powell, Sean | CCSD

International audience
