Benchmarking joint multi-omics dimensionality reduction approaches for cancer study

Archive ouverte

Cantini, Laura | Zakeri, Pooya | Hernandez, Céline | Naldi, Aurelien | Thieffry, Denis | Remy, Elisabeth | Baudot, Anaïs

Edité par CCSD -

High-dimensional multi-omics data are now standard in biology. They can greatly enhance our understanding of biological systems when effectively integrated. To achieve this multi-omics data integration, Joint Dimensionality Reduction (jDR) methods are among the most efficient approaches. However, several jDR methods are available, urging the need for a comprehensive benchmark with practical guidelines. . We performed a systematic evaluation of nine representative jDR methods using three complementary benchmarks. First, we evaluated their performances in retrieving ground-truth sample clustering from simulated multi-omics datasets. Second, we used TCGA cancer data to assess their strengths in predicting survival, clinical annotations and known pathways/biological processes. Finally, we assessed their classification of multi-omics single-cell data. . From these in-depth comparisons, we observed that intNMF performs best in clustering, while MCIA offers a consistent and effective behavior across many contexts. The full code of this benchmark is implemented in a Jupyter notebook - multi-omics mix (momix) - to foster reproducibility, and support data producers, users and future developers. . High-dimensional multi-omics data are now standard in biology. They can greatly enhance our understanding of biological systems when effectively integrated. To achieve this multi-omics data integration, Joint Dimensionality Reduction (jDR) methods are among the most efficient approaches. However, several jDR methods are available, urging the need for a comprehensive benchmark with practical guidelines. We performed a systematic evaluation of nine representative jDR methods using three complementary benchmarks. First, we evaluated their performances in retrieving ground-truth sample clustering from simulated multi-omics datasets. Second, we used TCGA cancer data to assess their strengths in predicting survival, clinical annotations and known pathways/biological processes. Finally, we assessed their classification of multi-omics single-cell data. From these in-depth comparisons, we observed that intNMF performs best in clustering, while MCIA offers a consistent and effective behavior across many contexts. The full code of this benchmark is implemented in a Jupyter notebook-multi-omics mix (momix)-to foster reproducibility, and support data producers, users and future developers.

Suggestions

Du même auteur

Benchmarking joint multi-omics dimensionality reduction approaches for the study of cancer

Archive ouverte | Cantini, Laura | CCSD

International audience. High-dimensional multi-omics data are now standard in biology. They can greatly enhance our understanding of biological systems when effectively integrated. To achieve proper integration, joi...

Dynamical Boolean Modeling of Immunogenic Cell Death

Archive ouverte | Checcoli, Andrea | CCSD

International audience. As opposed to the standard tolerogenic apoptosis, immunogenic cell death (ICD) constitutes a type of cellular demise that elicits an adaptive immune response. ICD has been characterized in ma...

Integrative proteomic and phosphoproteomic profiling of prostate cell lines

Archive ouverte | Katsogiannou, Maria | CCSD

International audience. Background: Prostate cancer is a major public health issue, mainly because patients relapse after androgen deprivation therapy. Proteomic strategies, aiming to reflect the functional activity...

Chargement des enrichissements...