Development of an absolute assignment predictor for triple-negative breast cancer subtyping using machine learning approaches

Open archive

Ben Azzouz, Fadoua | Michel, Bertrand | Lasla, Hamza | Gouraud, Wilfried | François, Anne-Flore | Girka, Fabien | Lecointre, Théo | Guérin-Charbonnel, Catherine | Juin, Philippe | Campone, Mario | Jézéquel, Pascal

Published by CCSD; Elsevier

International audience. Triple-negative breast cancer (TNBC) heterogeneity represents one of the main obstacles to precision medicine for this disease. Recent concordant transcriptomics studies have shown that TNBC could be divided into at least three subtypes with potential therapeutic implications. Although a few studies have been conducted to predict TNBC subtype using transcriptomics data, the subtyping was partially sensitive and limited by batch effect and dependence on a given dataset, which may penalize the switch to routine diagnostic testing. Therefore, we sought to build an absolute predictor (i.e., intra-patient diagnosis) based on machine learning algorithms with a limited number of probes. To that end, we started by introducing binary comparisons between probes for each patient (indicators) and based the predictive analysis on this transformed data. Probe selection first combined filter and wrapper methods for variable selection, using cross-validation. We then tested three prediction models (random forest, gradient boosting [GB], and extreme gradient boosting) using this optimal subset of indicators as inputs. Nested cross-validation consistently allowed us to choose the best model. The results showed that the fifty selected indicators highlighted the biological characteristics associated with each TNBC subtype. The GB model based on this subset of indicators performed better than the other models.
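The two central ideas of the abstract, per-patient binary probe comparisons and nested cross-validation, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the data are synthetic, the direction of the comparison (probe i lower than probe j), the helper names `pairwise_indicators` and `nested_cv_accuracy`, the use of scikit-learn, and the hyperparameter grid are not taken from the paper, and the filter and wrapper selection step is omitted.

```python
from itertools import combinations

import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score


def pairwise_indicators(expr):
    """Binary indicators 1[probe_i < probe_j] for every probe pair, per patient.

    expr has shape (n_patients, n_probes). Each indicator depends only on the
    ordering of two probes within one patient (hypothetical encoding).
    """
    pairs = list(combinations(range(expr.shape[1]), 2))
    left = [i for i, _ in pairs]
    right = [j for _, j in pairs]
    return (expr[:, left] < expr[:, right]).astype(np.int8), pairs


def nested_cv_accuracy(X, y, seed=0):
    """Outer folds estimate accuracy; inner folds tune the model (assumed grid)."""
    inner = StratifiedKFold(n_splits=3, shuffle=True, random_state=seed)
    outer = StratifiedKFold(n_splits=3, shuffle=True, random_state=seed + 1)
    search = GridSearchCV(
        GradientBoostingClassifier(n_estimators=20, random_state=seed),
        param_grid={"max_depth": [1, 2]},
        cv=inner,
    )
    return cross_val_score(search, X, y, cv=outer)


# Synthetic stand-in for expression data: 60 patients, 6 probes, 3 subtypes.
rng = np.random.default_rng(0)
expr = rng.normal(size=(60, 6))
y = np.repeat([0, 1, 2], 20)
expr[y == 1, 0] += 3.0  # subtype 1 over-expresses probe 0
expr[y == 2, 1] += 3.0  # subtype 2 over-expresses probe 1

X, pairs = pairwise_indicators(expr)
scores = nested_cv_accuracy(X, y)
print(X.shape, scores.mean())
```

Because each indicator only records which of two probes is higher within a single patient, rescaling a sample by a positive factor leaves the features unchanged. That is one plausible reading of why such a predictor can be called absolute and is less exposed to batch effects.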

Suggestions

By the same author

bc-GenExMiner 4.5: new mining module computes breast cancer differential gene expression analyses

Open archive | Jézéquel, Pascal | CCSD

International audience. 'Breast cancer gene-expression miner' (bc-GenExMiner) is a breast cancer-associated web portal (http://bcgenex.ico.unicancer.fr). Here, we describe the development of a new statistical mining ...

Interest of the bc-GenExMiner web tool in oncology. Intérêt de l’outil web bc-GenExMiner en oncologie

Open archive | Jézéquel, Pascal | CCSD

International audience. We are taking advantage of the launch of the latest version (v4.6) of our web-based data mining tool "breast cancer gene-expression miner" (bc-GenExMiner) to take stock of its position within...

Gene-expression molecular subtyping of triple-negative breast cancer tumours: importance of immune response

Open archive | Jézéquel, Pascal | CCSD

International audience. INTRODUCTION: Triple-negative breast cancers need to be refined in order to identify therapeutic subgroups of patients. METHODS: We conducted an unsupervised analysis of microarray gene-expres...
