Lasso based feature selection for malaria risk exposure prediction

Archive ouverte

Kouwayè, Bienvenue | Fonton, Noël | Rossi, Fabrice

Edité par CCSD ; Ibai publishing -

International audience. In life sciences, the experts generally use empirical knowledge to recode variables, choose interactions and perform selection by classical approach. The aim of this work is to perform automatic learning algorithm for variables selection which can lead to know if experts can be help in they decision or simply replaced by the machine and improve they knowledge and results. The Lasso method can detect the optimal subset of variables for estimation and prediction under some conditions. In this paper, we propose a novel approach which uses automatically all variables available and all interactions. By a double cross-validation combine with Lasso, we select a best subset of variables and with GLM through a simple cross-validation perform predictions. The algorithm assures the stability and the the consistency of estimators.

Suggestions

Du même auteur

Sélection de variables par le GLM-Lasso pour la prédiction du risque palustre

Archive ouverte | Kouwayè, Bienvenue | CCSD

National audience. In this study, we propose an automatic learning method for variables selection based on Lasso in epidemiology context. One of the aim of this approach is to overcome the pretreatment of experts in...

Predicting local malaria exposure using a Lasso-based two-level cross validation algorithm

Archive ouverte | Kouwaye, Bienvenue | CCSD

International audience. Recent studies have highlighted the importance of local environmental factors to determine the fine-scale heterogeneity of malaria transmission and exposure to the vector. In this work, we co...

Variables selection by the LASSO method. Application to malaria data of Tori-Bossito (Benin)

Archive ouverte | Kouwaye, Bienvenue | CCSD

COPROMATH 2013 Cotonou Bénin. This work deals with prediction of anopheles number using environmental and climate variables. The variables selection is performed by GLMM (Generalized linear mixed model) combined wi...

Chargement des enrichissements...