How to deal with missing data in supervised deep learning?

Archive ouverte

Ipsen, Niels, Bruun | Mattei, Pierre-Alexandre | Frellsen, Jes

Edité par CCSD -

International audience. The issue of missing data in supervised learning has been largely overlooked, especially in the deep learning community. We investigate strategies to adapt neural architectures for handling missing values. Here, we focus on regression and classification problems where the features are assumed to be missing at random. Of particular interest are schemes that allow reusing as-is a neural discriminative architecture. To address supervised deep learning with missing values, we propose to marginalize over missing values in a joint model of covariates and outcomes. Thereby, we leverage both the flexibility of deep generative models to describe the distribution of the covariates and the power of purely discriminative models to make predictions. More precisely, a deep latent variable model can be learned jointly with the discriminative model, using importance-weighted variational inference, essentially using importance sampling to mimick averaging over multiple imputations. In low-capacity regimes, or when the discriminative model has a strong inductive bias, we find that our hybrid generative/discriminative approach generally outperforms single imputations methods.

Suggestions

Du même auteur

not-MIWAE: Deep Generative Modelling with Missing not at Random Data

Archive ouverte | Ipsen, Niels Bruun | CCSD

International audience. When a missing process depends on the missing values themselves, it needs to be explicitly modelled and taken into account while doing likelihood-based inference. We present an approach for b...

Explainability as statistical inference

Archive ouverte | Senetaire, Hugo Henri Joseph | CCSD

International audience. A wide variety of model explanation approaches have been proposed in recent years, all guided by very different rationales and heuristics. In this paper, we take a new route and cast interpre...

Model-agnostic out-of-distribution detection using combined statistical tests

Archive ouverte | Bergamin, Federico | CCSD

International audience. We present simple methods for out-of-distribution detection using a trained generative model. These techniques, based on classical statistical tests, are model-agnostic in the sense that they...

Chargement des enrichissements...