Don't fear the unlabelled: safe deep semi-supervised learning via simple debiasing

Archive ouverte

Schmutz, Hugo | Humbert, Olivier | Mattei, Pierre-Alexandre

Edité par CCSD -

International audience. Semi supervised learning (SSL) provides an effective means of leveraging unlabelled data to improve a model's performance. Even though the domain has received a considerable amount of attention in the past years, most methods present the common drawback of being unsafe. By safeness we mean the quality of not degrading a fully supervised model when including unlabelled data. Our starting point is to notice that the estimate of the risk that most discriminative SSL methods minimise is biased, even asymptotically. This bias makes these techniques untrustable without a proper validation set, but we propose a simple way of removing the bias. Our debiasing approach is straightforward to implement, and applicable to most deep SSL methods. We provide simple theoretical guarantees on the safeness of these modified methods, without having to rely on the strong assumptions on the data distribution that SSL theory usually requires. We evaluate debiased versions of different existing SSL methods and show that debiasing can compete with classic deep SSL techniques in various classic settings and even performs well when traditional SSL fails.

Consulter en ligne

Suggestions

Du même auteur

18FDG PET/CT and Machine Learning for the prediction of lung cancer response to immunotherapy

Archive ouverte | Schmutz, Hugo | CCSD

International audience. In patients with non-small cell lung cancer (NSCLC) treated with immunotherapy, individual biological and PET imaging prognostic biomarkers have been recently identified. However, combination...

Are labels informative in semi-supervised learning?Estimating and leveraging the missing-data mechanism

Archive ouverte | Sportisse, Aude | CCSD

International audience. Semi-supervised learning is a powerful technique for leveraging unlabeled data to improve machine learning models, but it can be affected by the presence of “informative” labels, which occur ...

Model-agnostic out-of-distribution detection using combined statistical tests

Archive ouverte | Bergamin, Federico | CCSD

International audience. We present simple methods for out-of-distribution detection using a trained generative model. These techniques, based on classical statistical tests, are model-agnostic in the sense that they...

Chargement des enrichissements...