The Optimal Noise in Noise-Contrastive Learning Is Not What You Think

Archive ouverte

Chehab, Omar | Gramfort, Alexandre | Hyvärinen, Aapo

Edité par CCSD -

International audience. Learning a parametric model of a data distribution is a well-known statistical problem that has seen renewed interest as it is brought to scale in deep learning. Framing the problem as a self-supervised task, where data samples are discriminated from noise samples, is at the core of state-of-the-art methods, beginning with Noise-Contrastive Estimation (NCE). Yet, such contrastive learning requires a good noise distribution, which is hard to specify; domain-specific heuristics are therefore widely used. While a comprehensive theory is missing, it is widely assumed that the optimal noise should in practice be made equal to the data, both in distribution and proportion; this setting underlies Generative Adversarial Networks (GANs) in particular. Here, we empirically and theoretically challenge this assumption on the optimal noise. We show that deviating from this assumption can actually lead to better statistical estimators, in terms of asymptotic variance. In particular, the optimal noise distribution is different from the data's and even from a different family.

Suggestions

Du même auteur

Uncovering the structure of clinical EEG signals with self-supervised learning

Archive ouverte | Banville, Hubert | CCSD

32 pages, 9 figures. International audience. Objective. Supervised learning paradigms are often limited by the amount of labeled data that is available. This phenomenon is particularly problematic in clinically-rele...

Self-supervised representation learning from electroencephalography signals. Apprentissage de représentations auto-supervisé à partir de signaux d'électroencéphalographie

Archive ouverte | Banville, Hubert | CCSD

International audience. The supervised learning paradigm is limited by the cost - and sometimes the impracticality - of data collection and labeling in multiple domains. Self-supervised learning, a paradigm which ex...

Modeling Shared Responses in Neuroimaging Studies through MultiView ICA

Archive ouverte | Richard, Hugo | CCSD

International audience. Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization. However, the aggregation of data coming from multiple subjects...

Chargement des enrichissements...