Feature Clustering for Support Identification in Extreme Regions

Open archive

Jalalzai, Hamid | Leluc, Rémi

Published by CCSD; PMLR

International audience. Understanding the complex structure of multivariate extremes is a major challenge in various fields, from portfolio monitoring and environmental risk management to insurance. In the framework of multivariate Extreme Value Theory, a common characterization of the dependence structure of extremes is the angular measure. It is well suited to working in extreme regions, as it provides meaningful insights into the subregions where extremes tend to concentrate their mass. The present paper develops a novel optimization-based approach to assess the dependence structure of extremes. This support identification scheme can be rewritten as the estimation of clusters of features that best capture the support of extremes. The dimension reduction technique we provide is applied to statistical learning tasks such as feature clustering and anomaly detection. Numerical experiments provide strong empirical evidence of the relevance of our approach.

Suggestions

By the same author

Membership Inference Attacks via Adversarial Examples

Open archive | Jalalzai, Hamid | CCSD

Trustworthy and Socially Responsible Machine Learning (TSRML 2022) co-located with NeurIPS 2022. The rise of machine learning and deep learning has led to significant improvements in several domains. This change is supp...

Heavy-tailed Representations, Text Polarity Classification & Data Augmentation

Open archive | Jalalzai, Hamid | CCSD

The dominant approaches to text representation in natural language rely on learning embeddings on massive corpora which have convenient properties such as compositionality and distance preservation. In this paper, we develop a nov...

Learning from multivariate extremes: theory and application to natural language processing (Apprentissage à partir de données extrêmes multivariées : application au traitement du langage naturel)

Open archive | Jalalzai, Hamid | CCSD

Extremes surround us and appear in a large variety of data. Natural data like the ones related to environmental sciences contain extreme measurements; in hydrology, for instance, extremes may correspond to floods and heavy rainfalls...
