Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models

Archive ouverte

Colombo, Pierre | Pellegrain, Victor | Boudiaf, Malik | Storchan, Victor | Tami, Myriam | Ayed, Ismail, Ben | Hudelot, Céline | Piantanida, Pablo

Edité par CCSD -

International audience. Proprietary and closed APIs are becoming increasingly common to process natural language, and are impacting the practical applications of natural language processing, including fewshot classification. Few-shot classification involves training a model to perform a new classification task with a handful of labeled data. This paper presents three contributions. First, we introduce a scenario where the embedding of a pre-trained model is served through a gated API with compute-cost and data-privacy constraints. Second, we propose a transductive inference, a learning paradigm that has been overlooked by the NLP community. Transductive inference, unlike traditional inductive learning, leverages the statistics of unlabeled data. We also introduce a new parameter-free transductive regularizer based on the Fisher-Rao loss, which can be used on top of the gated API embeddings. This method fully utilizes unlabeled data, does not share any label with the third-party API provider and could serve as a baseline for future research. Third, we propose an improved experimental setting and compile a benchmark of eight datasets involving multiclass classification in four different languages, with up to 151 classes. We evaluate our methods using eight backbone models, along with an episodic evaluation over 1,000 episodes, which demonstrate the superiority of transductive inference over the standard inductive setting.

Suggestions

Du même auteur

Open-Set Likelihood Maximization for Few-Shot Learning

Archive ouverte | Boudiaf, Malik | CCSD

International audience. We tackle the Few-Shot Open-Set Recognition (FSOSR) problem, i.e. classifying instances among a set of classes for which we only have a few labeled samples, while simultaneously detecting ins...

Automatic Text Evaluation through the Lens of Wasserstein Barycenters

Archive ouverte | Colombo, Pierre | CCSD

International audience. A new metric BaryScore to evaluate text generation based on deep contextualized embeddings (e.g., BERT, Roberta, ELMo) is introduced. This metric is motivated by a new framework relying on op...

Spatial Contrastive Learning for Few-Shot Classification

Archive ouverte | Ouali, Yassine | CCSD

International audience. In this paper, we explore contrastive learning for few-shot classification, in which we propose to use it as an additional auxiliary training objective acting as a data-dependent regularizer ...

Chargement des enrichissements...