Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning

Archive ouverte

Nicolas, Lionel | Lyding, Verena | Borg, Claudia | Forascu, Corina | Fort, Karën | Zdravkova, Katerina | Kosem, Iztok | Cibej, Jaka | Holdt, Špela, Arhar | Millour, Alice | König, Alexander | Rodosthenous, Christos | Sangati, Federico | Hassan, Umair Ul | Katinskaia, Anisia | Barreiro, Anabela | Aparaschivei, Lavinia | Hacohen-Kerner, Yaakov

Edité par CCSD -

International audience. We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners can be involved. We present the approach by explaining its core paradigm that consists in pairing specific types of LRs with specific exercises, by detailing both its strengths and challenges, and by discussing how much these challenges have been addressed at present. Accordingly, we also report on ongoing proof-of-concept efforts aiming at developing the first prototypical implementation of the approach in order to correct and extend an LR called ConceptNet based on the input crowdsourced from language learners. We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation of the generic approach. Finally, we exemplify how it can be used in several language learning scenarios to produce a multitude of NLP resources and how it can therefore alleviate the long-standing NLP issue of the lack of LRs.

Suggestions

Du même auteur

Substituto - A Synchronous Educational Language Game for Simultaneous Teaching and Crowdsourcing

Archive ouverte | Grace Araneta, Marianne | CCSD

International audience. This paper investigates a general framework for synchronous educational language games that simultaneously allows researchers to crowdsource learner answers in a controlled environment. Our p...

EnetCollect in Italy

Archive ouverte | Nicolas, Lionel | Accademia University Press

In this paper, we present the enetCollect1 COST Action, a large network project, which aims at initiating a new Research and Innovation (R&I) trend on combining the well-established domain of language learning with recent and succ...

Are Crescia and Piadina the Same? Towards Identifying Synonymy or Non-Synonymy between Italian Words to Enable Crowdsourcing from Language Learners

Archive ouverte | Aparaschivei, Lavinia | Accademia University Press

We introduce a method to generate candidate pairs of related Italian words sharing (or not) synonymous relations from the ConceptNet knowledgebase. The pairs are intended to generate questions for a vocabulary trainer which combin...

Chargement des enrichissements...