PARSEME Meets Universal Dependencies: Getting on the Same Page in Representing Multiword Expressions

Archive ouverte

Savary, Agata | Stymne, Sara | Barbu Mititelu, Verginica | Schneider, Nathan | Ramisch, Carlos | Nivre, Joakim

Edité par CCSD -

International audience. Multiword expressions (MWEs) are challenging and pervasive phenomena whose idiosyncratic properties show notably at the levels of lexicon, morphology, and syntax. Thus, they should best be annotated jointly with morphosyntax. We discuss two multilingual initiatives, Universal Dependencies and PARSEME, addressing these annotation layers in cross-lingually unified ways. We compare the annotation principles of these initiatives with respect to MWEs, and we put forward a roadmap towards their gradual unification. The expected outcomes are more consistent treebanking and higher universality in modeling idiosyncrasy.

Suggestions

Du même auteur

PARSEME corpus release 1.3

Archive ouverte | Savary, Agata | CCSD

International audience

UniDive: A COST Action on Universality, Diversity and Idiosyncrasy in Language Technology

Archive ouverte | Savary, Agata | CCSD

International audience. This paper presents the objectives, organization and activities of the UniDive COST Action, a scientific network dedicated to universality, diversity and idiosyncrasy in language technology. ...

Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 23191)

Archive ouverte | Baldwin, Timothy | CCSD

International audience. The Dagstuhl Seminar 23191 entitled "Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics" took place May 7-12, 2023. Its main objectives were to deepen the underst...

Chargement des enrichissements...