PARSEME Corpus

The PARSEME corpus contains multilingual data used for the detection of verbal multiword expressions (MWEs), structured in CUPT format with annotations for words, lemmas, UPOS, XPOS, and attributes.

Data and Resources

Cite this as

Carlos Ramisch, Silvio Cordeiro, Agata Savary, Veronika Vincze, Verginica Barbu Mititelu, Archna Bhatia, Maja Buljan, Marie Candito (2024). Dataset: PARSEME Corpus. https://doi.org/10.57702/kdd2szga

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update November 25, 2024
Defined In https://doi.org/10.18653/v1/W18-4928
Version 1.1
Author Carlos Ramisch
More Authors
Silvio Cordeiro
Agata Savary
Veronika Vincze
Verginica Barbu Mititelu
Archna Bhatia
Maja Buljan
Marie Candito
Homepage http://opensource.adobe.com/NLP-Cube/blog/posts/1-gbd/results.html