SemEval-2016 Semantic Textual Similarity Dataset

The SemEval-2016 Semantic Textual Similarity dataset was used to evaluate models on sentence pairs, with 90% of the data used for training and 10% for validation.
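A 90/10 split of this kind can be sketched as follows. This is only an illustration: the record layout (two sentences and a similarity score), the function name `split_train_validation`, the shuffling step, and the fixed seed are all assumptions, and the toy pairs are placeholders rather than actual dataset content.

```python
import random


def split_train_validation(pairs, validation_fraction=0.1, seed=0):
    """Shuffle sentence pairs and split them into training and validation sets.

    Hypothetical helper: the record does not say how the 90/10 split was
    actually drawn, so a seeded random shuffle is assumed here.
    """
    if not 0.0 < validation_fraction < 1.0:
        raise ValueError("validation_fraction must be between 0 and 1")
    shuffled = list(pairs)                      # copy so the input is left untouched
    random.Random(seed).shuffle(shuffled)       # seeded for reproducibility
    n_validation = round(len(shuffled) * validation_fraction)
    return shuffled[n_validation:], shuffled[:n_validation]


# Made-up placeholder records: (sentence_a, sentence_b, similarity_score).
toy_pairs = [(f"sentence {i} a", f"sentence {i} b", float(i % 6)) for i in range(20)]

train_pairs, validation_pairs = split_train_validation(toy_pairs)
print(len(train_pairs), len(validation_pairs))
```

With 20 placeholder pairs, the call returns 18 training pairs and 2 validation pairs; changing `seed` changes which pairs land in each set but not the sizes.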

Cite this as

Eneko Agirre, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, Inigo Lopez-Gazpio, Montse Maritxalar, Rada Mihalcea, German Rigau, Larraitz Uria, Janyce Wiebe (2024). Dataset: SemEval-2016 Semantic Textual Similarity Dataset. https://doi.org/10.57702/1xyq4gms

DOI retrieved: November 25, 2024

Additional Info

Field         Value
Created       November 25, 2024
Last update   November 25, 2024
Defined In    https://doi.org/10.18653/v1/S17-2016
Author        Eneko Agirre
More Authors  Carmen Banea
              Claire Cardie
              Daniel Cer
              Mona Diab
              Aitor Gonzalez-Agirre
              Weiwei Guo
              Inigo Lopez-Gazpio
              Montse Maritxalar
              Rada Mihalcea
              German Rigau
              Larraitz Uria
              Janyce Wiebe
Homepage      http://www.aclweb.org/anthology/S16-2025