-
DocRED dataset
The DocRED dataset was built from Wikipedia and Wikidata, covering various relations related to science, art, personal life, etc. -
CALOR corpus
The CALOR corpus is a collection of documents in French language that were hand annotated in frame semantics. -
A Joint Model for Definition Extraction with Syntactic Connection and Semantic...
Definition Extraction (DE) is one of the well-known topics in Information Extraction that aims to identify terms and their corresponding definitions in unstructured texts. -
OPTIMAL QUESTIONNAIRES FOR SCREENING OF STRATEGIC AGENTS
The dataset used in the paper is a set of travel histories and types of travellers, where each traveller has a tendency to misreport their travel history. -
Exploiting Asymmetry for Synthetic Training
The Exploiting Asymmetry for Synthetic Training in Proceedings of the 2023 Conference on data generation: SynthIE and the case of information extraction dataset is used to...