- DocParser: End-to-end OCR-free Information Extraction from Visually Rich Docu...
Information Extraction from visually rich documents is a challenging task that has gained a lot of attention in recent years due to its importance in several document-control...
- Collective Segmentation and Labeling of Distant Entities in Information Extra...
Collective segmentation and labeling of distant entities in information extraction.
- DocRED dataset
The DocRED dataset was built from Wikipedia and Wikidata, covering various relations related to science, art, personal life, etc.
- CALOR corpus
The CALOR corpus is a collection of French-language documents that were hand-annotated with frame semantics.
- A Joint Model for Definition Extraction with Syntactic Connection and Semantic...
Definition Extraction (DE) is one of the well-known topics in Information Extraction that aims to identify terms and their corresponding definitions in unstructured texts.
- OPTIMAL QUESTIONNAIRES FOR SCREENING OF STRATEGIC AGENTS
The dataset used in the paper is a set of travel histories and types of travellers, where each traveller has a tendency to misreport their travel history.
- Exploiting Asymmetry for Synthetic Training
The dataset from "Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction" (in Proceedings of the 2023 Conference on ...) is used to...