Information Extraction - Groups

Wiki80

The dataset used in the paper for zero-shot classification tasks in information extraction.

Dataset
JSON

DocParser: End-to-end OCR-free Information Extraction from Visually Rich Docu...

Information Extraction from visually rich documents is a challenging task that has gained a lot of attention in recent years due to its importance in several document-control...

Dataset
JSON

Collective Segmentation and Labeling of Distant Entities in Information Extra...

Collective segmentation and labeling of distant entities in information extraction.

Dataset
JSON

DocRED dataset

The DocRED dataset was built from Wikipedia and Wikidata, covering various relations related to science, art, personal life, etc.

Dataset
JSON

RuREBus

The RuREBus corpus is a dataset of strategic planning documents issued by the Ministry of Economic Development of the Russian Federation.

Dataset
JSON

CALOR corpus

The CALOR corpus is a collection of documents in French language that were hand annotated in frame semantics.

Dataset
JSON

TACRED

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a few-shot relation extraction task (TACRED) and a few-shot variant of TACRED.

Dataset
JSON

W00

This dataset is contributed by Jin et al. It has 731 deﬁnitional and 1454 non-deﬁnitional sentences from the ACL-ARC anthology.

Dataset
JSON

A Joint Model for Deﬁnition Extraction with Syntactic Connection and Semantic...

Deﬁnition Extraction (DE) is one of the well-known topics in Information Extraction that aims to identify terms and their corresponding deﬁnitions in unstructured texts.

Dataset
JSON

ACE05

A joint information extraction dataset providing entity, relation, and event annotation for three languages: English, Chinese, and Arabic.

Dataset
JSON

OPTIMAL QUESTIONNAIRES FOR SCREENING OF STRATEGIC AGENTS

The dataset used in the paper is a set of travel histories and types of travellers, where each traveller has a tendency to misreport their travel history.

Dataset
JSON

Exploiting Asymmetry for Synthetic Training

The Exploiting Asymmetry for Synthetic Training in Proceedings of the 2023 Conference on data generation: SynthIE and the case of information extraction dataset is used to...

Dataset
JSON

12 datasets found