Question Answering - Groups

Universal Conceptual Cognitive Annotation (UCCA)

The Universal Conceptual Cognitive Annotation (UCCA) dataset is a graph-based semantic annotation scheme based on typological linguistic principles.

Dataset
JSON

Wikipedia Neutrality Corpus

This dataset is used to test the ability of large language models to detect and correct biased Wikipedia edits according to Wikipedia's Neutral Point of View (NPOV) policy.

Dataset
JSON

Evidence Collection Dataset for Fact-Checking

An evidence collection dataset for fact-checking, containing 390 factual statements with associated human-generated search queries and search results.

Dataset
JSON

FKTC

FKTC is a test set for evaluating the factual knowledge of large language models. It contains 210,158 prompts in total.

Dataset
JSON

SemEval-2021 task 4

The dataset used in the paper for question answering task

Dataset
JSON

ZJUKLAB at SemEval-2021 task 4

The dataset used in the paper for negative augmentation with language model for reading comprehension of abstract meaning

Dataset
JSON

ReCAM

The dataset used in the paper for multiple-choice cloze-style MRC tasks

Dataset
JSON

Wikidata

The dataset used in the paper is Wikidata, which contains a large number of entities and their corresponding semantic types.

Dataset
JSON

MQUAKE: Assessing knowledge editing in language models via multi-hop questions

MQUAKE is a knowledge editing benchmark that includes MQUAKE-CF-3K based on counterfactual edits, and MQUAKE-T with temporal knowledge updates.

Dataset
JSON

PokeMQA: Programmable knowledge editing for Multi-hop Question Answering

Multi-hop question answering (MQA) is one of the challenging tasks to evaluate machine’s comprehension and reasoning abilities, where large language models (LLMs) have widely...

Dataset
JSON

BREAK

Break dataset contains question-decomposition meaning representation (QDMR) annotations from BREAK.

Dataset
JSON

DROP

DROP dataset contains complex compositional questions against natural language passages describing football games and historical events.

Dataset
JSON

Natural Questions, WebQuestions, and TriviaQA

The dataset used in the paper is Natural Questions, WebQuestions, and TriviaQA. These datasets are used for unsupervised open-domain question answering.

Dataset
JSON

MovieQA, TVQA, AVSD, EQA, Embodied QA

A collection of datasets for visual question answering, including MovieQA, TVQA, AVSD, EQA, and Embodied QA.

Dataset
JSON

Affordance-centric Question-driven Task Completion

A new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn from instructional videos to provide step-by-step help in the user’s view.

Dataset
JSON

Towards Question-based Recommender Systems

Conversational and question-based recommender systems have gained increasing attention in recent years, with users enabled to converse with the system and better control...