-
XOR-ATTRIQA
The XOR-ATTRIQA dataset is a classification task where model is asked to predict whether the provided answer to the question is supported by the given passage context, which... -
Stanford Human Preferences (SHP)
The Stanford Human Preferences (SHP) dataset is sourced from Reddit with various subreddits that focus on QA. Preferences have been extracted from the accumulated up- and... -
Simple Question dataset
The dataset used in this paper is a set of categorical probability distributions for a finite set of categories A = {a1,..., ak}. The dataset is used to evaluate the proposed... -
CelebA-spoof: Large-scale face anti-spoofing dataset with rich annotations
A face anti-spoofing dataset with rich annotations, focusing on questions with a single entity and relation. -
Planning by Automatic Prompt Engineering for Large Language Models Agents
The paper proposes a novel method, REPROMPT, for optimizing the step-by-step instructions in the prompt of LLM agents based on the chat history obtained from interactions with... -
SimpQ dataset for Question Answering
The SimpQ dataset contains questions answerable using various knowledge graphs. -
SimpleQuestions dataset for Question Answering
The SimpleQuestions dataset contains questions answerable using various knowledge graphs. -
WebQuestions dataset for Google Suggest
The WebQuestions dataset contains questions answerable using Google Suggest as the knowledge graph. -
MS MARCO Dev (small)
The MS MARCO Dev (small) dataset is a small version of the MS MARCO passage dev set. -
TREC 2020 Deep Learning (Passage Subtask)
The TREC 2020 Deep Learning (Passage Subtask) dataset consists of 54 queries with manual judgments from NIST annotators (211 relevance assessments per query, on average). -
TREC 2019 Deep Learning (Passage Subtask)
The TREC 2019 Deep Learning (Passage Subtask) dataset consists of 43 manually-judged queries using four relevance grades (215 relevance assessments per query, on average). -
SemEval-2013 Task 13
The SemEval-2013 task 13 dataset, containing 20 nouns, 20 verbs, and 10 adjectives in WordNet-sense-tagged contexts. -
bAbI story-based QA dataset
The bAbI story-based QA dataset is composed of 20 different tasks, each of which has 1,000 synthetically-generated story-question pairs. A story can be as short as two sentences... -
Semantic communications: Principles and challenges
This dataset has no description
-
Task-oriented multi-user semantic communications for vqa
This dataset has no description