196 datasets found

Tags: question answering

Filter Results
  • WebKB

    The dataset used in this paper is a probabilistic logic programming dataset, which is a probabilistic version of the WebKB dataset.
  • FKTC

    FKTC is a test set for evaluating the factual knowledge of large language models. It contains 210,158 prompts in total.
  • SemEval-2021 task 4

    The dataset used in the paper for question answering task
  • ZJUKLAB at SemEval-2021 task 4

    The dataset used in the paper for negative augmentation with language model for reading comprehension of abstract meaning
  • Wikidata

    The dataset used in the paper is Wikidata, which contains a large number of entities and their corresponding semantic types.
  • BREAK

    Break dataset contains question-decomposition meaning representation (QDMR) annotations from BREAK.
  • DROP

    DROP dataset contains complex compositional questions against natural language passages describing football games and historical events.
  • MovieQA, TVQA, AVSD, EQA, Embodied QA

    A collection of datasets for visual question answering, including MovieQA, TVQA, AVSD, EQA, and Embodied QA.
  • Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management

    Q-Pain: a question answering dataset to measure social bias in pain management
  • DuReader

    DuReader dataset is a Chinese machine reading comprehension dataset, focusing on real-world web data
  • MS-MARCO

    MS-MARCO dataset is a large-scale question answering dataset, focusing on real-world web data
  • CICERO

    The CICERO dataset is used for training and evaluation.
  • HiTab

    A hierarchical table dataset for question answering and natural language generation.
  • Pandalm Dataset

    The dataset used to train Pandalm, a generative safety evaluator for Chinese.
  • Auto-J Dataset

    The dataset used to train Auto-J, a generative safety evaluator for English.
  • Jade Dataset

    The dataset used to train Jade, a linguistic-based safety evaluation platform for Chinese.
  • ShieldLM Dataset

    The dataset used to train ShieldLM, a generative safety evaluator for English.
  • SAFETY-J Dataset

    The dataset used to train SAFETY-J, a bilingual generative safety evaluator for English and Chinese.
  • MSRVTT

    The MSRVTT is a large-scale dataset for video captioning. It contains 10k video clips and each video clip is accompanied with 20 human-edited English sentence descriptions,...
  • CoQA

    The CoQA dataset is a benchmark for question answering research. It consists of conversational questions.
You can also access this registry using the API (see API Docs).