-
WebQuestions dataset for Google Suggest
The WebQuestions dataset contains questions answerable using Google Suggest as the knowledge graph. -
Easy-to-Hard: Leveraging Simple Questions for Complex Question Generation
This paper makes one of the first efforts toward automatically generating complex questions from knowledge graphs. -
QASent and WikiQA
The dataset used in the paper is QASent and WikiQA for answer sentence selection and paraphrase identification tasks. -
Twitter Conversations
The dataset used for training the seq2seq model -
Ya-Hoo Answers
The dataset used for training the seq2seq model -
WikiAnswers2
The dataset used for training the seq2seq model -
INFOLOSSQA
INFOLOSSQA is a dataset for characterizing and recovering simplification-induced information loss in form of question-and-answer (QA) pairs. -
Massive Multitask Language Understanding (MMLU) dataset
The MMLU dataset is a benchmark for measuring the behavior of large language models on a number of tasks. It consists of 15908 multiple choice questions distributed across 57... -
QUERY2BOX: REASONING OVER KNOWLEDGE GRAPHS IN VECTOR SPACE USING BOX EMBEDDINGS
Answering complex logical queries on large-scale incomplete knowledge graphs (KGs) is a fundamental yet challenging task. Recently, a promising approach to this problem has been... -
MSMARCO and Natural Questions
The dataset for dense passage retrieval, used for training and testing the proposed PAIR approach.