-
AutoCast++: Enhancing World Event Prediction with Zero-Shot Ranking-Based Con...
The Autocast++ dataset is a benchmark for event forecasting using news articles. -
Simplifying graph convolutional networks
Simplifying graph convolutional networks. -
StackLLaMA: An RL fine-tuned LLaMA model for Stack Exchange question and answ...
The dataset used in the paper is the StackExchange dataset. -
Symbolic, Language Agnostic and Ontologically Grounded Large Language Models
The dataset used in the paper to demonstrate the limitations of large language models (LLMs) in capturing inferential aspects of natural language. -
A general language assistant as a laboratory for alignment
A general language assistant for aligning language models with human users -
SimpleQuestion
The SimpleQuestion dataset is a dataset for question answering, consisting of 100,000 questions and 1,000,000 answers. -
PANDA (Pedantic ANswer-correctness Determination and Adjudication)
Question answering (QA) can only make progress if we know if an answer is correct, but for many of the most challenging and interesting QA examples, current answer correctness... -
ScholarChemQA
ScholarChemQA is a large-scale QA dataset constructed from chemical papers. Specifically, the questions are from paper titles with a question mark, and the multi-choice answers... -
Question Answering Datasets
The dataset used in the paper is a collection of adversarial examples and natural examples for question answering tasks. -
REVERIE dataset
The REVERIE dataset is a dataset of household tasks in an indoor environment. It contains images annotated with natural language instructions including the referring expressions...