Dataset - LDM

Measuring Massive Multitask Language Understanding

The dataset introduced in this paper is a multiple-choice question set for evaluating large language models across a broad range of subjects.
- Dataset
- JSON
CommonsenseQA

CommonsenseQA is a 5-way multiple-choice question answering dataset whose questions require commonsense knowledge to answer.
- Dataset
- JSON
Sciq

The Sciq dataset is a multi-domain multiple-choice question dataset consisting of 13,679 questions in physics, chemistry, biology, and other natural sciences.
- Dataset
- JSON
MCQ

The MCQ dataset is a cross-domain cloze-style dataset covering science, vocabulary, common sense, and trivia.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found