Dataset - LDM

Measuring Massive Multitask Language Understanding

The dataset used in this paper is a multiple choice question set that allows for the evaluation of large language models.
- Dataset
- JSON
Content Moderation Dataset (CMD)

A dataset of social media content containing potentially biased (unsafe) texts, along with unbiased (safe or benign) variations.
- Dataset
- JSON
BERT

The dataset used in this paper is a pre-trained BERT model trained on English Wikipedia and Books datasets.
- Dataset
- JSON
EgoSchema

EgoSchema is a diagnostic benchmark for assessing very long-form video-language understanding capabilities of modern multimodal systems.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found