Dataset - LDM

UAV20L: A Benchmark for Visual Tracking

The UAV20L dataset is a benchmark for visual tracking.
- Dataset
- JSON
OTB-2013: A Benchmark for Visual Tracking

The OTB-2013 dataset is a benchmark for visual tracking.
- Dataset
- JSON
ShapeNet Annotated with Referring Expressions (SNARE)

A benchmark dataset for grounding natural language referring expressions to distinguish 3D objects.
- Dataset
- JSON
MVBench

A comprehensive multi-modal video understanding benchmark.
- Dataset
- JSON
nvBench

The nvBench dataset is a benchmark for text-to-vis models, containing natural language questions and their corresponding data visualizations.
- Dataset
- JSON
3DLoMatch

The 3DLoMatch [12] is a registration dataset with low overlap pairs between 10% − 30%.
- Dataset
- JSON
ACE and CoNLL04

The ACE and CoNLL04 datasets are widely used entity-relation extraction benchmarks.
- Dataset
- JSON
HumanEval

The dataset used in the paper is the HumanEval dataset, which is used to evaluate the performance of language models.
- Dataset
- JSON
SciMT-Safety

The SciMT-Safety dataset is a benchmark for evaluating the safety of AI systems in science. It consists of hundreds of refined red-teaming queries that span the fields of...
- Dataset
- JSON
SciGuard

The SciGuard dataset is a benchmark for evaluating the safety of AI systems in science. It consists of hundreds of refined red-teaming queries that span the fields of chemistry...
- Dataset
- JSON
NAS-Bench-360

A benchmark for neural architecture search.
- Dataset
- JSON
OpenML Benchmark

A benchmark for automated machine learning.
- Dataset
- JSON
PennML Benchmark Suite

The PennML benchmark suite consists of over 90 regression problems and provides a performance overview of several common regression algorithms.
- Dataset
- JSON
CEC2013 Benchmark Functions

The dataset used in this paper is the CEC2013 benchmark functions.
- Dataset
- JSON
Guard: A safe reinforcement learning benchmark

The dataset used in the paper is a collection of robot locomotion tasks with various constraints.
- Dataset
- JSON
Building a conversational agent overnight with dialogue self-play

The Building a conversational agent overnight with dialogue self-play dataset is a benchmark for conversational AI.
- Dataset
- JSON
FAIR-Play

A video benchmark for binaural audio generation from video.
- Dataset
- JSON
AI-Feynman database

The AI-Feynman database is a widely used public benchmark for symbolic regression.
- Dataset
- JSON
Keijzer benchmark

The Keijzer benchmark is a widely used public benchmark for symbolic regression.
- Dataset
- JSON
Nguyen benchmark

The Nguyen benchmark is a widely used public benchmark for symbolic regression.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

82 datasets found