Robustness - Groups

Multimodal Robustness Benchmark

The MMR benchmark is designed to evaluate MLLMs' comprehension of visual content and robustness against misleading questions, ensuring models truly leverage multimodal inputs...

Dataset
JSON

CIFAR-10-C and CIFAR-100-C

CIFAR-10-C and CIFAR-100-C are robustness benchmarks consisting of 19 corruptions types with five levels of severities.

Dataset
JSON

CMNIST

Dataset bias is a significant problem in training fair classifiers. When attributes unrelated to classification exhibit strong biases towards certain classes, classifiers...

Dataset
JSON

LAV Dataset

The LAV dataset is used to evaluate the robustness of the proposed Penalty-based Imitation Learning with Cross Semantics Generation approach.

Dataset
JSON

Imbalanced Gradients

The Imbalanced Gradients dataset is a benchmark for evaluating the robustness of deep neural networks.

Dataset
JSON

5 datasets found

Multimodal Robustness Benchmark

CIFAR-10-C and CIFAR-100-C

CMNIST

LAV Dataset

Imbalanced Gradients