2 datasets found

Tags: uncertainty estimation

  • UNK-VQA

    UNK-VQA is a dataset for evaluating how large language models respond to questions whose answer is unknown.
  • Shifts Dataset

    The Shifts Dataset is a large, standardized dataset for evaluating uncertainty estimates and robustness to realistic, curated distributional shift.