2 datasets found

Tags: human-annotated

Filter Results
  • StrategyQA

    The StrategyQA dataset is used to evaluate the ability of LLMs in generating accurate answers to multi-step reasoning questions.
  • Rel3D

    Rel3D is a large-scale dataset of human-annotated spatial relations in 3D. It consists of spatial relations situated in synthetic 3D scenes, making it possible to extract rich...
You can also access this registry using the API (see API Docs).