3 datasets found

Tags: commonsense reasoning

Filter Results
  • COMET

    COMET is a model for commonsense reasoning that can generate coherent and contextually relevant text.
  • CSQA

    The CSQA dataset is a widely used benchmark dataset for conversational KBQA, consisting of around 200K dialogues where training set, validation set and testing set contain 153K,...
  • StrategyQA

    The StrategyQA dataset is used to evaluate the ability of LLMs in generating accurate answers to multi-step reasoning questions.