-
Self-Imagine
The paper does not explicitly describe its dataset, but it mentions that the authors used three mathematics tasks (GSM8K, ASDIV, and SVAMP) and nine general-purpose... -
A survey of reasoning with foundation models
The paper discusses the challenges of using large language models for reasoning tasks. -
TravelPlanner
The TravelPlanner dataset is a benchmark for real-world planning with language agents. -
Planning by Automatic Prompt Engineering for Large Language Models Agents
The paper proposes a novel method, REPROMPT, for optimizing the step-by-step instructions in the prompt of LLM agents based on the chat history obtained from interactions with... -
Buffer of Thoughts
Buffer of Thoughts is a novel and versatile thought-augmented reasoning approach for enhancing accuracy, efficiency and robustness of large language models (LLMs). -
NAIVE: A Method for Representing Uncertainty and Temporal Relationships in an...
NAIVE is a low-level knowledge representation language and inferencing process for reasoning about nondeterministic dynamic systems like those found in medicine. -
Kinship dataset
The Kinship dataset is used for relation discovery and reasoning tasks. Given a set of examples about relations, the model infers the direct relation between two people and... -
DNA promoter dataset
The DNA promoter dataset consists of a background theory with 14 logical if-then rules. The rules include four symbols (contact, minus10, minus35, conformation) which are not...