Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines

The dataset used in the paper is a hierarchical structure of propositions, where a higher-level proposition is a temporal abstraction of lower-level propositions. Each proposition represents a subtask, which is assigned to a group of agents that coordinate to make the proposition become true.

Data and Resources

Cite this as

Xuejing Zheng, Chao Yu (2024). Dataset: Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines. https://doi.org/10.57702/ho6yzwwp

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2403.07005
Author Xuejing Zheng
More Authors
Chao Yu