The dataset used in the paper is a collection of experiences sampled from a replay buffer, used to train and evaluate the proposed Multi-step DDPG (MDDPG) and Mixed Multi-step DDPG (MMDDPG) algorithms.
BibTex:
Before browse our site, please accept our cookies policy