A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control

Deep reinforcement learning for high dimensional, hierarchical control tasks usually requires the use of complex neural networks as functional approximators, which can lead to inefficiency, instability and even divergence in the training process. Here, we introduce stacked deep Q learning (SDQL), a flexible modularized deep reinforcement learning architecture, that can enable finding of optimal control policy of control tasks consisting of multiple linear stages in a stable and efficient way.

Data and Resources

Cite this as

Yuguang Yang, Johns Hopkins University (2024). Dataset: A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control. https://doi.org/10.57702/y6obe121

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.1911.10684
Author Yuguang Yang
More Authors
Johns Hopkins University