A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control

doi:doi:10.57702/y6obe121

A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control

Deep reinforcement learning for high dimensional, hierarchical control tasks usually requires the use of complex neural networks as functional approximators, which can lead to inefﬁciency, instability and even divergence in the training process. Here, we introduce stacked deep Q learning (SDQL), a ﬂexible modularized deep reinforcement learning architecture, that can enable ﬁnding of optimal control policy of control tasks consisting of multiple linear stages in a stable and efﬁcient way.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Yuguang Yang, Johns Hopkins University (2024). Dataset: A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control. https://doi.org/10.57702/y6obe121

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.1911.10684
Author	Yuguang Yang
More Authors	Johns Hopkins University