An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems

doi:10.25835/zmlriehg

We provide supplementary code material to our recent paper "An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems".

The provided MATLAB code compares our proposed Q-learning algorithm with an existing SDP-based method. To reproduce the results of Table I, run the file 'comparison_QL_SDP.m'. Note that CVX is required to run the SDP-based method; see http://cvxr.com/cvx/ for details.

BibTeX:

@dataset{Alsalti_Mohammad_and_Lopez_Victor_G_and_Müller_Matthias_A_2023,
    abstract = {We provide supplementary code material to our recent paper "An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems".

The provided MATLAB code compares our proposed Q-learning algorithm with an existing SDP-based method. To reproduce the results of Table I, run the file 'comparison_QL_SDP.m'. Note that CVX is required to run the SDP-based method; see http://cvxr.com/cvx/ for details.},
    author = {Alsalti, Mohammad and Lopez, Victor G. and Müller, Matthias A.},
    doi = {10.25835/zmlriehg},
    institution = {Institut für Regelungstechnik},
    keyword = {Data-based control, Q-learning, optimal output regulation, reinforcement learning},
    month = {dec},
    publisher = {LUIS},
    title = {An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems},
    url = {https://service.tib.eu/ldmservice/vdataset/luh-qlearning_opfb},
    year = {2023}
}