LAPSE:2020.0511
Published Article
LAPSE:2020.0511
Improved Q-Learning Method for Linear Discrete-Time Systems
Jian Chen, Jinhua Wang, Jie Huang
May 22, 2020
In this paper, the Q-learning method for quadratic optimal control problem of discrete-time linear systems is reconsidered. The theoretical results prove that the quadratic optimal controller cannot be solved directly due to the linear correlation of the data sets. The following corollaries have been made: (1) The correlation of data is the key factor in the success for the calculation of quadratic optimal control laws by Q-learning method; (2) The control laws for linear systems cannot be derived directly by the existing Q-learning method; (3) For nonlinear systems, there are some doubts about the data independence of current method. Therefore, it is necessary to discuss the probability of the controllers established by the existing Q-learning method. To solve this problem, based on the ridge regression, an improved model-free Q-learning quadratic optimal control method for discrete-time linear systems is proposed in this paper. Therefore, the computation process can be implemented correctly, and the effective controller can be solved. The simulation results show that the proposed method can not only overcome the problem caused by the data correlation, but also derive proper control laws for discrete-time linear systems.
Keywords
least squares regression, model-free control, optimal control, Q-learning, reinforcement learning, ridge regression
Suggested Citation
Chen J, Wang J, Huang J. Improved Q-Learning Method for Linear Discrete-Time Systems. (2020). LAPSE:2020.0511
Author Affiliations
Chen J: College of Electrical Engineering and Automation, Fuzhou University, Fuzhou 350108, China
Wang J: College of Electrical Engineering and Automation, Fuzhou University, Fuzhou 350108, China; Fujian Key Laboratory of New Energy Generation and Power Conversion, Fuzhou 350108, China
Huang J: College of Electrical Engineering and Automation, Fuzhou University, Fuzhou 350108, China
Journal Name
Processes
Volume
8
Issue
3
Article Number
E368
Year
2020
Publication Date
2020-03-22
Published Version
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr8030368, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2020.0511
This Record
External Link

doi:10.3390/pr8030368
Publisher Version
Download
Files
[Download 1v1.pdf] (339 kB)
May 22, 2020
Main Article
License
CC BY 4.0
Meta
Record Statistics
Record Views
516
Version History
[v1] (Original Submission)
May 22, 2020
 
Verified by curator on
May 22, 2020
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2020.0511
 
Original Submitter
Calvin Tsay
Links to Related Works
Directly Related to This Work
Publisher Version