LAPSE:2019.1131
Published Article

LAPSE:2019.1131
Multivariable System Identification Method Based on Continuous Action Reinforcement Learning Automata
November 5, 2019
Abstract
In this work, a closed-loop identification method based on a reinforcement learning algorithm is proposed for multiple-input multiple-output (MIMO) systems. This method could be an attractive alternative solution to the problem that the current frequency-domain identification algorithms are usually dependent on the attenuation factor. With this method, after continuously interacting with the environment, the optimal attenuation factor can be identified by the continuous action reinforcement learning automata (CARLA), and then the corresponding parameters could be estimated in the end. Moreover, the proposed method could be applied to time-varying systems online due to its online learning ability. The simulation results suggest that the presented approach can meet the requirement of identification accuracy in both square and non-square systems.
In this work, a closed-loop identification method based on a reinforcement learning algorithm is proposed for multiple-input multiple-output (MIMO) systems. This method could be an attractive alternative solution to the problem that the current frequency-domain identification algorithms are usually dependent on the attenuation factor. With this method, after continuously interacting with the environment, the optimal attenuation factor can be identified by the continuous action reinforcement learning automata (CARLA), and then the corresponding parameters could be estimated in the end. Moreover, the proposed method could be applied to time-varying systems online due to its online learning ability. The simulation results suggest that the presented approach can meet the requirement of identification accuracy in both square and non-square systems.
Record ID
Keywords
CARLA, closed-loop identification, MIMO, reinforcement learning
Subject
Suggested Citation
Jiang M, Jin Q. Multivariable System Identification Method Based on Continuous Action Reinforcement Learning Automata. (2019). LAPSE:2019.1131
Author Affiliations
Jiang M: School of Information Science and Technology, Beijing University of Chemical Technology, No. 15, Beisanhuan East Road, Beijing 100029, China
Jin Q: School of Information Science and Technology, Beijing University of Chemical Technology, No. 15, Beisanhuan East Road, Beijing 100029, China
[Login] to see author email addresses.
Jin Q: School of Information Science and Technology, Beijing University of Chemical Technology, No. 15, Beisanhuan East Road, Beijing 100029, China
[Login] to see author email addresses.
Journal Name
Processes
Volume
7
Issue
8
Article Number
E546
Year
2019
Publication Date
2019-08-17
ISSN
2227-9717
Version Comments
Original Submission
Other Meta
PII: pr7080546, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2019.1131
This Record
External Link

https://doi.org/10.3390/pr7080546
Publisher Version
Download
Meta
Record Statistics
Record Views
1106
Version History
[v1] (Original Submission)
Nov 5, 2019
Verified by curator on
Nov 5, 2019
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2019.1131
Record Owner
Calvin Tsay
Links to Related Works
[0.07 s]
