Proceedings of ESCAPE 35ISSN: 2818-4734
Volume: 4 (2025)
Table of Contents
LAPSE:2025.0337
Published Article
LAPSE:2025.0337
MORL4PC: Multi-Objective Reinforcement Learning for Process Control
Niki Kotecha, Max Bloor, Calvin Tsay, Antonio del Rio Chanona
June 27, 2025
Abstract
In chemical process control, decision-making often involves balancing multiple conflicting objectives, such as maximizing production, minimizing energy consumption, and ensuring process safety. Traditional approaches for multi-objective optimization, such as linear programming and evolutionary algorithms, have proven effective but struggle to adapt in real-time to the dynamic and nonlinear nature of chemical processes. In this paper, we propose a framework that combines Reinforcement Learning (RL) with Multi-Objective Evolutionary Algorithms (MOEAs) to address these challenges. Specifically, we utilize MOEAs, such as NSGA-II, to optimize the parameter space of policy neural networks, resulting in a Pareto front of policies. This Pareto front provides a diverse set of policies that enable operators to dynamically switch control strategies based on real-time system conditions and prioritized objectives. Our proposed methodology is applied to a Controlled Stirred Tank Reactor (CSTR) case study, focusing on balancing the objectives of minimizing set point error and operational costs. The case studies highlight the adaptability of the method in dynamic environments, including scenarios involving fouling, showcasing its resilience and flexibility in complex, nonlinear systems. This work demonstrates the potential of combining RL and MOEAs to advance decision-making in chemical process control, providing a robust and adaptable framework for navigating environments with changing objectives. The methodology can also be extended to other industrial applications with similarly challenging multi-objective requirements.
Keywords
Industry 40, Machine Learning, Process Control, Reinforcement Learning
Suggested Citation
Kotecha N, Bloor M, Tsay C, Chanona ADR. MORL4PC: Multi-Objective Reinforcement Learning for Process Control. Systems and Control Transactions 4:1151-1156 (2025) https://doi.org/10.69997/sct.161830
Author Affiliations
Kotecha N: Department of Chemical Engineering, Sargent Centre for Process Systems Engineering, Imperial College London, SW7 2AZ, United Kingdom
Bloor M: Department of Chemical Engineering, Sargent Centre for Process Systems Engineering, Imperial College London, SW7 2AZ, United Kingdom
Tsay C: Department of Computing, Imperial College London, SW7 2AZ, United Kingdom
Chanona ADR: Department of Chemical Engineering, Sargent Centre for Process Systems Engineering, Imperial College London, SW7 2AZ, United Kingdom
Journal Name
Systems and Control Transactions
Volume
4
First Page
1151
Last Page
1156
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1151-1156-1533-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0337
This Record
External Link

https://doi.org/10.69997/sct.161830
Article DOI
Download
Files
Jun 27, 2025
Main Article
License
CC BY-SA 4.0
Meta
Record Statistics
Record Views
1111
Version History
[v1] (Original Submission)
Jun 27, 2025
 
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0337
 
Record Owner
PSE Press
Links to Related Works
Directly Related to This Work
Article DOI
References Cited
  1. Bloor, M.; Ahmed, A.; Kotecha, N.; Tsay, C.; Mercangöz, M.; Del Rio-Chanona, A. Hierarchical Reinforcement Learning for Plantwide Control. In Computer Aided Chemical Engineering; Elsevier, 2024; Vol. 53, pp 1639-1644. https://doi.org/10.1016/B978-0-443-28824-1.50274-X
  2. Van Moffaert, K.; Drugan, M. M.; Nowe, A. Scalarized Multi-Objective Reinforcement Learning: Novel Design Techniques. In 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL); IEEE: Singapore, Singapore, 2013; pp 191-199. https://doi.org/10.1109/ADPRL.2013.6615007
  3. Van Moffaert, K.; Nowé, A. Multi-Objective Reinforcement Learning Using Sets of Pareto Dominating Policies. J. Mach. Learn. Res. 2014, 14 (1), 3483-3512
  4. Felten, F.; Talbi, E.-G.; Danoy, G. Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework. J. Artif. Intell. Res. 2024, 79, 679-723. https://doi.org/10.1613/jair.1.15702
  5. Hayes, C. F.; Radulescu, R.; Bargiacchi, E.; Källström, J.; Macfarlane, M.; Reymond, M.; Verstraeten, T.; Zintgraf, L. M.; Dazeley, R.; Heintz, F.; Howley, E.; Irissappane, A. A.; Mannion, P.; Nowé, A.; Ramos, G.; Restelli, M.; Vamplew, P.; Roijers, D. M. A Practical Guide to Multi-Objective Reinforcement Learning and Planning. Auton. Agents Multi-Agent Syst. 2022, 36 (1), 26. https://doi.org/10.1007/s10458-022-09552-y
  6. Kotecha, N.; Chanona, A. del R. Leveraging Graph Neural Networks and Multi-Agent Reinforcement Learning for Inventory Control in Supply Chains. arXiv October 24, 2024. https://doi.org/10.1016/j.compchemeng.2025.109111
  7. Salimans, T.; Ho, J.; Chen, X.; Sidor, S.; Sutskever, I. Evolution Strategies as a Scalable Alternative to Reinforcement Learning. arXiv September 7, 2017.
  8. Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. A Fast and Elitist Multiobjective Genetic Algorithm: NSGA-II. IEEE Trans. Evol. Comput. 2002, 6 (2), 182-197. https://doi.org/10.1109/4235.996017
  9. Qiu, Y.; Kotecha, N.; Del Rio Chanona, A. Leveraging Reinforcement Learning and Evolutionary Strategies for Dynamic Multi Objective Decision Making in Supply Chain Management. IFAC-Pap. 2024, 58 (14), 598-603. https://doi.org/10.1016/j.ifacol.2024.08.402
  10. Bloor, M.; Torraca, J.; Sandoval, I. O.; Ahmed, A.; White, M.; Mercangöz, M.; Tsay, C.; Chanona, E. A. D. R.; Mowbray, M. PC-Gym: Benchmark Environments For Process Control Problems. arXiv December 5, 2024.
(0.08 seconds)

[0.09 s]