PSE Community.org

ISSN: 2818-4734
Volume: 5 (2026)
Table of Contents

LAPSE:2026.0510

Published Article

LAPSE:2026.0510

Enhancing Control in Chemical Processes using Reinforcement from Human Feedback

Gerold H, Brandner D, Lucia S

June 12, 2026

Abstract
Reinforcement learning (RL) presents a promising alternative to model-based advanced control schemes, such as model predictive control (MPC), whose application can be limited by highly complex system models. However, incorporating constraints in RL remains challenging and formulating a suitable optimization objective is not straightforward. Reinforcement learning from human feedback (RLHF) offers an approach to derive the RL reward function from human expert preferences, enabling the incorporation of process knowledge. In this work, we present the application of RLHF to fine-tune an approximate MPC controller with suboptimal performance. We demonstrate that combining conventional reward formulations with RLHF, along with varying trajectory segment lengths for collecting human feedback, improves the control methodology for a batch bioreactor by enhancing safety and accounting for long-term effects. Furthermore, direct-preference based policy optimization (DPPO) represents a promising alternative for directly fine-tuning learning-based controllers while circumventing explicit reward model design.

Record ID

LAPSE:2026.0510

Keywords

human feedback, Model predictive control, reinforcement learning

Subject

Modelling and Simulations

Suggested Citation

H G, D B, S L. Enhancing Control in Chemical Processes using Reinforcement from Human Feedback. Systems and Control Transactions 5:2457-2465 (2026) https://doi.org/10.69997/sct.156501

Author Affiliations

H G: Technische Universität Dortmund, Laboratory of Process Automation Systems, Emil-Figge-Straße 70, Dortmund 44227, Germany [ORCID]
D B: Technische Universität Dortmund, Laboratory of Process Automation Systems, Emil-Figge-Straße 70, Dortmund 44227, Germany [ORCID]
S L: Technische Universität Dortmund, Laboratory of Process Automation Systems, Emil-Figge-Straße 70, Dortmund 44227, Germany [ORCID]
[Login] to see author email addresses.

Journal Name

Systems and Control Transactions

Volume

First Page

2457

Last Page

2465

Year

2026

Publication Date

2026-06-12

DOI: