PSE Community.org

ISSN: 2818-4734
Volume: 5 (2026)
Table of Contents

LAPSE:2026.0465

Published Article

LAPSE:2026.0465

A Graph Reinforcement Learning Framework for Batch Process Scheduling in State-Task Networks

Syu-Ning Johnn, Victor-Alexandru Darvariu, Vassilis M. Charitopoulos

June 12, 2026

Abstract
Batch production scheduling of resources to meet fluctuating product demand is a critical topic in the process industry. Existing optimisation approaches, based on heuristic and exact methods, trade off solution optimality and scalability to large problems. In this work, we investigate deep reinforcement learning as a powerful alternative in order to learn heuristics for batch scheduling. We formulate the batch scheduling problem as a Markov decision process operating on a state-task network representation encoded using graph neural networks, capturing relevant structural inductive biases. We propose a centralised training with decentralised execution architecture, in which agents placed on machines individually choose which tasks to complete using a global view of the network, cooperating towards task schedules that optimise the final production quantity. Preliminary results demonstrate that the proposed end-to-end framework learns to construct task schedules comparable to the optimal solution on small instances unseen during training, exhibiting strong potential for extension to more general graph structures and better scalability.

Record ID

LAPSE:2026.0465

Keywords

Batch Process Scheduling, Deep-Q Networks, Graph Neural Networks, Markov Decision Process, Reinforcement Learning

Subject

Modelling and Simulations

Suggested Citation

Johnn S, Darvariu V, Charitopoulos VM. A Graph Reinforcement Learning Framework for Batch Process Scheduling in State-Task Networks. Systems and Control Transactions 5:2099-2106 (2026) https://doi.org/10.69997/sct.190792

Author Affiliations

Johnn S: Department of Chemical Engineering & The Sargent Centre for Process Systems Engineering, University College London, London, United Kingdom [ORCID]
Darvariu V: Oxford Robotics Institute, Department of Engineering Science, University of Oxford, Oxford, United Kingdom [ORCID]
Charitopoulos VM: Department of Chemical Engineering & The Sargent Centre for Process Systems Engineering, University College London, London, United Kingdom [ORCID]
[Login] to see author email addresses.

Journal Name

Systems and Control Transactions

Volume

First Page

2099

Last Page

2106

Year

2026

Publication Date

2026-06-12

DOI: