LAPSE:2025.0345
Published Article

LAPSE:2025.0345
Optimal Energy Scheduling for Battery and Hydrogen Storage Systems Using Reinforcement Learning
June 27, 2025
Abstract
Optimal energy scheduling for sector-coupled multi-energy systems is becoming increasingly important as renewable energies such as wind and photovoltaics continue to expand. They are very volatile and difficult to predict. This creates a deviation between generation and demand that can be compensated for by energy storage technologies. For these, rule-based control is well established in industry, and mixed-integer model predictive control (MPC) is an area of research that promises the best results, usually regarding minimal costs. Drawbacks of MPC include the need for an adequate system model, often associated with high modeling effort, high computational effort for larger prediction horizons, and complications with stochastic variables. In this work, Reinforcement Learning is used in an attempt to overcome these difficulties without applying elaborate mixed-integer linear programming. The self-learning algorithm, which requires no explicit knowledge of the system behavior, can learn a control policy and uncertainties of the variables just by interaction with the (simulated) system model. It is demonstrated that Reinforcement Learning (exchange factor = 36.8 %) can learn complex system behavior with comparable quality to model predictive control (ex. = 32.4 %) and outperforms rule-based control (ex. = 41.8 %). This is done in a case study with the goal of minimizing the exchange of energy with the grid, with a battery and hydrogen system providing storage flexibility. These results were achieved in the context that the Reinforcement Learning agent only has instantaneous rather than predictive information, i.e., a very limited state of information compared to the MPC. The trained policy is then deployed while significantly decreasing the computational effort.
Optimal energy scheduling for sector-coupled multi-energy systems is becoming increasingly important as renewable energies such as wind and photovoltaics continue to expand. They are very volatile and difficult to predict. This creates a deviation between generation and demand that can be compensated for by energy storage technologies. For these, rule-based control is well established in industry, and mixed-integer model predictive control (MPC) is an area of research that promises the best results, usually regarding minimal costs. Drawbacks of MPC include the need for an adequate system model, often associated with high modeling effort, high computational effort for larger prediction horizons, and complications with stochastic variables. In this work, Reinforcement Learning is used in an attempt to overcome these difficulties without applying elaborate mixed-integer linear programming. The self-learning algorithm, which requires no explicit knowledge of the system behavior, can learn a control policy and uncertainties of the variables just by interaction with the (simulated) system model. It is demonstrated that Reinforcement Learning (exchange factor = 36.8 %) can learn complex system behavior with comparable quality to model predictive control (ex. = 32.4 %) and outperforms rule-based control (ex. = 41.8 %). This is done in a case study with the goal of minimizing the exchange of energy with the grid, with a battery and hydrogen system providing storage flexibility. These results were achieved in the context that the Reinforcement Learning agent only has instantaneous rather than predictive information, i.e., a very limited state of information compared to the MPC. The trained policy is then deployed while significantly decreasing the computational effort.
Record ID
Keywords
Model-Predictive-Control MPC, Optimal Energy Scheduling, Reinforcement Learning RL
Subject
Suggested Citation
Zebenholzer M, Kasper L, Schirrer A, Hofmann R. Optimal Energy Scheduling for Battery and Hydrogen Storage Systems Using Reinforcement Learning. Systems and Control Transactions 4:1201-1207 (2025) https://doi.org/10.69997/sct.134052
Author Affiliations
Zebenholzer M: TU Wien, Institute of Energy Systems and Thermodynamics, Vienna, Austria
Kasper L: TU Wien, Institute of Energy Systems and Thermodynamics, Vienna, Austria
Schirrer A: TU Wien, Institute of Mechanics and Mechatronics, Vienna, Austria
Hofmann R: TU Wien, Institute of Energy Systems and Thermodynamics, Vienna, Austria
Kasper L: TU Wien, Institute of Energy Systems and Thermodynamics, Vienna, Austria
Schirrer A: TU Wien, Institute of Mechanics and Mechatronics, Vienna, Austria
Hofmann R: TU Wien, Institute of Energy Systems and Thermodynamics, Vienna, Austria
Journal Name
Systems and Control Transactions
Volume
4
First Page
1201
Last Page
1207
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1201-1207-1653-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0345
This Record
External Link

https://doi.org/10.69997/sct.134052
Article DOI
Download
Meta
Record Statistics
Record Views
1372
Version History
[v1] (Original Submission)
Jun 27, 2025
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0345
Record Owner
PSE Press
Links to Related Works
References Cited
- Le, T. S., et al. (2023). Optimal sizing of renewable energy storage: A techno-economic analysis of hydrogen, battery and hybrid systems considering degradation and seasonal storage. Applied Energy, 336, 120817. https://doi.org/10.1016/j.apenergy.2023.120817
- Van, L. P., et al. (2023). Review of hydrogen technologies based microgrid: Energy management systems, challenges and future recommendations. International Journal of Hydrogen Energy, 48(38), 14127-14148. https://doi.org/10.1016/j.ijhydene.2022.12.345
- Holtwerth, A., et al. (2024). Closed-loop model predictive control of a hybrid battery-hydrogen energy storage system using mixed-integer linear programming. Energy Conversion and Management: X, 22, 100561. https://doi.org/10.1016/j.ecmx.2024.100561
- Perera, A. T. D., & Kamalaruban, P. (2021). Applications of reinforcement learning in energy systems. Renewable and Sustainable Energy Reviews, 137, 110618. https://doi.org/10.1016/j.rser.2020.110618
- Schwenzer, M., et al. (2021). Review on model predictive control: An engineering perspective. The International Journal of Advanced Manufacturing Technology, 117(5-6), 1327-1349. https://doi.org/10.1007/s00170-021-07682-3
- Jin, M., & Lavaei, J. (2020). Stability-Certified Reinforcement Learning: A Control-Theoretic Perspective. IEEE Access, 8, 229086-229100. https://doi.org/10.1109/ACCESS.2020.3045114
- ENTSO-E Transparency Platform. (n.d.). Retrieved January 14, 2025, from https://transparency.entsoe.eu
- Schulman, J., et al. (2017). Proximal Policy Optimization Algorithms (arXiv:1707.06347). arXiv.
- The MathWorks Inc. (2024). MATLAB version: 23.2 (R2023b), Natick, Massachusetts: The MathWorks Inc. https://www.mathworks.com
- Löfberg, Johan. "A toolbox for modeling and optimization in MATLAB." Proceedings of the CACSD Conference (2004), https://www.mathworks.com
- The MathWorks Inc. (2024). Reinforcement Learning Toolbox version: 23.2 (R2023b), Natick, Massachusetts: The MathWorks Inc. https://www.mathworks.com
- Gurobi Optimization, LLC. "Gurobi Optimizer Reference Manual." (2024). https://www.gurobi.com
[0.26 s]

