LAPSE:2023.13642
Published Article

LAPSE:2023.13642
Unlocking the Flexibility of District Heating Pipeline Energy Storage with Reinforcement Learning
March 1, 2023
Abstract
The integration of pipeline energy storage in the control of a district heating system can lead to profit gain, for example by adjusting the electricity production of a combined heat and power (CHP) unit to the fluctuating electricity price. The uncertainty from the environment, the computational complexity of an accurate model, and the scarcity of placed sensors in a district heating system make the operational use of pipeline energy storage challenging. A vast majority of previous works determined a control strategy by a decomposition of a mixed-integer nonlinear model and significant simplifications. To mitigate consequential stability, feasibility, and computational complexity challenges, we model CHP economic dispatch as a Markov decision process. We use a reinforcement learning (RL) algorithm to estimate the system’s dynamics through interactions with the simulation environment. The RL approach is compared with a detailed nonlinear mathematical optimizer on day-ahead and real-time electricity markets and two district heating grid models. The proposed method achieves moderate profit impacted by environment stochasticity. The advantages of the RL approach are reflected in three aspects: stability, feasibility, and time scale flexibility. From this, it can be concluded that RL is a promising alternative for real-time control of complex, nonlinear industrial systems.
The integration of pipeline energy storage in the control of a district heating system can lead to profit gain, for example by adjusting the electricity production of a combined heat and power (CHP) unit to the fluctuating electricity price. The uncertainty from the environment, the computational complexity of an accurate model, and the scarcity of placed sensors in a district heating system make the operational use of pipeline energy storage challenging. A vast majority of previous works determined a control strategy by a decomposition of a mixed-integer nonlinear model and significant simplifications. To mitigate consequential stability, feasibility, and computational complexity challenges, we model CHP economic dispatch as a Markov decision process. We use a reinforcement learning (RL) algorithm to estimate the system’s dynamics through interactions with the simulation environment. The RL approach is compared with a detailed nonlinear mathematical optimizer on day-ahead and real-time electricity markets and two district heating grid models. The proposed method achieves moderate profit impacted by environment stochasticity. The advantages of the RL approach are reflected in three aspects: stability, feasibility, and time scale flexibility. From this, it can be concluded that RL is a promising alternative for real-time control of complex, nonlinear industrial systems.
Record ID
Keywords
4th generation district heating, combined heat and power economic dispatch, Markov decision process, mixed-integer nonlinear program, pipeline energy storage, Q-learning
Subject
Suggested Citation
Stepanovic K, Wu J, Everhardt R, de Weerdt M. Unlocking the Flexibility of District Heating Pipeline Energy Storage with Reinforcement Learning. (2023). LAPSE:2023.13642
Author Affiliations
Stepanovic K: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands [ORCID]
Wu J: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands; Flex Technologies, Atoomweg 7, 3542 AA Utrecht, The Netherlands
Everhardt R: Flex Technologies, Atoomweg 7, 3542 AA Utrecht, The Netherlands
de Weerdt M: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands [ORCID]
Wu J: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands; Flex Technologies, Atoomweg 7, 3542 AA Utrecht, The Netherlands
Everhardt R: Flex Technologies, Atoomweg 7, 3542 AA Utrecht, The Netherlands
de Weerdt M: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands [ORCID]
Journal Name
Energies
Volume
15
Issue
9
First Page
3290
Year
2022
Publication Date
2022-04-30
ISSN
1996-1073
Version Comments
Original Submission
Other Meta
PII: en15093290, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2023.13642
This Record
External Link

https://doi.org/10.3390/en15093290
Publisher Version
Download
Meta
Record Statistics
Record Views
186
Version History
[v1] (Original Submission)
Mar 1, 2023
Verified by curator on
Mar 1, 2023
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2023.13642
Record Owner
Auto Uploader for LAPSE
Links to Related Works
