Unlocking the Flexibility of District Heating Pipeline Energy Storage with Reinforcement Learning

Ksenija Stepanovic; Jichen Wu; Rob Everhardt; Mathijs de Weerdt

LAPSE:2023.13642

Published Article

LAPSE:2023.13642

Unlocking the Flexibility of District Heating Pipeline Energy Storage with Reinforcement Learning

Ksenija Stepanovic, Jichen Wu, Rob Everhardt, Mathijs de Weerdt

March 1, 2023

Abstract
The integration of pipeline energy storage in the control of a district heating system can lead to profit gain, for example by adjusting the electricity production of a combined heat and power (CHP) unit to the fluctuating electricity price. The uncertainty from the environment, the computational complexity of an accurate model, and the scarcity of placed sensors in a district heating system make the operational use of pipeline energy storage challenging. A vast majority of previous works determined a control strategy by a decomposition of a mixed-integer nonlinear model and significant simplifications. To mitigate consequential stability, feasibility, and computational complexity challenges, we model CHP economic dispatch as a Markov decision process. We use a reinforcement learning (RL) algorithm to estimate the system’s dynamics through interactions with the simulation environment. The RL approach is compared with a detailed nonlinear mathematical optimizer on day-ahead and real-time electricity markets and two district heating grid models. The proposed method achieves moderate profit impacted by environment stochasticity. The advantages of the RL approach are reflected in three aspects: stability, feasibility, and time scale flexibility. From this, it can be concluded that RL is a promising alternative for real-time control of complex, nonlinear industrial systems.

Record ID

LAPSE:2023.13642

Keywords

4th generation district heating, combined heat and power economic dispatch, Markov decision process, mixed-integer nonlinear program, pipeline energy storage, Q-learning

Subject

Optimization

Suggested Citation

Stepanovic K, Wu J, Everhardt R, de Weerdt M. Unlocking the Flexibility of District Heating Pipeline Energy Storage with Reinforcement Learning. (2023). LAPSE:2023.13642

Author Affiliations

Stepanovic K: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands [ORCID]
Wu J: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands; Flex Technologies, Atoomweg 7, 3542 AA Utrecht, The Netherlands
Everhardt R: Flex Technologies, Atoomweg 7, 3542 AA Utrecht, The Netherlands
de Weerdt M: Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, Van Mourik Broekmanweg 6, 2628 XE Delft, The Netherlands [ORCID]

Journal Name

Energies

Volume

15

Issue

9

First Page

3290

Year

2022

Publication Date

2022-04-30