Reinforcement learning and A* search for the unit commitment problem

de Mars, Patrick; O’Sullivan, Aidan (2022) Reinforcement learning and A* search for the unit commitment problem. Energy and AI, 9, Article 100179. DOI: 10.1016/j.egyai.2022.100179. Green open access

de Mars_Reinforcement learning and A search for the unit commitment problem_VoR.pdf - Published Version


Abstract

Previous research has combined model-free reinforcement learning with model-based tree search methods to solve the unit commitment problem with stochastic demand and renewables generation. This approach was limited to shallow search depths and suffered from significant variability in run time across problem instances with varying complexity. To mitigate these issues, we extend this methodology to more advanced search algorithms based on A* search. First, we develop a problem-specific heuristic based on priority list unit commitment methods and apply this in Guided A* search, reducing run time by up to 94% with negligible impact on operating costs. In addition, we address the run time variability issue by employing a novel anytime algorithm, Guided IDA*, replacing the fixed search depth parameter with a time budget constraint. We show that Guided IDA* mitigates the run time variability of previous guided tree search algorithms and enables further operating cost reductions of up to 1%.
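The idea of replacing a fixed search depth with a time budget can be illustrated with a minimal sketch. This is not the authors' Guided IDA*: it omits the reinforcement learning policy, the priority-list heuristic, and stochastic demand, and it uses plain exhaustive depth-limited search with iterative deepening. The three-unit fleet, all cost values, and the function names (`step_cost`, `best_plan`, `anytime_first_action`) are hypothetical and chosen only for illustration.

```python
import time
from itertools import product

# Hypothetical toy fleet: (capacity in MW, marginal cost per MWh).
UNITS = [(100.0, 20.0), (80.0, 35.0), (50.0, 60.0)]
NO_LOAD_COST = 100.0     # assumed cost per committed unit per period
STARTUP_COST = 500.0     # assumed cost of switching a unit from off to on
LOST_LOAD_COST = 1000.0  # assumed penalty per MWh of unmet demand


def step_cost(prev, commit, demand):
    """Cost of one period with commitment `commit`, following commitment `prev`.

    Committed units are dispatched cheapest first (merit order).
    """
    remaining, cost = demand, 0.0
    for i in sorted(range(len(UNITS)), key=lambda k: UNITS[k][1]):
        if not commit[i]:
            continue
        capacity, price = UNITS[i]
        output = min(capacity, remaining)
        cost += output * price + NO_LOAD_COST
        if not prev[i]:
            cost += STARTUP_COST
        remaining -= output
    return cost + remaining * LOST_LOAD_COST


def best_plan(state, demands, depth):
    """Exhaustive depth-limited search; returns (total cost, first action)."""
    if depth == 0 or not demands:
        return 0.0, None
    best_cost, best_action = float("inf"), None
    for commit in product((0, 1), repeat=len(UNITS)):
        cost = step_cost(state, commit, demands[0])
        future, _ = best_plan(commit, demands[1:], depth - 1)
        if cost + future < best_cost:
            best_cost, best_action = cost + future, commit
    return best_cost, best_action


def anytime_first_action(state, demands, budget_s, clock=time.monotonic):
    """Deepen the search one level at a time until the time budget is spent.

    The budget is checked only between iterations, so a running iteration
    is never interrupted, and depth 1 is always completed.
    """
    deadline = clock() + budget_s
    action, depth = None, 0
    while depth < len(demands) and (action is None or clock() < deadline):
        depth += 1
        _, action = best_plan(state, demands, depth)
    return action, depth


if __name__ == "__main__":
    # Unit 0 is already on; demand peaks in the second period.
    action, depth = anytime_first_action((1, 0, 0), [90.0, 150.0, 90.0], 0.5)
    print("first action:", action, "depth reached:", depth)
```

In this sketch the depth reached depends on how much computation fits in the budget, so the run time is bounded by the budget plus the cost of the iteration in progress, whereas a fixed-depth search has a run time that varies with the size of each instance.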

Type: Article
Title: Reinforcement learning and A* search for the unit commitment problem
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.egyai.2022.100179
Publisher version: https://doi.org/10.1016/j.egyai.2022.100179
Language: English
Additional information: This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third-party material in this article are included in the Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Keywords: Unit commitment, Reinforcement learning, Tree search, Power systems
UCL classification: UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment > Bartlett School Env, Energy and Resources
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10152355