Reinforcement learning and A* search for the unit commitment problem

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

Reinforcement learning and A* search for the unit commitment problem

de Mars, Patrick; O’Sullivan, Aidan; (2022) Reinforcement learning and A* search for the unit commitment problem. Energy and AI , 9 , Article 100179. 10.1016/j.egyai.2022.100179. Green open access

Preview

Text
de Mars_Reinforcement learning and A search for the unit commitment problem_VoR.pdf - Published Version
Download (1MB) | Preview

Abstract

Previous research has combined model-free reinforcement learning with model-based tree search methods to solve the unit commitment problem with stochastic demand and renewables generation. This approach was limited to shallow search depths and suffered from significant variability in run time across problem instances with varying complexity. To mitigate these issues, we extend this methodology to more advanced search algorithms based on A* search. First, we develop a problem-specific heuristic based on priority list unit commitment methods and apply this in Guided A* search, reducing run time by up to 94% with negligible impact on operating costs. In addition, we address the run time variability issue by employing a novel anytime algorithm, Guided IDA*, replacing the fixed search depth parameter with a time budget constraint. We show that Guided IDA* mitigates the run time variability of previous guided tree search algorithms and enables further operating cost reductions of up to 1%.

Type:	Article
Title:	Reinforcement learning and A* search for the unit commitment problem
Open access status:	An open access version is available from UCL Discovery
DOI:	10.1016/j.egyai.2022.100179
Publisher version:	https://doi.org/10.1016/j.egyai.2022.100179
Language:	English
Additional information:	This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third-party material in this article are included in the Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Keywords:	Unit commitment, Reinforcement learning, Tree search, Power systems
UCL classification:	UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment > Bartlett School Env, Energy and Resources UCL > Provost and Vice Provost Offices > UCL BEAMS UCL
URI:	https://discovery-pp.ucl.ac.uk/id/eprint/10152355

Downloads since deposit

3,300Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item