de Mars, P; O'Sullivan, A; (2021) Applying reinforcement learning and tree search to the unit commitment problem. Applied Energy, 302, Article 117519. 10.1016/j.apenergy.2021.117519.
Abstract
Recent advances in artificial intelligence have demonstrated the capability of reinforcement learning (RL) methods to outperform the state of the art in decision-making problems under uncertainty. Day-ahead unit commitment (UC), the scheduling of power generation based on forecasts, is a complex power systems task that is becoming more challenging in light of increasing uncertainty. While RL is a promising framework for solving the UC problem, the space of possible actions from a given state is exponential in the number of generators, so it is infeasible to apply existing RL methods in power systems larger than a few generators. Here we present a novel RL algorithm, guided tree search, which does not suffer from an exponential explosion in the action space with increasing number of generators. The method augments a tree search algorithm with a policy that intelligently reduces the branching factor. Using data from the GB power system, we demonstrate that guided tree search outperforms an unguided method in terms of computational complexity, while producing solutions that show no performance loss in terms of operating costs. We compare solutions against mixed-integer linear programming (MILP) and find that guided tree search outperforms a solution using reserve constraints, the current industry approach. The RL solutions exhibit complex behaviours that differ qualitatively from those of MILP, demonstrating the potential of RL as a decision support tool for human operators.
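The branching-factor argument in the abstract can be made concrete with a toy sketch. This is not the authors' implementation: the factorised per-generator policy `p_on`, the probability `threshold`, and both function names are assumptions introduced here purely for illustration. With N generators there are 2^N on/off commitment vectors, and the sketch keeps only those that a hypothetical policy rates at or above the threshold, pruning partial vectors as soon as they fall below it.

```python
from itertools import product


def all_actions(n_gens):
    """Unguided expansion: every on/off commitment vector (2**n_gens of them)."""
    return list(product((0, 1), repeat=n_gens))


def guided_actions(p_on, threshold):
    """Guided expansion under a hypothetical factorised policy.

    p_on[i] is the assumed probability that the policy switches generator i on.
    Actions are built one generator at a time; a partial action is dropped as
    soon as its probability falls below the threshold, because extending it
    can only lower the probability further.
    """
    partial = [((), 1.0)]
    for p in p_on:
        extended = []
        for prefix, prob in partial:
            for bit, q in ((1, p), (0, 1.0 - p)):
                if prob * q >= threshold:
                    extended.append((prefix + (bit,), prob * q))
        partial = extended
    return [action for action, _ in partial]


if __name__ == "__main__":
    p_on = [0.9, 0.8, 0.5, 0.2]  # illustrative values, not from the paper
    print(len(all_actions(len(p_on))), "unguided actions")
    print(guided_actions(p_on, threshold=0.15), "kept by the guided expansion")
```

With these illustrative numbers, the 16 candidate actions at a node shrink to 2. The number of children expanded is then governed by how concentrated the policy is rather than by the number of generators alone.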
Type: | Article |
---|---|
Title: | Applying reinforcement learning and tree search to the unit commitment problem |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1016/j.apenergy.2021.117519 |
Publisher version: | https://doi.org/10.1016/j.apenergy.2021.117519 |
Language: | English |
Additional information: | © 2021 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
Keywords: | Unit commitment, Reinforcement learning, Tree search, Deep learning, Power systems |
UCL classification: | UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of the Built Environment > Bartlett School Env, Energy and Resources |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10133018 |