UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Network-aware compute and memory allocation in optically composable data centers with deep reinforcement learning and graph neural networks

Shabka, Zacharaya; Zervas, Georgios; (2023) Network-aware compute and memory allocation in optically composable data centers with deep reinforcement learning and graph neural networks. Journal of Optical Communications and Networking , 15 (2) p. 133. 10.1364/jocn.478944. Green open access

[thumbnail of jocn_allocation_rev.pdf]
Preview
Text
jocn_allocation_rev.pdf - Accepted Version

Download (1MB) | Preview

Abstract

Composable data center architectures promise a means of pooling resources remotely within data centers, allowing for both more flexibility and resource efficiency underlying the increasingly important infrastructure-as-a-service business. This can be accomplished by means of using an optically circuit switched backbone in the data center network (DCN), providing the required bandwidth and latency guarantees to ensure reliable performance when applications are run across non-local resource pools. However, resource allocation in this scenario requires both server-level and network-level resources to be co-allocated to requests. The online nature and underlying combinatorial complexity of this problem, alongside the typical scale of DCN topologies, make exact solutions impossible and heuristic-based solutions sub-optimal or non-intuitive to design. We demonstrate that deep reinforcement learning, where the policy is modeled by a graph neural network, can be used to learn effective network-aware and topologically scalable allocation policies end-to-end. Compared to state-of-the-art heuristics for network-aware resource allocation, the method achieves up to a 20% higher acceptance ratio, can achieve the same acceptance ratio as the best performing heuristic with <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline"> <mml:mn>3</mml:mn> <mml:mo>×<!-- × --></mml:mo> </mml:math> less networking resources available, and can maintain all-around performance when directly applied (with no further training) to DCN topologies with <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline"> <mml:mrow class="MJX-TeXAtom-ORD"> <mml:msup> <mml:mn>10</mml:mn> <mml:mn>2</mml:mn> </mml:msup> </mml:mrow> <mml:mo>×<!-- × --></mml:mo> </mml:math> more servers than the topologies seen during training.

Type: Article
Title: Network-aware compute and memory allocation in optically composable data centers with deep reinforcement learning and graph neural networks
Open access status: An open access version is available from UCL Discovery
DOI: 10.1364/jocn.478944
Publisher version: https://doi.org/10.1364/jocn.478944
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10164118
Downloads since deposit
90Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item