Production-Scalable Control Optimisation for Optical Switching With Deep Reinforcement Learning

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

Production-Scalable Control Optimisation for Optical Switching With Deep Reinforcement Learning

Shabka, Zacharaya; Enrico, Michael; Almeida, Paulo; Parsons, Nick; Zervas, Georgios; (2024) Production-Scalable Control Optimisation for Optical Switching With Deep Reinforcement Learning. Journal of Lightwave Technology , 42 (6) pp. 2018-2025. 10.1109/JLT.2023.3328330. Green open access

Preview

Text
Zervas_Production-Scalable_Control_Optimisation_for_Optical_Switching_With_Deep_Reinforcement_Learning.pdf
Download (910kB) | Preview

Abstract

Proportional-integral-derivative(PID) control underlies >95% of automation across many industries including high-radix optical circuit switches based on PID-controlled piezoelectric-actuator-based beam steering. To meet performance metric requirements (switching speed and actuator stability for optical switches) PID control requires three parameters to be optimally tuned (aka PID tuning). Typical PID tuning methods involve slow, exhaustive and often hands-on search processes which waste engineering resources and slow down production. Moreover, manufacturing tolerances in production mean that actuators are non-identical and so controlled differently by the same PID parameters. This work presents a novel PID parameter optimisation method (patent pending) based on deep reinforcement learning which avoids tuning procedures altogether whilst improving switching performance. On a market leading optical switching product based on electromechanical control processes, compared against the manufacturer's production parameter set, average switching speed is improved 22% whilst 5× more (17.5% to 87.5%) switching events stabilise in ≤20ms (the ideal worst-case performance) without any practical deterioration in other performance metrics such as overshoot. The method also generates actuator-tailored PID parameters in O(milliseconds) without any interaction with the device using only generic information about the actuator (known from manufacturing and characterisation processes). This renders the method highly applicable to mass-manufacturing scenarios generally. Training is achieved with just a small number of actuators and can generally complete in O(hours) , so can be easily repeated if needed (e.g. if new hardware is built using entirely different types of actuators).

Type:	Article
Title:	Production-Scalable Control Optimisation for Optical Switching With Deep Reinforcement Learning
Open access status:	An open access version is available from UCL Discovery
DOI:	10.1109/JLT.2023.3328330
Publisher version:	https://doi.org/10.1109/JLT.2023.3328330
Language:	English
Additional information:	This version is the author-accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords:	Optical switches, Control systems, Actuators, Process control, Optimization, Tuning, Production
UCL classification:	UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng
URI:	https://discovery-pp.ucl.ac.uk/id/eprint/10180661

Downloads since deposit

400Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item