Norcliffe, Alexander Alexander Luke Ian;
Deisenroth, Marc Peter;
(2023)
Faster Training of Neural ODEs Using Gauß-Legendre Quadrature.
Transactions on Machine Learning Research
, 2023
(8)
Preview |
PDF
979_Faster_Training_of_Neural_.pdf - Published Version Download (1MB) | Preview |
Abstract
Neural ODEs demonstrate strong performance in generative and time-series modelling. However, training them via the adjoint method is slow compared to discrete models due to the requirement of numerically solving ODEs. To speed neural ODEs up, a common approach is to regularise the solutions. However, this approach may affect the expressivity of the model; when the trajectory itself matters, this is particularly important. In this paper, we propose an alternative way to speed up the training of neural ODEs. The key idea is to speed up the adjoint method by using Gauß-Legendre quadrature to solve integrals faster than ODE-based methods while remaining memory efficient. We also extend the idea to training SDEs using the Wong-Zakai theorem, by training a corresponding ODE and transferring the parameters. Our approach leads to faster training of neural ODEs, especially for large models. It also presents a new way to train SDE-based models.
Type: | Article |
---|---|
Title: | Faster Training of Neural ODEs Using Gauß-Legendre Quadrature |
Open access status: | An open access version is available from UCL Discovery |
Publisher version: | https://openreview.net/forum?id=f0FSDAy1bU |
Language: | English |
Additional information: | © The Author(s), 2023. This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. https://creativecommons.org/licenses/by/4.0/ |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10195961 |
Archive Staff Only
![]() |
View Item |