UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator

Ferianc, Martin; Que, Zhiqiang; Fan, Hongxiang; Luk, Wayne; Rodrigues, Miguel; (2021) Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator. In: Proceedings of the 2021 International Conference on Field-Programmable Technology (ICFPT). (pp. pp. 1-10). IEEE Green open access

[thumbnail of 2106.06048v3.pdf]
Preview
Text
2106.06048v3.pdf - Accepted Version

Download (1MB) | Preview

Abstract

Neural networks have demonstrated their outstanding performance in a wide range of tasks. Specifically recurrent architectures based on long-short term memory (LSTM) cells have manifested excellent capability to model time dependencies in real-world data. However, standard recurrent architectures cannot estimate their uncertainty which is essential for safety-critical applications such as in medicine. In contrast, Bayesian recurrent neural networks (RNNs) are able to provide uncertainty estimation with improved accuracy. Nonetheless, Bayesian RNNs are computationally and memory demanding, which limits their practicality despite their advantages. To address this issue, we propose an FPGA-based hardware design to accelerate Bayesian LSTM-based RNNs. To further improve the overall algorithmic-hardware performance, a co-design framework is proposed to explore the most fitting algorithmic-hardware configurations for Bayesian RNNs. We conduct extensive experiments on healthcare applications to demonstrate the improvement of our design and the effectiveness of our framework. Compared with GPU implementation, our FPGA-based design can achieve up to 10 times speedup with nearly 106 times higher energy efficiency. To the best of our knowledge, this is the first work targeting acceleration of Bayesian RNNs on FPGAs.

Type: Proceedings paper
Title: Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator
Event: 2021 International Conference on Field-Programmable Technology (ICFPT)
ISBN-13: 978-1-6654-2010-5
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/ICFPT52863.2021.9609847
Publisher version: https://doi.org/10.1109/ICFPT52863.2021.9609847
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
Keywords: Recurrent neural networks, Bayesian inference, Field-programmable gate array, Hardware acceleration
UCL classification: UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities
UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities > Dept of Information Studies
UCL > Provost and Vice Provost Offices > UCL SLASH
UCL
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10150058
Downloads since deposit
6,391Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item