UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

The ridgelet prior: A covariance function approach to prior specification for bayesian neural networks

Matsubara, T; Oates, CJ; Briol, FX; (2021) The ridgelet prior: A covariance function approach to prior specification for bayesian neural networks. Journal of Machine Learning Research , 22 pp. 1-57. Green open access

[thumbnail of 20-1300.pdf]
Preview
Text
20-1300.pdf - Published Version

Download (4MB) | Preview

Abstract

Bayesian neural networks attempt to combine the strong predictive performance of neural networks with formal quantification of uncertainty associated with the predictive output in the Bayesian framework. However, it remains unclear how to endow the parameters of the network with a prior distribution that is meaningful when lifted into the output space of the network. A possible solution is proposed that enables the user to posit an appropriate Gaussian process covariance function for the task at hand. Our approach constructs a prior distribution for the parameters of the network, called a ridgelet prior, that approximates the posited Gaussian process in the output space of the network. In contrast to existing work on the connection between neural networks and Gaussian processes, our analysis is non-asymptotic, with finite sample-size error bounds provided. This establishes the universality property that a Bayesian neural network can approximate any Gaussian process whose covariance function is sufficiently regular. Our experimental assessment is limited to a proof-of-concept, where we demonstrate that the ridgelet prior can out-perform an unstructured prior on regression problems for which a suitable Gaussian process prior can be provided.

Type: Article
Title: The ridgelet prior: A covariance function approach to prior specification for bayesian neural networks
Open access status: An open access version is available from UCL Discovery
Publisher version: https://jmlr.csail.mit.edu/papers/volume22/20-1300...
Language: English
Additional information: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v22/20-1300.html.
Keywords: Bayesian neural networks, Gaussian processes, prior selection, ridgelet transform, statistical learning theory
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10133989
Downloads since deposit
912Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item