The ridgelet prior: A covariance function approach to prior specification for Bayesian neural networks

Matsubara, T; Oates, CJ; Briol, FX; (2021) The ridgelet prior: A covariance function approach to prior specification for Bayesian neural networks. Journal of Machine Learning Research, 22 pp. 1-57. Green open access

20-1300.pdf - Published Version
Abstract

Bayesian neural networks attempt to combine the strong predictive performance of neural networks with formal quantification of uncertainty associated with the predictive output in the Bayesian framework. However, it remains unclear how to endow the parameters of the network with a prior distribution that is meaningful when lifted into the output space of the network. A possible solution is proposed that enables the user to posit an appropriate Gaussian process covariance function for the task at hand. Our approach constructs a prior distribution for the parameters of the network, called a ridgelet prior, that approximates the posited Gaussian process in the output space of the network. In contrast to existing work on the connection between neural networks and Gaussian processes, our analysis is non-asymptotic, with finite sample-size error bounds provided. This establishes the universality property that a Bayesian neural network can approximate any Gaussian process whose covariance function is sufficiently regular. Our experimental assessment is limited to a proof-of-concept, where we demonstrate that the ridgelet prior can out-perform an unstructured prior on regression problems for which a suitable Gaussian process prior can be provided.
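The problem the abstract describes, that a prior on network parameters may not be meaningful once lifted into output space, can be illustrated with a small numerical sketch. The code below is not the ridgelet prior construction from the paper. It only assumes a one-hidden-layer tanh network with i.i.d. standard normal parameters (an "unstructured" prior), samples functions from it, and compares the induced output covariance with a squared-exponential Gaussian process covariance chosen purely for illustration. All function names, the architecture, and the kernel choice are hypothetical.

```python
import numpy as np


def sample_bnn_outputs(x, n_hidden=50, n_samples=2000, seed=0):
    """Sample network outputs at points x under i.i.d. N(0, 1) parameters.

    Assumed architecture (illustrative only): one hidden tanh layer,
    scalar input, scalar output, no output bias.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float).reshape(-1, 1)        # (n_points, 1)
    w0 = rng.standard_normal((n_samples, 1, n_hidden))   # input-to-hidden weights
    b0 = rng.standard_normal((n_samples, 1, n_hidden))   # hidden biases
    w1 = rng.standard_normal((n_samples, n_hidden, 1))   # hidden-to-output weights
    hidden = np.tanh(x[None, :, :] @ w0 + b0)            # (n_samples, n_points, n_hidden)
    # Scale by 1/sqrt(n_hidden) so the output variance does not grow with width.
    return (hidden @ w1)[..., 0] / np.sqrt(n_hidden)     # (n_samples, n_points)


def empirical_covariance(outputs):
    """Sample covariance of the outputs across parameter draws."""
    centered = outputs - outputs.mean(axis=0, keepdims=True)
    return centered.T @ centered / (outputs.shape[0] - 1)


def squared_exponential(x, lengthscale=1.0):
    """Squared-exponential covariance matrix, used here as an example target."""
    x = np.asarray(x, dtype=float)
    diff = x[:, None] - x[None, :]
    return np.exp(-0.5 * (diff / lengthscale) ** 2)


x_grid = np.linspace(-3.0, 3.0, 7)
outputs = sample_bnn_outputs(x_grid)
induced_cov = empirical_covariance(outputs)   # covariance implied by the parameter prior
target_cov = squared_exponential(x_grid)      # covariance a user might actually want
gap = np.abs(induced_cov - target_cov).max()  # largest entrywise mismatch
print(f"max |induced - target| = {gap:.3f}")
```

The printed gap measures how far the output-space covariance implied by the unstructured prior is from the example target. According to the abstract, the ridgelet prior is designed so that this kind of mismatch is controlled, with non-asymptotic error bounds.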

Type:	Article
Title:	The ridgelet prior: A covariance function approach to prior specification for Bayesian neural networks
Open access status:	An open access version is available from UCL Discovery
Publisher version:	https://jmlr.csail.mit.edu/papers/volume22/20-1300...
Language:	English
Additional information:	CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v22/20-1300.html.
Keywords:	Bayesian neural networks, Gaussian processes, prior selection, ridgelet transform, statistical learning theory
UCL classification:	UCL; UCL > Provost and Vice Provost Offices > UCL BEAMS; UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences; UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI:	https://discovery-pp.ucl.ac.uk/id/eprint/10133989