Matsubara, T;
Oates, CJ;
Briol, FX;
(2021)
The ridgelet prior: A covariance function approach to prior specification for bayesian neural networks.
Journal of Machine Learning Research
, 22
pp. 1-57.
Preview |
Text
20-1300.pdf - Published Version Download (4MB) | Preview |
Abstract
Bayesian neural networks attempt to combine the strong predictive performance of neural networks with formal quantification of uncertainty associated with the predictive output in the Bayesian framework. However, it remains unclear how to endow the parameters of the network with a prior distribution that is meaningful when lifted into the output space of the network. A possible solution is proposed that enables the user to posit an appropriate Gaussian process covariance function for the task at hand. Our approach constructs a prior distribution for the parameters of the network, called a ridgelet prior, that approximates the posited Gaussian process in the output space of the network. In contrast to existing work on the connection between neural networks and Gaussian processes, our analysis is non-asymptotic, with finite sample-size error bounds provided. This establishes the universality property that a Bayesian neural network can approximate any Gaussian process whose covariance function is sufficiently regular. Our experimental assessment is limited to a proof-of-concept, where we demonstrate that the ridgelet prior can out-perform an unstructured prior on regression problems for which a suitable Gaussian process prior can be provided.
Type: | Article |
---|---|
Title: | The ridgelet prior: A covariance function approach to prior specification for bayesian neural networks |
Open access status: | An open access version is available from UCL Discovery |
Publisher version: | https://jmlr.csail.mit.edu/papers/volume22/20-1300... |
Language: | English |
Additional information: | CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v22/20-1300.html. |
Keywords: | Bayesian neural networks, Gaussian processes, prior selection, ridgelet transform, statistical learning theory |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10133989 |
Archive Staff Only
View Item |