Robust Full Bayesian Learning for Radial Basis Networks

de Freitas, Nando

doi:doi:10.1162/089976601750541831

Robust Full Bayesian Learning for Radial Basis Networks

Christophe Andrieu‚ Nando De Freitas and Arnaud Doucet

Abstract

We propose a hierarchical full Bayesian model for radial basis networks. This model treats the model dimension (number of neurons), model parameters, regularization parameters, and noise parameters as unknown random variables. We develop a reversible-jump Markov chain Monte Carlo (MCMC) method to perform the Bayesian computation. We find that the results obtained using this method are not only better than the ones reported previously, but also appear to be robust with respect to the prior specification. In addition, we propose a novel and computationally efficient reversible-jump MCMC simulated annealing algorithm to optimize neural networks. This algorithm enables us to maximize the joint posterior distribution of the network parameters and the number of basis function. It performs a global search in the joint space of the parameters and number of parameters, thereby surmounting the problem of local minima to a large extent. We show that by calibrating the full hierarchical Bayesian prior, we can obtain the classical Akaike information criterion, Bayesian information criterion, and minimum description length model selection criteria within a penalized likelihood framework. Finally, we present a geometric convergence theorem for the algorithm with homogeneous transition kernel and a convergence theorem for the reversible-jump MCMC simulated annealing method.

Address

Cambridge‚ MA‚ USA

ISSN

0899−7667

Journal

Neural Computation

Number

Pages

2359–2407

Publisher

MIT Press

Volume

Year

2001

Robust Full Bayesian Learning for Radial Basis Networks

Abstract

Links

See Also