arXiv:1910.06539 [stat.ML]

Challenges in Bayesian inference via Markov chain Monte Carlo for neural networks

Theodore Papamarkou, Jacob Hinkle, M. Todd Young, David Womble

Published 2019-10-15 (Version 1)

Markov chain Monte Carlo (MCMC) methods and neural networks are instrumental in tackling inferential and prediction problems. However, Bayesian inference based on joint use of MCMC methods and of neural networks is limited. This paper reviews the main challenges posed by neural networks to MCMC developments, including lack of parameter identifiability due to weight symmetries, prior specification effects, and consequently high computational cost and convergence failure. Population and manifold MCMC algorithms are combined to demonstrate these challenges via multilayer perceptron (MLP) examples and to develop case studies for assessing the capacity of approximate inference methods to uncover the posterior covariance of neural network parameters. Some of these challenges, such as high computational cost arising from the application of neural networks to big data and parameter identifiability arising from weight symmetries, stimulate research towards more scalable approximate MCMC methods or towards MCMC methods in reduced parameter spaces.
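
The lack of parameter identifiability due to weight symmetries can be illustrated with a minimal sketch. It is not taken from the paper: the network sizes, the tanh activation, and the names `mlp_forward`, `perm`, and `signs` are assumptions chosen for illustration. The sketch shows that permuting the hidden units of a one-hidden-layer MLP, or flipping the signs of all weights attached to a hidden unit, leaves the network output unchanged, so distinct parameter vectors have the same likelihood.

```python
import numpy as np


def mlp_forward(x, W1, b1, W2, b2):
    """One-hidden-layer MLP with tanh activation (an assumed architecture)."""
    hidden = np.tanh(x @ W1 + b1)
    return hidden @ W2 + b2


rng = np.random.default_rng(0)
n_inputs, n_hidden, n_outputs = 3, 4, 2  # illustrative sizes

# Random parameters and a small batch of inputs.
W1 = rng.normal(size=(n_inputs, n_hidden))
b1 = rng.normal(size=n_hidden)
W2 = rng.normal(size=(n_hidden, n_outputs))
b2 = rng.normal(size=n_outputs)
x = rng.normal(size=(5, n_inputs))

# Permutation symmetry: relabel the hidden units consistently in both layers.
perm = rng.permutation(n_hidden)
W1_perm, b1_perm, W2_perm = W1[:, perm], b1[perm], W2[perm, :]

# Sign-flip symmetry: tanh is odd, so negating a unit's incoming weights and
# bias together with its outgoing weights cancels out in the output.
signs = np.array([-1.0, 1.0, -1.0, 1.0])
W1_sign, b1_sign, W2_sign = W1 * signs, b1 * signs, W2 * signs[:, None]

y = mlp_forward(x, W1, b1, W2, b2)
y_perm = mlp_forward(x, W1_perm, b1_perm, W2_perm, b2)
y_sign = mlp_forward(x, W1_sign, b1_sign, W2_sign, b2)

print("permutation leaves output unchanged:", np.allclose(y, y_perm))
print("sign flip leaves output unchanged:", np.allclose(y, y_sign))
```

Under a prior that is itself invariant to these transformations, each such symmetry yields an equivalent posterior mode, which is one way to see why a single Markov chain may struggle to converge in the full parameter space.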
