arXiv Analytics

Sign in

arXiv:1301.3856 [cs.LG]AbstractReferencesReviewsResources

Being Bayesian about Network Structure

Nir Friedman, Daphne Koller

Published 2013-01-16Version 1

In many domains, we are interested in analyzing the structure of the underlying distribution, e.g., whether one variable is a direct parent of the other. Bayesian model-selection attempts to find the MAP model and use its structure to answer these questions. However, when the amount of available data is modest, there might be many models that have non-negligible posterior. Thus, we want compute the Bayesian posterior of a feature, i.e., the total posterior probability of all models that contain it. In this paper, we propose a new approach for this task. We first show how to efficiently compute a sum over the exponential number of networks that are consistent with a fixed ordering over network variables. This allows us to compute, for a given ordering, both the marginal probability of the data and the posterior of a feature. We then use this result as the basis for an algorithm that approximates the Bayesian posterior of a feature. Our approach uses a Markov Chain Monte Carlo (MCMC) method, but over orderings rather than over network structures. The space of orderings is much smaller and more regular than the space of structures, and has a smoother posterior `landscape'. We present empirical results on synthetic and real-life datasets that compare our approach to full model averaging (when possible), to MCMC over network structures, and to a non-Bayesian bootstrap approach.

Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)
Categories: cs.LG, cs.AI, stat.ML
Related articles: Most relevant | Search more
arXiv:1912.02714 [cs.LG] (Published 2019-11-16)
Inferring the Optimal Policy using Markov Chain Monte Carlo
arXiv:1702.08670 [cs.LG] (Published 2017-02-28)
On architectural choices in deep learning: From network structure to gradient convergence and parameter estimation
arXiv:1912.12264 [cs.LG] (Published 2019-12-27)
Predicting Attributes of Nodes Using Network Structure