arXiv:1301.7393 Abstract | arXiv Analytics

arXiv:1301.7393 [cs.LG]Abstract References Reviews Resources

Mixture Representations for Inference and Learning in Boltzmann Machines

Neil D. Lawrence, Christopher M. Bishop, Michael I. Jordan

Published 2013-01-30Version 1

Boltzmann machines are undirected graphical models with two-state stochastic variables, in which the logarithms of the clique potentials are quadratic functions of the node states. They have been widely studied in the neural computing literature, although their practical applicability has been limited by the difficulty of finding an effective learning algorithm. One well-established approach, known as mean field theory, represents the stochastic distribution using a factorized approximation. However, the corresponding learning algorithm often fails to find a good solution. We conjecture that this is due to the implicit uni-modality of the mean field approximation which is therefore unable to capture multi-modality in the true distribution. In this paper we use variational methods to approximate the stochastic distribution using multi-modal mixtures of factorized distributions. We present results for both inference and learning to demonstrate the effectiveness of this approach.

Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

Categories: cs.LG, stat.ML

Keywords: boltzmann machines, mixture representations, stochastic distribution, learning algorithm, two-state stochastic variables

Tags: conference paper

Related articles: Most relevant | Search more

arXiv:1312.3970 [cs.LG] (Published 2013-12-13)

An Extensive Evaluation of Filtering Misclassified Instances in Supervised Classification Tasks

Michael R. Smith, Tony Martinez

arXiv:2405.06582 [cs.LG] (Published 2024-05-10)

The Role of Learning Algorithms in Collective Action

Omri Ben-Dov, Jake Fawkes, Samira Samadi, Amartya Sanyal

arXiv:2007.12815 [cs.LG] (Published 2020-07-25)