arXiv Analytics

arXiv:2101.03288 [cs.LG]

How to Train Your Energy-Based Models

Yang Song, Diederik P. Kingma

Published 2021-01-09, Version 1

Energy-Based Models (EBMs), also known as non-normalized probabilistic models, specify probability density or mass functions up to an unknown normalizing constant. Unlike most other probabilistic models, EBMs place no restriction on the tractability of the normalizing constant, and are thus more flexible to parameterize and can model a more expressive family of probability distributions. However, the unknown normalizing constant of EBMs makes training particularly difficult. Our goal is to provide a friendly introduction to modern approaches for EBM training. We start by explaining maximum likelihood training with Markov chain Monte Carlo (MCMC), and proceed to elaborate on MCMC-free approaches, including Score Matching (SM) and Noise Contrastive Estimation (NCE). We highlight theoretical connections among these three approaches, and end with a brief survey of alternative training methods, which are still under active research. Our tutorial is targeted at an audience with a basic understanding of generative models who want to apply EBMs or start a research project in this direction.
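Of the three techniques the abstract names, maximum likelihood with MCMC is the usual starting point: the gradient of the log-likelihood splits into a data term and an intractable model term that is estimated with samples from the current EBM. Below is a minimal sketch of one such training step in PyTorch, using unadjusted Langevin dynamics to draw the negative samples. This is an illustration, not the paper's code; `EnergyNet`, the chain length, and the step sizes are assumptions chosen for readability.

```python
# Minimal sketch: one MCMC-based maximum-likelihood step for an EBM.
# Hypothetical network and hyperparameters; not the paper's exact setup.
import torch
import torch.nn as nn

class EnergyNet(nn.Module):
    """Maps a point x to a scalar energy E_theta(x)."""
    def __init__(self, dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, 128), nn.SiLU(),
            nn.Linear(128, 128), nn.SiLU(),
            nn.Linear(128, 1),
        )

    def forward(self, x):
        return self.net(x).squeeze(-1)

def langevin_sample(energy, x, n_steps=60, step_size=1e-2):
    """Approximate samples from p_theta(x) ∝ exp(-E_theta(x))
    via unadjusted Langevin dynamics."""
    x = x.clone().detach().requires_grad_(True)
    for _ in range(n_steps):
        grad = torch.autograd.grad(energy(x).sum(), x)[0]
        x = (x - 0.5 * step_size * grad
             + (step_size ** 0.5) * torch.randn_like(x))
        x = x.detach().requires_grad_(True)
    return x.detach()

dim = 2
energy = EnergyNet(dim)
opt = torch.optim.Adam(energy.parameters(), lr=1e-3)

# The log-likelihood gradient pushes energy down on real data and up on
# model ("negative") samples drawn by MCMC.
x_real = torch.randn(128, dim)  # stand-in for a data batch
x_neg = langevin_sample(energy, torch.randn(128, dim))
loss = energy(x_real).mean() - energy(x_neg).mean()
opt.zero_grad()
loss.backward()
opt.step()
```

In practice, short chains initialized from a replay buffer of past negatives (persistent chains) are a common stabilizer for this estimator; SM and NCE, discussed later in the tutorial, avoid the MCMC sampling step entirely.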

Related articles:
arXiv:2504.10612 [cs.LG] (Published 2025-04-14, updated 2025-05-14)
Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling
arXiv:1912.02714 [cs.LG] (Published 2019-11-16)
Inferring the Optimal Policy using Markov Chain Monte Carlo
arXiv:2406.13661 [cs.LG] (Published 2024-06-19)
Hitchhiker's guide on Energy-Based Models: a comprehensive review on the relation with other generative models, sampling and statistical physics