arXiv:2106.06245 Abstract | arXiv Analytics

arXiv:2106.06245 [stat.ML]Abstract References Reviews Resources

Model Selection for Bayesian Autoencoders

Ba-Hien Tran, Simone Rossi, Dimitrios Milios, Pietro Michiardi, Edwin V. Bonilla, Maurizio Filippone

Published 2021-06-11Version 1

We develop a novel method for carrying out model selection for Bayesian autoencoders (BAEs) by means of prior hyper-parameter optimization. Inspired by the common practice of type-II maximum likelihood optimization and its equivalence to Kullback-Leibler divergence minimization, we propose to optimize the distributional sliced-Wasserstein distance (DSWD) between the output of the autoencoder and the empirical data distribution. The advantages of this formulation are that we can estimate the DSWD based on samples and handle high-dimensional problems. We carry out posterior estimation of the BAE parameters via stochastic gradient Hamiltonian Monte Carlo and turn our BAE into a generative model by fitting a flexible Dirichlet mixture model in the latent space. Consequently, we obtain a powerful alternative to variational autoencoders, which are the preferred choice in modern applications of autoencoders for representation learning with uncertainty. We evaluate our approach qualitatively and quantitatively using a vast experimental campaign on a number of unsupervised learning tasks and show that, in small-data regimes where priors matter, our approach provides state-of-the-art results, outperforming multiple competitive baselines.

Categories: stat.ML, cs.LG

Keywords: model selection, bayesian autoencoders, stochastic gradient hamiltonian monte carlo, type-ii maximum likelihood optimization, flexible dirichlet mixture model

Related articles: Most relevant | Search more

arXiv:1804.07344 [stat.ML] (Published 2018-04-19)

Effects of sampling skewness of the importance-weighted risk estimator on model selection

Wouter M. Kouw, Marco Loog

arXiv:2310.16320 [stat.ML] (Published 2023-10-25)

Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo

Ziyi Wang, Yujie Chen, Qifan Song, Ruqi Zhang

arXiv:1812.01181 [stat.ML] (Published 2018-12-04)

Parallel-tempered Stochastic Gradient Hamiltonian Monte Carlo for Approximate Multimodal Posterior Sampling

Rui Luo, Qiang Zhang, Yaodong Yang, Yuanyuan Liu