arXiv:2202.04599 [cs.LG]

Missing Data Imputation and Acquisition with Deep Hierarchical Models and Hamiltonian Monte Carlo

Ignacio Peis, Chao Ma, José Miguel Hernández-Lobato

Published 2022-02-09, Version 1

Variational Autoencoders (VAEs) have recently been highly successful at imputing and acquiring heterogeneous missing data and identifying outliers. However, within this specific application domain, existing VAE methods are restricted by using only one layer of latent variables and strictly Gaussian posterior approximations. To address these limitations, we present HH-VAEM, a Hierarchical VAE model for mixed-type incomplete data that uses Hamiltonian Monte Carlo with automatic hyper-parameter tuning for improved approximate inference. Our experiments show that HH-VAEM outperforms existing baselines in the tasks of missing data imputation, supervised learning and outlier identification with missing features. Finally, we also present a sampling-based approach for efficiently computing the information gain when missing features are to be acquired with HH-VAEM. Our experiments show that this sampling-based approach is superior to alternatives based on Gaussian approximations.

Related articles:
arXiv:1911.07572 [cs.LG] (Published 2019-11-18)
Bayesian Recurrent Framework for Missing Data Imputation and Prediction with Clinical Time Series
arXiv:2107.00100 [cs.LG] (Published 2021-06-26)
FCMI: Feature Correlation based Missing Data Imputation
arXiv:2411.12847 [cs.LG] (Published 2024-11-19)
mDAE : modified Denoising AutoEncoder for missing data imputation