arXiv:1806.02920 Abstract | arXiv Analytics

arXiv:1806.02920 [cs.LG]Abstract References Reviews Resources

GAIN: Missing Data Imputation using Generative Adversarial Nets

Jinsung Yoon, James Jordon, Mihaela van der Schaar

Published 2018-06-07Version 1

We propose a novel method for imputing missing data by adapting the well-known Generative Adversarial Nets (GAN) framework. Accordingly, we call our method Generative Adversarial Imputation Nets (GAIN). The generator (G) observes some components of a real data vector, imputes the missing components conditioned on what is actually observed, and outputs a completed vector. The discriminator (D) then takes a completed vector and attempts to determine which components were actually observed and which were imputed. To ensure that D forces G to learn the desired distribution, we provide D with some additional information in the form of a hint vector. The hint reveals to D partial information about the missingness of the original sample, which is used by D to focus its attention on the imputation quality of particular components. This hint ensures that G does in fact learn to generate according to the true data distribution. We tested our method on various datasets and found that GAIN significantly outperforms state-of-the-art imputation methods.

Comments: 10 pages, 3 figures, 2018 International Conference of Machine Learning

Categories: cs.LG, stat.ML

Keywords: generative adversarial nets, missing data imputation, outperforms state-of-the-art imputation methods, significantly outperforms state-of-the-art imputation, method generative adversarial imputation nets

Tags: conference paper

Related articles: Most relevant | Search more

arXiv:1711.08267 [cs.LG] (Published 2017-11-22)

GraphGAN: Graph Representation Learning with Generative Adversarial Nets

Hongwei Wang et al.

arXiv:1809.02064 [cs.LG] (Published 2018-09-06)

Sample-Efficient Imitation Learning via Generative Adversarial Nets

Lionel Blondé, Alexandros Kalousis

arXiv:1911.07572 [cs.LG] (Published 2019-11-18)

Bayesian Recurrent Framework for Missing Data Imputation and Prediction with Clinical Time Series

Yang Guo, Zhengyuan Liu, Pavitra Krishnswamy, Savitha Ramasamy

arXiv Analytics

arXiv:1806.02920 [cs.LG]Abstract References Reviews Resources

GAIN: Missing Data Imputation using Generative Adversarial Nets

Links

Toolbox

arXiv:1806.02920 [cs.LG]AbstractReferencesReviewsResources

GAIN: Missing Data Imputation using Generative Adversarial Nets

Links

Toolbox

arXiv:1806.02920 [cs.LG]Abstract References Reviews Resources