arXiv:2001.09700 Abstract | arXiv Analytics

arXiv:2001.09700 [cs.LG]Abstract References Reviews Resources

DP-CGAN: Differentially Private Synthetic Data and Label Generation

Reihaneh Torkzadehmahani, Peter Kairouz, Benedict Paten

Published 2020-01-27Version 1

Generative Adversarial Networks (GANs) are one of the well-known models to generate synthetic data including images, especially for research communities that cannot use original sensitive datasets because they are not publicly accessible. One of the main challenges in this area is to preserve the privacy of individuals who participate in the training of the GAN models. To address this challenge, we introduce a Differentially Private Conditional GAN (DP-CGAN) training framework based on a new clipping and perturbation strategy, which improves the performance of the model while preserving privacy of the training dataset. DP-CGAN generates both synthetic data and corresponding labels and leverages the recently introduced Renyi differential privacy accountant to track the spent privacy budget. The experimental results show that DP-CGAN can generate visually and empirically promising results on the MNIST dataset with a single-digit epsilon parameter in differential privacy.

Comments: 7 pages, 4 figures

Categories: cs.LG, stat.ML

Keywords: differentially private synthetic data, label generation, renyi differential privacy accountant, single-digit epsilon parameter, generate synthetic data

Related articles: Most relevant | Search more

arXiv:2403.13612 [cs.LG] (Published 2024-03-20)

Does Differentially Private Synthetic Data Lead to Synthetic Discoveries?

Ileana Montoya Perez, Parisa Movahedi, Valtteri Nieminen, Antti Airola, Tapio Pahikkala

arXiv:2011.05537 [cs.LG] (Published 2020-11-11)

Differentially Private Synthetic Data: Applied Evaluations and Enhancements

Lucas Rosenblatt, Xiaoyan Liu, Samira Pouyanfar, Eduardo de Leon, Anuj Desai, Joshua Allen

arXiv:2206.00686 [cs.LG] (Published 2022-06-01)

Federated Learning in Non-IID Settings Aided by Differentially Private Synthetic Data