arXiv Analytics

Sign in

arXiv:2001.09700 [cs.LG]AbstractReferencesReviewsResources

DP-CGAN: Differentially Private Synthetic Data and Label Generation

Reihaneh Torkzadehmahani, Peter Kairouz, Benedict Paten

Published 2020-01-27Version 1

Generative Adversarial Networks (GANs) are one of the well-known models to generate synthetic data including images, especially for research communities that cannot use original sensitive datasets because they are not publicly accessible. One of the main challenges in this area is to preserve the privacy of individuals who participate in the training of the GAN models. To address this challenge, we introduce a Differentially Private Conditional GAN (DP-CGAN) training framework based on a new clipping and perturbation strategy, which improves the performance of the model while preserving privacy of the training dataset. DP-CGAN generates both synthetic data and corresponding labels and leverages the recently introduced Renyi differential privacy accountant to track the spent privacy budget. The experimental results show that DP-CGAN can generate visually and empirically promising results on the MNIST dataset with a single-digit epsilon parameter in differential privacy.

Related articles: Most relevant | Search more
arXiv:2403.13612 [cs.LG] (Published 2024-03-20)
Does Differentially Private Synthetic Data Lead to Synthetic Discoveries?
arXiv:2011.05537 [cs.LG] (Published 2020-11-11)
Differentially Private Synthetic Data: Applied Evaluations and Enhancements
arXiv:2206.00686 [cs.LG] (Published 2022-06-01)
Federated Learning in Non-IID Settings Aided by Differentially Private Synthetic Data