arXiv:1805.12017 [cs.LG]

Counterstrike: Defending Deep Learning Architectures Against Adversarial Samples by Langevin Dynamics with Supervised Denoising Autoencoder

Vignesh Srinivasan, Arturo Marban, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima

Published 2018-05-30 (Version 1)

Adversarial attacks on deep learning models have been demonstrated to be imperceptible to a human while degrading model performance considerably. Previous attempts to provide invariance against such attacks have denoised adversarial samples so that only cleaned samples are sent to the classifier. In a similar spirit, this paper proposes a novel, effective strategy that relaxes adversarial samples onto the underlying manifold of the (unknown) target class distribution. Specifically, given an off-manifold adversarial example, our Metropolis-adjusted Langevin algorithm (Mala), guided by a supervised denoising autoencoder network (sDAE), drives the adversarial sample towards high-density regions of the data-generating distribution. In a nutshell, the adversarial example is transformed from off-manifold back onto the data manifold on which the learning model was originally trained and where it performs well and robustly. Experiments on various benchmark datasets show that our novel Malade method exhibits high robustness against black-box and white-box attacks and outperforms state-of-the-art defense algorithms.
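To make the core idea concrete, below is a minimal sketch of DAE-guided Langevin purification. It relies on the standard result that the residual of a well-trained denoising autoencoder approximates the score of the data distribution, (dae(x) - x) / sigma^2 ≈ ∇ log p(x), so Langevin updates along this residual drift the sample toward high-density regions. This is an illustrative assumption-laden sketch, not the paper's implementation: it uses unadjusted Langevin dynamics (omitting Mala's Metropolis accept/reject step) and a generic denoising autoencoder in place of the paper's supervised sDAE; the names dae, sigma, step_size, and n_steps are hypothetical.

```python
# Sketch: purify an adversarial input by Langevin dynamics guided by a
# denoising autoencoder (DAE). Assumes `dae` is a trained DAE whose
# residual approximates the score of the data distribution.
import torch

def langevin_purify(x_adv, dae, sigma=0.1, step_size=1e-3, n_steps=50):
    """Drive an adversarial input toward high-density regions of the
    data-generating distribution.

    x_adv:     batch of inputs (e.g. images in [0, 1])
    dae:       trained denoising autoencoder; dae(x) returns denoised x
    sigma:     noise level used to train the DAE (scales the score)
    step_size: Langevin step size (hypothetical value)
    n_steps:   number of Langevin updates (hypothetical value)
    """
    x = x_adv.clone()
    for _ in range(n_steps):
        with torch.no_grad():
            # DAE residual as a score estimate: points toward the manifold.
            score = (dae(x) - x) / sigma**2
        noise = torch.randn_like(x)
        # Langevin update: drift along the score plus exploration noise.
        x = x + 0.5 * step_size * score + (step_size ** 0.5) * noise
        x = x.clamp(0.0, 1.0)  # keep inputs in a valid pixel range
    return x
```

The purified output would then be passed to the unmodified classifier; the noise term is what distinguishes this from plain iterative denoising, letting the sampler escape the off-manifold neighborhood the attack constructed.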

Related articles:
arXiv:1806.03316 [cs.LG] (Published 2018-06-08)
Adversarial Meta-Learning
arXiv:1706.06529 [cs.LG] (Published 2017-06-20)
A Divergence Bound for Hybrids of MCMC and Variational Inference and an Application to Langevin Dynamics and SGVI
arXiv:1901.08121 [cs.LG] (Published 2019-01-23)
Sitatapatra: Blocking the Transfer of Adversarial Samples