arXiv Analytics

arXiv:1906.06765 [cs.CV]

Defending Against Adversarial Attacks Using Random Forests

Yifan Ding, Liqiang Wang, Huan Zhang, Jinfeng Yi, Deliang Fan, Boqing Gong

Published 2019-06-16, Version 1

As deep neural networks (DNNs) have become increasingly important and popular, their robustness is key to the safety of both the Internet and the physical world. Unfortunately, recent studies show that adversarial examples, which are hard to distinguish from real examples, can easily fool DNNs and manipulate their predictions. Observing that adversarial examples are mostly generated by gradient-based methods, in this paper we propose a simple yet very effective non-differentiable hybrid model that combines DNNs and random forests to defend against such attacks, rather than merely hiding gradients from attackers. Our experiments show that the model completely defends against white-box attacks, exhibits lower transferability, and is quite resistant to three representative types of black-box attacks, while at the same time achieving classification accuracy similar to that of the original DNNs. Finally, we investigate and suggest a criterion for deciding where to grow random forests in DNNs.
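To make the idea concrete, the following is a minimal sketch of such a DNN + random-forest hybrid, not the paper's exact architecture: it assumes a pretrained ResNet-18 as the feature extractor, uses its penultimate (global-pool) features as the place where the forest is grown, and uses default scikit-learn hyperparameters. Because the forest's decision function is piecewise constant, the end-to-end classifier is non-differentiable, so gradient-based attacks have no useful gradient to follow from the prediction back to the input.

```python
# Hypothetical sketch of a non-differentiable DNN + random-forest hybrid.
# Assumptions (not from the paper): ResNet-18 backbone, penultimate
# global-pool features, default RandomForestClassifier hyperparameters.

import numpy as np
import torch
import torch.nn as nn
from torchvision import models
from sklearn.ensemble import RandomForestClassifier

device = "cuda" if torch.cuda.is_available() else "cpu"

# 1) Take a trained CNN and drop its final (differentiable) classifier,
#    keeping only the convolutional feature extractor.
cnn = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).to(device).eval()
feature_extractor = nn.Sequential(*list(cnn.children())[:-1])  # up to global avg-pool

@torch.no_grad()
def extract_features(images: torch.Tensor) -> np.ndarray:
    """Map a batch of images (N, 3, H, W) to flat feature vectors (N, D)."""
    feats = feature_extractor(images.to(device))
    return feats.flatten(1).cpu().numpy()

# 2) Grow a random forest on the intermediate features; its piecewise-constant
#    decision boundaries block gradient flow from prediction to input.
def fit_hybrid(train_images: torch.Tensor, train_labels: np.ndarray) -> RandomForestClassifier:
    forest = RandomForestClassifier(n_estimators=100, n_jobs=-1)
    forest.fit(extract_features(train_images), train_labels)
    return forest

def predict_hybrid(forest: RandomForestClassifier, images: torch.Tensor) -> np.ndarray:
    """Classify images with the non-differentiable hybrid."""
    return forest.predict(extract_features(images))
```

In this sketch the choice of split point (which DNN layer feeds the forest) is arbitrary; the paper's criterion for where to grow the forest would determine that choice in practice.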

Related articles:
arXiv:1802.06806 [cs.CV] (Published 2018-02-19)
Divide, Denoise, and Defend against Adversarial Attacks
arXiv:1812.06570 [cs.CV] (Published 2018-12-17)
Defense-VAE: A Fast and Accurate Defense against Adversarial Attacks
arXiv:2007.09916 [cs.CV] (Published 2020-07-20)
Evaluating a Simple Retraining Strategy as a Defense Against Adversarial Attacks