arXiv:2204.02481 Abstract | arXiv Analytics

arXiv:2204.02481 [cs.CV]Abstract References Reviews Resources

Adversarial Robustness through the Lens of Convolutional Filters

Published 2022-04-05Version 1

Deep learning models are intrinsically sensitive to distribution shifts in the input data. In particular, small, barely perceivable perturbations to the input data can force models to make wrong predictions with high confidence. An common defense mechanism is regularization through adversarial training which injects worst-case perturbations back into training to strengthen the decision boundaries, and to reduce overfitting. In this context, we perform an investigation of 3x3 convolution filters that form in adversarially-trained models. Filters are extracted from 71 public models of the linf-RobustBench CIFAR-10/100 and ImageNet1k leaderboard and compared to filters extracted from models built on the same architectures but trained without robust regularization. We observe that adversarially-robust models appear to form more diverse, less sparse, and more orthogonal convolution filters than their normal counterparts. The largest differences between robust and normal models are found in the deepest layers, and the very first convolution layer, which consistently and predominantly forms filters that can partially eliminate perturbations, irrespective of the architecture. Data & Project website: https://github.com/paulgavrikov/cvpr22w_RobustnessThroughTheLens

Comments: Accepted at the CVPR 2022 "The Art of Robustness" Workshop

Categories: cs.CV, cs.AI, cs.LG

Keywords: convolutional filters, adversarial robustness, input data, common defense mechanism, orthogonal convolution filters

Related articles: Most relevant | Search more

arXiv:2108.06885 [cs.CV] (Published 2021-08-16)

Neural Architecture Dilation for Adversarial Robustness

Yanxi Li, Zhaohui Yang, Yunhe Wang, Chang Xu

arXiv:2212.11005 [cs.CV] (Published 2022-12-21)

Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective

Shihua Huang, Zhichao Lu, Kalyanmoy Deb, Vishnu Naresh Boddeti

arXiv:1811.07275 [cs.CV] (Published 2018-11-18, updated 2018-11-26)

RePr: Improved Training of Convolutional Filters