arXiv:2210.06807 [cs.LG]

Improving Out-of-Distribution Generalization by Adversarial Training with Structured Priors

Qixun Wang, Yifei Wang, Hong Zhu, Yisen Wang

Published 2022-10-13, Version 1

Deep models often fail to generalize to test domains whose data distribution differs from that of the training domain. Among the many approaches to this Out-of-Distribution (OOD) generalization problem, there has been growing interest in exploiting Adversarial Training (AT) to improve OOD performance. Recent works have revealed that the robust model obtained by conducting sample-wise AT also retains transferability to biased test domains. In this paper, we empirically show that sample-wise AT yields only limited improvement in OOD performance. Specifically, we find that AT can only maintain performance at smaller scales of perturbation, while Universal AT (UAT) is more robust to larger-scale perturbations. This suggests that adversarial perturbations with universal (low-dimensional) structures can enhance robustness against the large data distribution shifts that are common in OOD scenarios. Inspired by this, we propose two AT variants with low-rank structures to train OOD-robust models. Extensive experiments on the DomainBed benchmark show that our proposed approaches outperform Empirical Risk Minimization (ERM) and sample-wise AT. Our code is available at https://github.com/NOVAglow646/NIPS22-MAT-and-LDAT-for-OOD.
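
The contrast the abstract draws between sample-wise, universal, and low-rank perturbations can be illustrated with a minimal NumPy sketch. This is not the authors' implementation of their proposed AT variants. It assumes a toy linear logistic model on 2-D inputs, a single sign-gradient step, and a truncated SVD as one simple way to impose low rank; all function names are hypothetical.

```python
import numpy as np

def loss_grad_wrt_input(w, x, y):
    """Gradient of the logistic loss w.r.t. each input of a toy linear model.
    x: (n, h, w) inputs, y: (n,) labels in {0, 1}, w: (h, w) weights."""
    logits = np.tensordot(x, w, axes=([1, 2], [0, 1]))  # shape (n,)
    probs = 1.0 / (1.0 + np.exp(-logits))
    return (probs - y)[:, None, None] * w               # shape (n, h, w)

def samplewise_perturbation(w, x, y, eps):
    """Sample-wise: a separate perturbation for every input (FGSM-style step)."""
    return eps * np.sign(loss_grad_wrt_input(w, x, y))  # shape (n, h, w)

def universal_perturbation(w, x, y, eps):
    """Universal: one perturbation shared by all inputs (assumed: sign of mean gradient)."""
    g = loss_grad_wrt_input(w, x, y).mean(axis=0)
    return eps * np.sign(g)                             # shape (h, w)

def low_rank_perturbation(w, x, y, eps, rank):
    """Shared perturbation restricted to rank <= `rank` (assumed: truncated SVD
    of the mean gradient, rescaled so its largest entry has magnitude eps)."""
    g = loss_grad_wrt_input(w, x, y).mean(axis=0)
    u, s, vt = np.linalg.svd(g, full_matrices=False)
    d = (u[:, :rank] * s[:rank]) @ vt[:rank]
    return eps * d / (np.abs(d).max() + 1e-12)          # shape (h, w)

# Toy data: 16 random 8x8 "images" with random binary labels.
rng = np.random.default_rng(0)
x = rng.normal(size=(16, 8, 8))
y = rng.integers(0, 2, size=16).astype(float)
w = rng.normal(size=(8, 8))

delta_s = samplewise_perturbation(w, x, y, eps=0.1)
delta_u = universal_perturbation(w, x, y, eps=0.1)
delta_lr = low_rank_perturbation(w, x, y, eps=0.1, rank=2)
print(delta_s.shape, delta_u.shape, delta_lr.shape)
```

The sample-wise perturbation has one free value per pixel per input, whereas the universal and low-rank versions share a single, lower-dimensional pattern across the batch; this is the structural difference the abstract refers to.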

Related articles:
arXiv:1611.03383 [cs.LG] (Published 2016-11-10)
Disentangling factors of variation in deep representations using adversarial training
arXiv:2205.15130 [cs.LG] (Published 2022-05-30)
Why Adversarial Training of ReLU Networks Is Difficult?
arXiv:2205.01663 [cs.LG] (Published 2022-05-03)
Adversarial Training for High-Stakes Reliability