arXiv:2403.10336 [cs.CV]

How Powerful Potential of Attention on Image Restoration?

Cong Wang, Jinshan Pan, Yeying Jin, Liyan Wang, Wei Wang, Gang Fu, Wenqi Ren, Xiaochun Cao

Published 2024-03-15 (Version 1)

Transformers have demonstrated their effectiveness in image restoration tasks. Existing Transformer architectures typically comprise two essential components: multi-head self-attention and a feed-forward network (FFN). The former captures long-range pixel dependencies, while the latter enables the model to learn complex patterns and relationships in the data. Previous studies have shown that FFNs act as key-value memories [Geva et al., 2020], which are vital in modern Transformer architectures. In this paper, we conduct an empirical study of the potential of attention mechanisms without the FFN and present novel structures demonstrating that the FFN can be removed for image restoration. Specifically, we propose Continuous Scaling Attention (CSAttn), a method that computes attention continuously in three stages without using an FFN. To achieve competitive performance, we design a series of key components within the attention. Our designs provide a closer look at the attention mechanism and reveal that some simple operations can significantly affect model performance. We apply CSAttn to several image restoration tasks and show that our model can outperform CNN-based and Transformer-based image restoration approaches.
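To make the contrast concrete, the sketch below shows a conventional Transformer block (multi-head self-attention followed by an FFN) next to an illustrative FFN-free block that simply stacks several attention stages. This is a minimal sketch in PyTorch under stated assumptions: the class names, the use of nn.MultiheadAttention, and the residual stacking of three stages are my own illustrative choices; the abstract does not specify the actual internals of CSAttn or its "key components".

# Minimal, hypothetical sketch (not the authors' code): a standard attention+FFN
# block versus an FFN-free block built only from stacked attention stages.
import torch
import torch.nn as nn


class StandardTransformerBlock(nn.Module):
    """Conventional block: multi-head self-attention followed by an FFN."""

    def __init__(self, dim: int, num_heads: int = 4, ffn_mult: int = 4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.ffn = nn.Sequential(
            nn.Linear(dim, ffn_mult * dim),
            nn.GELU(),
            nn.Linear(ffn_mult * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]  # long-range dependencies
        x = x + self.ffn(self.norm2(x))                    # per-token key-value memory
        return x


class AttentionOnlyBlock(nn.Module):
    """Illustrative FFN-free block: several attention stages applied in sequence.

    A stand-in for the abstract's idea of computing attention "continuously in
    three stages"; the paper's actual CSAttn design is not given here.
    """

    def __init__(self, dim: int, num_heads: int = 4, stages: int = 3):
        super().__init__()
        self.stages = nn.ModuleList(
            nn.ModuleDict({
                "norm": nn.LayerNorm(dim),
                "attn": nn.MultiheadAttention(dim, num_heads, batch_first=True),
            })
            for _ in range(stages)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for stage in self.stages:
            h = stage["norm"](x)
            x = x + stage["attn"](h, h, h, need_weights=False)[0]
        return x


if __name__ == "__main__":
    tokens = torch.randn(2, 64, 32)  # (batch, pixel/patch tokens, channels)
    print(StandardTransformerBlock(32)(tokens).shape)  # torch.Size([2, 64, 32])
    print(AttentionOnlyBlock(32)(tokens).shape)        # torch.Size([2, 64, 32])

Both blocks map a token sequence to the same shape, so the FFN-free variant can drop in wherever the standard block is used; whether it matches the standard block's accuracy is exactly the empirical question the paper studies.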

Related articles:
arXiv:2108.07153 [cs.CV] (Published 2021-08-16)
Escaping the Gradient Vanishing: Periodic Alternatives of Softmax in Attention Mechanism
arXiv:2411.05838 [cs.CV] (Published 2024-11-06)
StegaVision: Enhancing Steganography with Attention Mechanism
arXiv:2403.08499 [cs.CV] (Published 2024-03-13)
Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks