
arXiv:2007.01755 [cs.CV]

Multi-Label Image Recognition with Multi-Class Attentional Regions

Published 2020-07-03, Version 1

Multi-label image recognition is a practical and challenging task compared to single-label image classification. However, previous works may be suboptimal because they rely on a large number of object proposals or on complex attentional region generation modules. In this paper, we propose a simple but efficient two-stream framework that recognizes multi-category objects from the global image to local regions, similar to how human beings perceive objects. To bridge the gap between the global and local streams, we propose a multi-class attentional region module that aims to keep the number of attentional regions as small as possible and the diversity of these regions as high as possible. Our method can efficiently and effectively recognize multi-class objects with an affordable computation cost and a parameter-free region localization module. On three benchmarks for multi-label image classification, we achieve new state-of-the-art results with a single model that uses only image semantics, without label dependency. In addition, the effectiveness of the proposed method is extensively demonstrated under different factors such as global pooling strategy, input size, and network architecture.
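The abstract does not spell out how the global and local streams are connected, so the following is only a minimal sketch of one way a parameter-free region localization step could work. Everything in it is an assumption made for illustration: the function names (`top_k_classes`, `localize_region`, `fuse_scores`), the relative threshold, the top-k selection rule, and the max-based fusion are hypothetical and are not taken from the paper. The sketch picks a small number of candidate classes from the global scores, derives one bounding box per class from a non-negative attention map using a fixed threshold (no learned parameters), and merges global and local predictions.

```python
from typing import List, Tuple

# (row_min, col_min, row_max, col_max), all inclusive
Box = Tuple[int, int, int, int]


def top_k_classes(scores: List[float], k: int) -> List[int]:
    """Indices of the k highest global scores (hypothetical selection rule)."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]


def localize_region(attention: List[List[float]], rel_threshold: float = 0.5) -> Box:
    """Bounding box of cells with activation >= rel_threshold * max activation.

    Assumes a non-empty, non-negative attention map. Only a fixed threshold
    is used, so this step has no trainable parameters.
    """
    peak = max(max(row) for row in attention)
    cutoff = rel_threshold * peak
    rows = [r for r, row in enumerate(attention) if any(v >= cutoff for v in row)]
    cols = [
        c for c in range(len(attention[0]))
        if any(row[c] >= cutoff for row in attention)
    ]
    return rows[0], cols[0], rows[-1], cols[-1]


def fuse_scores(global_scores: List[float], local_scores: List[List[float]]) -> List[float]:
    """Element-wise maximum over the global and all local score vectors
    (an assumed fusion rule, chosen here for simplicity)."""
    fused = list(global_scores)
    for scores in local_scores:
        fused = [max(a, b) for a, b in zip(fused, scores)]
    return fused


if __name__ == "__main__":
    # Toy 4x4 attention map for one class; the hot area is the central 2x2 block.
    attention_map = [
        [0.0, 0.1, 0.0, 0.0],
        [0.0, 0.9, 0.8, 0.0],
        [0.0, 0.6, 1.0, 0.0],
        [0.0, 0.0, 0.2, 0.0],
    ]
    print(top_k_classes([0.1, 0.9, 0.3, 0.7], k=2))
    print(localize_region(attention_map))
    print(fuse_scores([0.2, 0.9], [[0.5, 0.1], [0.3, 0.4]]))
```

In this sketch, limiting `k` keeps the number of regions small, and taking one region per distinct class is a simple way to keep the regions diverse; a real implementation would crop and resize each box before passing it to the local stream.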

Categories: cs.CV

Keywords: multi-label image recognition, complex attentional region generation modules, parameter-free region localization module, multi-class attentional region module, image classification

Related articles:

arXiv:1612.04844 [cs.CV] (Published 2016-12-14)

The More You Know: Using Knowledge Graphs for Image Classification

Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta

arXiv:1812.06707 [cs.CV] (Published 2018-12-17)

Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation

Rakshith Shetty, Bernt Schiele, Mario Fritz

arXiv:1412.6598 [cs.CV] (Published 2014-12-20, updated 2015-04-11)

Automatic Discovery and Optimization of Parts for Image Classification

Sobhan Naderi Parizi, Andrea Vedaldi, Andrew Zisserman, Pedro Felzenszwalb