arXiv:2205.13092 Abstract | arXiv Analytics

arXiv:2205.13092 [cs.CV]Abstract References Reviews Resources

Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels

Tao Pu, Tianshui Chen, Hefeng Wu, Yongyi Lu, Liang Lin

Published 2022-05-26Version 1

Despite achieving impressive progress, current multi-label image recognition (MLR) algorithms heavily depend on large-scale datasets with complete labels, making collecting large-scale datasets extremely time-consuming and labor-intensive. Training the multi-label image recognition models with partial labels (MLR-PL) is an alternative way to address this issue, in which merely some labels are known while others are unknown for each image (see Figure 1). However, current MLP-PL algorithms mainly rely on the pre-trained image classification or similarity models to generate pseudo labels for the unknown labels. Thus, they depend on a certain amount of data annotations and inevitably suffer from obvious performance drops, especially when the known label proportion is low. To address this dilemma, we propose a unified semantic-aware representation blending (SARB) that consists of two crucial modules to blend multi-granularity category-specific semantic representation across different images to transfer information of known labels to complement unknown labels. Extensive experiments on the MS-COCO, Visual Genome, and Pascal VOC 2007 datasets show that the proposed SARB consistently outperforms current state-of-the-art algorithms on all known label proportion settings. Concretely, it obtain the average mAP improvement of 1.9%, 4.5%, 1.0% on the three benchmark datasets compared with the second-best algorithm.

Comments: Technical Report. arXiv admin note: substantial text overlap with arXiv:2203.02172

Categories: cs.CV

Keywords: multi-label image recognition, semantic-aware representation blending, partial labels, multi-granularity category-specific semantic representation, large-scale datasets

Related articles: Most relevant | Search more

arXiv:2203.02172 [cs.CV] (Published 2022-03-04)

Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels

Tao Pu, Tianshui Chen, Hefeng Wu, Liang Lin

arXiv:2007.01755 [cs.CV] (Published 2020-07-03)

Multi-Label Image Recognition with Multi-Class Attentional Regions

Bin-Bin Gao, Hong-Yu Zhou

arXiv:2407.17630 [cs.CV] (Published 2024-07-24)

Revising the Problem of Partial Labels from the Perspective of CNNs' Robustness