arXiv Analytics

Sign in

arXiv:2205.13092 [cs.CV]AbstractReferencesReviewsResources

Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels

Tao Pu, Tianshui Chen, Hefeng Wu, Yongyi Lu, Liang Lin

Published 2022-05-26Version 1

Despite achieving impressive progress, current multi-label image recognition (MLR) algorithms heavily depend on large-scale datasets with complete labels, making collecting large-scale datasets extremely time-consuming and labor-intensive. Training the multi-label image recognition models with partial labels (MLR-PL) is an alternative way to address this issue, in which merely some labels are known while others are unknown for each image (see Figure 1). However, current MLP-PL algorithms mainly rely on the pre-trained image classification or similarity models to generate pseudo labels for the unknown labels. Thus, they depend on a certain amount of data annotations and inevitably suffer from obvious performance drops, especially when the known label proportion is low. To address this dilemma, we propose a unified semantic-aware representation blending (SARB) that consists of two crucial modules to blend multi-granularity category-specific semantic representation across different images to transfer information of known labels to complement unknown labels. Extensive experiments on the MS-COCO, Visual Genome, and Pascal VOC 2007 datasets show that the proposed SARB consistently outperforms current state-of-the-art algorithms on all known label proportion settings. Concretely, it obtain the average mAP improvement of 1.9%, 4.5%, 1.0% on the three benchmark datasets compared with the second-best algorithm.

Comments: Technical Report. arXiv admin note: substantial text overlap with arXiv:2203.02172
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:2203.02172 [cs.CV] (Published 2022-03-04)
Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels
arXiv:2007.01755 [cs.CV] (Published 2020-07-03)
Multi-Label Image Recognition with Multi-Class Attentional Regions
arXiv:2407.17630 [cs.CV] (Published 2024-07-24)
Revising the Problem of Partial Labels from the Perspective of CNNs' Robustness