arXiv:2107.13757 Abstract | arXiv Analytics

arXiv:2107.13757 [cs.CV]Abstract References Reviews Resources

Bridging Gap between Image Pixels and Semantics via Supervision: A Survey

Published 2021-07-29, updated 2022-04-10Version 3

The fact that there exists a gap between low-level features and semantic meanings of images, called the semantic gap, is known for decades. Resolution of the semantic gap is a long standing problem. The semantic gap problem is reviewed and a survey on recent efforts in bridging the gap is made in this work. Most importantly, we claim that the semantic gap is primarily bridged through supervised learning today. Experiences are drawn from two application domains to illustrate this point: 1) object detection and 2) metric learning for content-based image retrieval (CBIR). To begin with, this paper offers a historical retrospective on supervision, makes a gradual transition to the modern data-driven methodology and introduces commonly used datasets. Then, it summarizes various supervision methods to bridge the semantic gap in the context of object detection and metric learning.

Comments: Jiali Duan and C.-C. Jay Kuo (2022), "Bridging Gap between Image Pixels and Semantics via Supervision: A Survey", APSIPA Transactions on Signal and Information Processing: Vol. 11: No. 1, e2. http://dx.doi.org/10.1561/116.00000038

Categories: cs.CV

Keywords: image pixels, bridging gap, object detection, semantic gap problem, supervision methods

Related articles: Most relevant | Search more

arXiv:1609.02948 [cs.CV] (Published 2016-09-09)

The Role of Context Selection in Object Detection

Ruichi Yu, Xi Chen, Vlad I. Morariu, Larry S. Davis

arXiv:1707.04406 [cs.CV] (Published 2017-07-14)

Inner-Scene Similarities as a Contextual Cue for Object Detection

Noa Arbel, Tamar Avraham, Michael Lindenbaum

arXiv:1805.08009 [cs.CV] (Published 2018-05-21)

Object Detection in Equirectangular Panorama

Wenyan Yang, Yanlin Qian, Francesco Cricri, Lixin Fan, Joni-Kristian Kamarainen