arXiv:2207.14227 Abstract | arXiv Analytics

arXiv:2207.14227 [cs.CV]Abstract References Reviews Resources

Visual Recognition by Request

Chufeng Tang, Lingxi Xie, Xiaopeng Zhang, Xiaolin Hu, Qi Tian

Published 2022-07-28Version 1

In this paper, we present a novel protocol of annotation and evaluation for visual recognition. Different from traditional settings, the protocol does not require the labeler/algorithm to annotate/recognize all targets (objects, parts, etc.) at once, but instead raises a number of recognition instructions and the algorithm recognizes targets by request. This mechanism brings two beneficial properties to reduce the burden of annotation, namely, (i) variable granularity: different scenarios can have different levels of annotation, in particular, object parts can be labeled only in large and clear instances, (ii) being open-domain: new concepts can be added to the database in minimal costs. To deal with the proposed setting, we maintain a knowledge base and design a query-based visual recognition framework that constructs queries on-the-fly based on the requests. We evaluate the recognition system on two mixed-annotated datasets, CPP and ADE20K, and demonstrate its promising ability of learning from partially labeled data as well as adapting to new concepts with only text labels.

Categories: cs.CV

Keywords: annotation, algorithm recognizes targets, query-based visual recognition framework, recognition instructions, novel protocol

Related articles: Most relevant | Search more

arXiv:2308.10174 [cs.CV] (Published 2023-08-20)

Neural Interactive Keypoint Detection

Jie Yang, Ailing Zeng, Feng Li, Shilong Liu, Ruimao Zhang, Lei Zhang

arXiv:2506.19331 [cs.CV] (Published 2025-06-24)

Segment Any 3D-Part in a Scene from a Sentence

Hongyu Wu, Pengwan Yang, Yuki M. Asano, Cees G. M. Snoek

arXiv:1607.04564 [cs.CV] (Published 2016-07-15)

DAVE: A Unified Framework for Fast Vehicle Detection and Annotation

Yi Zhou, Li Liu, Ling Shao, Matt Mellor