arXiv:2104.09807 Abstract | arXiv Analytics

arXiv:2104.09807 [cs.CV]

Visual Navigation with Spatial Attention

Published 2021-04-20, Version 1

This work focuses on object goal visual navigation, aiming at finding the location of an object from a given class, where in each step the agent is provided with an egocentric RGB image of the scene. We propose to learn the agent's policy using a reinforcement learning algorithm. Our key contribution is a novel attention probability model for visual navigation tasks. This attention encodes semantic information about observed objects, as well as spatial information about their place. This combination of the "what" and the "where" allows the agent to navigate toward the sought-after object effectively. The attention model is shown to improve the agent's policy and to achieve state-of-the-art results on commonly-used datasets.
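The abstract does not specify the form of the attention probability model, so the following is only a generic illustration of how a spatial attention distribution can combine "what" and "where". It is a minimal sketch under stated assumptions, not the authors' method: the score grid, the function names `softmax` and `spatial_attention`, and the use of a plain softmax are all hypothetical choices made for the example. Each grid cell holds an assumed relevance score for the target class (the "what"); normalizing the scores gives a probability per cell, and the probability-weighted cell coordinates give an expected location (the "where").

```python
import math


def softmax(scores):
    """Numerically stable softmax over a flat list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def spatial_attention(score_grid):
    """Convert a 2D grid of target-relevance scores into an attention
    probability map and an expected (row, col) location.

    Assumption: score_grid[r][c] is a hypothetical semantic score for
    how strongly cell (r, c) matches the target class.
    """
    rows = len(score_grid)
    cols = len(score_grid[0])
    flat = [s for row in score_grid for s in row]
    probs = softmax(flat)
    # Reshape the flat probabilities back into the grid layout.
    prob_grid = [probs[r * cols:(r + 1) * cols] for r in range(rows)]
    # Probability-weighted average of the cell coordinates.
    exp_row = sum(prob_grid[r][c] * r for r in range(rows) for c in range(cols))
    exp_col = sum(prob_grid[r][c] * c for r in range(rows) for c in range(cols))
    return prob_grid, (exp_row, exp_col)


# Hypothetical 3x3 score grid: the target is most likely at the right edge.
scores = [
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 4.0],
    [0.0, 0.0, 0.0],
]
prob_grid, (exp_row, exp_col) = spatial_attention(scores)
```

In this toy setup the attention mass concentrates on the high-scoring cell, so the expected column is pulled toward the right of the grid; a navigation policy could consume such a map as an extra input, although how the paper feeds attention to the policy is not stated in the abstract.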

Categories: cs.CV, cs.LG

Keywords: spatial attention, object goal visual navigation, novel attention probability model, agent's policy, attention encodes semantic information

Related articles:

arXiv:2401.02656 [cs.CV] (Published 2024-01-05)

GTA: Guided Transfer of Spatial Attention from Object-Centric Representations

SeokHyun Seo, Jinwoo Hong, JungWoo Chae, Kyungyul Kim, Sangheum Hwang

arXiv:2307.07370 [cs.CV] (Published 2023-07-14)

AIC-AB NET: A Neural Network for Image Captioning with Spatial Attention and Text Attributes

Guoyun Tu, Ying Liu, Vladimir Vlassov

arXiv:2407.01782 [cs.CV] (Published 2024-07-01)

Addressing a fundamental limitation in deep vision models: lack of spatial attention

Ali Borji
