arXiv Analytics

arXiv:2108.07422 [cs.CV]

Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences

Hyunjong Park, Sanghoon Lee, Junghyup Lee, Bumsub Ham

Published 2021-08-17, Version 1

We address the problem of visible-infrared person re-identification (VI-reID), that is, retrieving a set of person images, captured by visible or infrared cameras, in a cross-modal setting. Two main challenges in VI-reID are intra-class variations across person images and cross-modal discrepancies between visible and infrared images. Assuming that the person images are roughly aligned, previous approaches attempt to learn coarse image-level or rigid part-level person representations that are discriminative and generalizable across different modalities. However, the person images, typically cropped by off-the-shelf object detectors, are not necessarily well aligned, which hampers discriminative person representation learning. In this paper, we introduce a novel feature learning framework that addresses these problems in a unified way. To this end, we propose to exploit dense correspondences between cross-modal person images. This allows us to address the cross-modal discrepancies at the pixel level, suppressing modality-related features from person representations more effectively. It also encourages pixel-wise associations between cross-modal local features, further facilitating discriminative feature learning for VI-reID. Extensive experiments and analyses on standard VI-reID benchmarks demonstrate the effectiveness of our approach, which significantly outperforms the state of the art.
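
To make the idea of dense cross-modal correspondences more concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes two feature maps of equal shape (C, H, W), one from a visible image and one from an infrared image, and computes a soft matching between every pair of spatial positions using cosine similarity and a softmax. The function name `dense_soft_warp` and the `temperature` parameter are hypothetical choices made for this example only.

```python
import numpy as np


def dense_soft_warp(feat_rgb, feat_ir, temperature=0.05):
    """Softly align infrared features to visible-image positions.

    Illustrative assumption, not the paper's method: both inputs are
    arrays of shape (C, H, W) with the same C, H, and W.
    Returns (warped, prob), where prob[i, j] is the matching probability
    between visible position i and infrared position j, and warped holds
    the infrared features re-sampled at visible positions.
    """
    c, h, w = feat_rgb.shape
    q = feat_rgb.reshape(c, -1).T  # (H*W, C) visible local features
    k = feat_ir.reshape(c, -1).T   # (H*W, C) infrared local features

    # Cosine similarity between every visible and infrared position.
    q_n = q / (np.linalg.norm(q, axis=1, keepdims=True) + 1e-8)
    k_n = k / (np.linalg.norm(k, axis=1, keepdims=True) + 1e-8)
    sim = q_n @ k_n.T  # (H*W, H*W)

    # Row-wise softmax turns similarities into matching probabilities.
    logits = sim / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    prob = np.exp(logits)
    prob = prob / prob.sum(axis=1, keepdims=True)

    # Each visible position receives a probability-weighted mix of IR features.
    warped = (prob @ k).T.reshape(c, h, w)
    return warped, prob
```

In a sketch like this, the warped infrared features could be compared with the visible features position by position, which is one plausible way to express a pixel-level association between modalities; how the paper actually uses its correspondences is described in the full text, not here.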

Related articles:
arXiv:2304.01537 [cs.CV] (Published 2023-04-04)
PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-identification
arXiv:1912.01230 [cs.CV] (Published 2019-12-03)
Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-Identification
arXiv:2312.07853 [cs.CV] (Published 2023-12-13)
High-Order Structure Based Middle-Feature Learning for Visible-Infrared Person Re-Identification