arXiv Analytics

Sign in

arXiv:1912.01230 [cs.CV]AbstractReferencesReviewsResources

Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-Identification

Seokeon Choi, Sumin Lee, Youngeun Kim, Taekyung Kim, Changick Kim

Published 2019-12-03Version 1

Visible-infrared person re-identification (VI-ReID) is an important task in night-time surveillance applications, since visible cameras are difficult to capture valid appearance information under poor illumination conditions. Compared to traditional person re-identification that handles only the intra-modality discrepancy, VI-ReID suffers from additional cross-modality discrepancy caused by different types of imaging systems. To reduce both intra- and cross-modality discrepancies, we propose a Hierarchical Cross-Modality Disentanglement (Hi-CMD) method, which automatically disentangles ID-discriminative factors and ID-excluded factors from visible-thermal images. We only use ID-discriminative factors for robust cross-modality matching without ID-excluded factors such as pose or illumination. To implement our approach, we introduce an ID-preserving person image generation network and a hierarchical feature learning module. Our generation network learns the disentangled representation by generating a new cross-modality image with different poses and illuminations while preserving a person's identity. At the same time, the feature learning module enables our model to explicitly extract the common ID-discriminative characteristic between visible-infrared images. Extensive experimental results demonstrate that our method outperforms the state-of-the-art methods on two VI-ReID datasets.

Related articles: Most relevant | Search more
arXiv:2302.01512 [cs.CV] (Published 2023-02-03)
Spectral Aware Softmax for Visible-Infrared Person Re-Identification
arXiv:2304.01537 [cs.CV] (Published 2023-04-04)
PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-identification
arXiv:2108.07422 [cs.CV] (Published 2021-08-17)
Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences