arXiv Analytics

Sign in

arXiv:2210.12758 [cs.CV]AbstractReferencesReviewsResources

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

Zixuan Xu, Banghuai Li, Ye Yuan, Anhong Dang

Published 2022-10-23Version 1

Recently significant progress has been made in pedestrian detection, but it remains challenging to achieve high performance in occluded and crowded scenes. It could be attributed mostly to the widely used representation of pedestrians, i.e., 2D axis-aligned bounding box, which just describes the approximate location and size of the object. Bounding box models the object as a uniform distribution within the boundary, making pedestrians indistinguishable in occluded and crowded scenes due to much noise. To eliminate the problem, we propose a novel representation based on 2D beta distribution, named Beta Representation. It pictures a pedestrian by explicitly constructing the relationship between full-body and visible boxes, and emphasizes the center of visual mass by assigning different probability values to pixels. As a result, Beta Representation is much better for distinguishing highly-overlapped instances in crowded scenes with a new NMS strategy named BetaNMS. What's more, to fully exploit Beta Representation, a novel pipeline Beta R-CNN equipped with BetaHead and BetaMask is proposed, leading to high detection performance in occluded and crowded scenes.

Related articles: Most relevant | Search more
arXiv:2204.01026 [cs.CV] (Published 2022-04-03)
STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes
Peishan Cong et al.
arXiv:2407.18637 [cs.CV] (Published 2024-07-26)
DynamicTrack: Advancing Gigapixel Tracking in Crowded Scenes
arXiv:2305.09401 [cs.CV] (Published 2023-05-16)
Diffusion Dataset Generation: Towards Closing the Sim2Real Gap for Pedestrian Detection