arXiv Analytics

Sign in

arXiv:1605.06177 [cs.CV]AbstractReferencesReviewsResources

Fine-Grained Classification of Pedestrians in Video: Benchmark and State of the Art

David Hall, Pietro Perona

Published 2016-05-20Version 1

A video dataset that is designed to study fine-grained categorisation of pedestrians is introduced. Pedestrians were recorded "in-the-wild" from a moving vehicle. Annotations include bounding boxes, tracks, 14 keypoints with occlusion information and the fine-grained categories of age (5 classes), sex (2 classes), weight (3 classes) and clothing style (4 classes). There are a total of 27,454 bounding box and pose labels across 4222 tracks. This dataset is designed to train and test algorithms for fine-grained categorisation of people, it is also useful for benchmarking tracking, detection and pose estimation of pedestrians. State-of-the-art algorithms for fine-grained classification and pose estimation were tested using the dataset and the results are reported as a useful performance baseline.

Related articles: Most relevant | Search more
arXiv:1709.01450 [cs.CV] (Published 2017-09-05)
The Devil is in the Tails: Fine-grained Classification in the Wild
arXiv:2208.03142 [cs.CV] (Published 2022-08-05)
BoxShrink: From Bounding Boxes to Segmentation Masks
arXiv:1511.09209 [cs.CV] (Published 2015-11-30)
Fine-Grained Classification via Mixture of Deep Convolutional Neural Networks