arXiv Analytics

Sign in

arXiv:1610.02055 [cs.CV]AbstractReferencesReviewsResources

Places: An Image Database for Deep Scene Understanding

Bolei Zhou, Aditya Khosla, Agata Lapedriza, Antonio Torralba, Aude Oliva

Published 2016-10-06Version 1

The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification at tasks such as object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, labeled with scene semantic categories and attributes, comprising a quasi-exhaustive list of the types of environments encountered in the world. Using state of the art Convolutional Neural Networks, we provide impressive baseline performances at scene classification. With its high-coverage and high-diversity of exemplars, the Places Database offers an ecosystem to guide future progress on currently intractable visual recognition problems.

Related articles:
arXiv:2206.02086 [cs.CV] (Published 2022-06-05)
Towards the Creation of a Nutrition and Food Group Based Image Database
arXiv:1705.08280 [cs.CV] (Published 2017-05-23)
How hard can it be? Estimating the difficulty of visual search in an image
arXiv:1702.00187 [cs.CV] (Published 2017-02-01)
ImageNet MPEG-7 Visual Descriptors - Technical Report