arXiv Analytics

Sign in

arXiv:2206.03033 [cs.CV]AbstractReferencesReviewsResources

Deep Learning Techniques for Visual Counting

Luca Ciampi

Published 2022-06-07Version 1

In this thesis, I investigated and enhanced the visual counting task, which automatically estimates the number of objects in still images or video frames. Recently, due to the growing interest in it, several CNN-based solutions have been suggested by the scientific community. These artificial neural networks provide a way to automatically learn effective representations from raw visual data and can be successfully employed to address typical challenges characterizing this task, such as different illuminations and object scales. But apart from these difficulties, I targeted some other crucial limitations in the adoption of CNNs, proposing solutions that I experimentally evaluated in the context of the counting task which turns out to be particularly affected by these shortcomings. In particular, I tackled the problem related to the lack of data needed for training current CNN-based solutions. Given that the budget for labeling is limited, data scarcity still represents an open problem, particularly evident in tasks such as the counting one, where the objects to be labeled are thousands per image. Specifically, I introduced synthetic datasets gathered from virtual environments, where the training labels are automatically collected. I proposed Domain Adaptation strategies aiming at mitigating the domain gap existing between the training and test data distributions. I presented a counting strategy where I took advantage of the redundant information characterizing datasets labeled by multiple annotators. Moreover, I tackled the engineering challenges coming out of the adoption of CNN techniques in environments with limited power resources. I introduced solutions for counting vehicles directly onboard embedded vision systems. Finally, I designed an embedded modular Computer Vision-based system that can carry out several tasks to help monitor individual and collective human safety rules.

Comments: Version with high-quality images can be found at https://etd.adm.unipi.it/theses/available/etd-04262022-163702/. arXiv admin note: text overlap with arXiv:1802.03601, arXiv:1707.01202, arXiv:1809.02165, arXiv:1901.06026, arXiv:1808.01244 by other authors
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:2004.05214 [cs.CV] (Published 2020-04-10)
A Review on Deep Learning Techniques for Video Prediction
arXiv:2103.14872 [cs.CV] (Published 2021-03-27)
Deep Learning Techniques for In-Crop Weed Identification: A Review
Kun Hu et al.
arXiv:2412.02072 [cs.CV] (Published 2024-12-03)
Performance Comparison of Deep Learning Techniques in Naira Classification