arXiv:2206.11459 Abstract | arXiv Analytics

arXiv:2206.11459 [cs.CV]Abstract References Reviews Resources

Explore Spatio-temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and Baseline

Kailai Zhou, Yibo Wang, Tao Lv, Yunqian Li, Linsen Chen, Qiu Shen, Xun Cao

Published 2022-06-23Version 1

We endeavor on a rarely explored task named Insubstantial Object Detection (IOD), which aims to localize the object with following characteristics: (1) amorphous shape with indistinct boundary; (2) similarity to surroundings; (3) absence in color. Accordingly, it is far more challenging to distinguish insubstantial objects in a single static frame and the collaborative representation of spatial and temporal information is crucial. Thus, we construct an IOD-Video dataset comprised of 600 videos (141,017 frames) covering various distances, sizes, visibility, and scenes captured by different spectral ranges. In addition, we develop a spatio-temporal aggregation framework for IOD, in which different backbones are deployed and a spatio-temporal aggregation loss (STAloss) is elaborately designed to leverage the consistency along the time axis. Experiments conducted on IOD-Video dataset demonstrate that spatio-temporal aggregation can significantly improve the performance of IOD. We hope our work will attract further researches into this valuable yet challenging task. The code will be available at: \url{https://github.com/CalayZhou/IOD-Video}.

Categories: cs.CV

Keywords: benchmark dataset, task named insubstantial object detection, iod-video dataset demonstrate, spatio-temporal aggregation loss, spatio-temporal aggregation framework

Related articles: Most relevant | Search more

arXiv:1909.06441 [cs.CV] (Published 2019-09-13)

MinneApple: A Benchmark Dataset for Apple Detection and Segmentation

Nicolai Häni, Pravakar Roy, Volkan Isler

arXiv:1511.02459 [cs.CV] (Published 2015-11-08)

SCUT-FBP: A Benchmark Dataset for Facial Beauty Perception

Duorui Xie, Lingyu Liang, Lianwen Jin, Jie Xu, Mengru Li

arXiv:2408.08623 [cs.CV] (Published 2024-08-16)

SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis

Xingyue Lin, Xingjian Hu, Shuai Peng, Jianhua Zhu, Liangcai Gao