arXiv Analytics

Sign in

arXiv:2211.09788 [cs.CV]AbstractReferencesReviewsResources

DiffusionDet: Diffusion Model for Object Detection

Shoufa Chen, Peize Sun, Yibing Song, Ping Luo

Published 2022-11-17Version 1

We propose DiffusionDet, a new framework that formulates object detection as a denoising diffusion process from noisy boxes to object boxes. During training stage, object boxes diffuse from ground-truth boxes to random distribution, and the model learns to reverse this noising process. In inference, the model refines a set of randomly generated boxes to the output results in a progressive way. The extensive evaluations on the standard benchmarks, including MS-COCO and LVIS, show that DiffusionDet achieves favorable performance compared to previous well-established detectors. Our work brings two important findings in object detection. First, random boxes, although drastically different from pre-defined anchors or learned queries, are also effective object candidates. Second, object detection, one of the representative perception tasks, can be solved by a generative way. Our code is available at https://github.com/ShoufaChen/DiffusionDet.

Comments: Tech report. Code is available at https://github.com/ShoufaChen/DiffusionDet
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:2305.13840 [cs.CV] (Published 2023-05-23)
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen et al.
arXiv:2309.07254 [cs.CV] (Published 2023-09-13)
Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement
arXiv:2309.00287 [cs.CV] (Published 2023-09-01)
Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution