arXiv:2408.08086 [cs.CV]

Single-image coherent reconstruction of objects and humans

Sarthak Batra, Partha P. Chakrabarti, Simon Hadfield, Armin Mustafa

Published 2024-08-15, Version 1

Existing methods for reconstructing objects and humans from a monocular image suffer from severe mesh collisions and performance limitations for interacting occluding objects. This paper introduces a method to obtain a globally consistent 3D reconstruction of interacting objects and people from a single image. Our contributions include: 1) an optimization framework, featuring a collision loss, tailored to handle human-object and human-human interactions, ensuring spatially coherent scene reconstruction; and 2) a novel technique to robustly estimate 6 degrees of freedom (DOF) poses, specifically for heavily occluded objects, exploiting image inpainting. Notably, our proposed method operates effectively on images from real-world scenarios, without necessitating scene or object-level 3D supervision. Extensive qualitative and quantitative evaluation against existing methods demonstrates a significant reduction in collisions in the final reconstructions of scenes with multiple interacting humans and objects and a more coherent scene reconstruction.
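
The abstract does not specify the form of the collision loss, so the following is only an illustrative sketch of the general idea of penalizing interpenetration during optimization. It assumes each human or object is approximated by a bounding sphere and sums the squared overlap depth over all pairs. The function name `sphere_collision_penalty` and the sphere approximation are hypothetical stand-ins, not the authors' formulation.

```python
import math


def sphere_collision_penalty(centers, radii):
    """Sum of squared overlap depths over all pairs of bounding spheres.

    Hypothetical stand-in for a collision loss: each scene entity is
    approximated by one sphere (an assumption, not the paper's method).
    """
    penalty = 0.0
    for i in range(len(centers)):
        for j in range(i + 1, len(centers)):
            dist = math.dist(centers[i], centers[j])
            # Positive overlap means the two spheres interpenetrate.
            overlap = radii[i] + radii[j] - dist
            if overlap > 0.0:
                penalty += overlap ** 2
    return penalty


# Two unit spheres one unit apart interpenetrate; three units apart they do not.
overlapping = sphere_collision_penalty([(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)], [1.0, 1.0])
separated = sphere_collision_penalty([(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)], [1.0, 1.0])
```

A penalty of this kind is zero for non-colliding configurations and grows with overlap depth, so adding it to an optimization objective pushes intersecting entities apart without affecting those already separated.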
