arXiv:2310.06968 Abstract | arXiv Analytics

arXiv:2310.06968 [cs.CV]Abstract References Reviews Resources

ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning

Alec Helbling, Evan Montoya, Duen Horng Chau

Published 2023-10-10Version 1

Recent text-to-image generative models can generate high-fidelity images from text prompts. However, these models struggle to consistently generate the same objects in different contexts with the same appearance. Consistent object generation is important to many downstream tasks like generating comic book illustrations with consistent characters and setting. Numerous approaches attempt to solve this problem by extending the vocabulary of diffusion models through fine-tuning. However, even lightweight fine-tuning approaches can be prohibitively expensive to run at scale and in real-time. We introduce a method called ObjectComposer for generating compositions of multiple objects that resemble user-specified images. Our approach is training-free, leveraging the abilities of preexisting models. We build upon the recent BLIP-Diffusion model, which can generate images of single objects specified by reference images. ObjectComposer enables the consistent generation of compositions containing multiple specific objects simultaneously, all without modifying the weights of the underlying models.

Categories: cs.CV, cs.LG

Keywords: multiple objects, consistent generation, objectcomposer, fine-tuning, compositions containing multiple specific objects

Related articles: Most relevant | Search more

arXiv:1706.06629 [cs.CV] (Published 2017-06-20)

Co-Fusion: Real-time Segmentation, Tracking and Fusion of Multiple Objects

Martin Rünz, Lourdes Agapito

arXiv:2307.11077 [cs.CV] (Published 2023-07-20)

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection

Ming Li et al.

arXiv:2407.12371 [cs.CV] (Published 2024-07-17)

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Xintao Lv et al.

arXiv Analytics

arXiv:2310.06968 [cs.CV]Abstract References Reviews Resources

ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning

Links

Toolbox

arXiv:2310.06968 [cs.CV]AbstractReferencesReviewsResources

ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning

Links

Toolbox

arXiv:2310.06968 [cs.CV]Abstract References Reviews Resources