arXiv Analytics

Sign in

arXiv:2403.07605 [cs.CV]AbstractReferencesReviewsResources

Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation

Michael Ogezi, Ning Shi

Published 2024-03-12Version 1

In text-to-image generation, using negative prompts, which describe undesirable image characteristics, can significantly boost image quality. However, producing good negative prompts is manual and tedious. To address this, we propose NegOpt, a novel method for optimizing negative prompt generation toward enhanced image generation, using supervised fine-tuning and reinforcement learning. Our combined approach results in a substantial increase of 25% in Inception Score compared to other approaches and surpasses ground-truth negative prompts from the test set. Furthermore, with NegOpt we can preferentially optimize the metrics most important to us. Finally, we construct Negative Prompts DB, a dataset of negative prompts.

Related articles: Most relevant | Search more
arXiv:2404.08031 [cs.CV] (Published 2024-04-11)
Latent Guard: a Safety Framework for Text-to-image Generation
arXiv:2305.15328 [cs.CV] (Published 2023-05-24)
Visual Programming for Text-to-Image Generation and Evaluation
arXiv:2307.05134 [cs.CV] (Published 2023-07-11)
TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation