arXiv Analytics

Sign in

arXiv:2407.21703 [cs.CV]AbstractReferencesReviewsResources

Hyper-parameter tuning for text guided image editing

Shiwen Zhang

Published 2024-07-31Version 1

The test-time finetuning text-guided image editing method, Forgedit, is capable of tackling general and complex image editing problems given only the input image itself and the target text prompt. During finetuning stage, using the same set of finetuning hyper-paramters every time for every given image, Forgedit remembers and understands the input image in 30 seconds. During editing stage, the workflow of Forgedit might seem complicated. However, in fact, the editing process of Forgedit is not more complex than previous SOTA Imagic, yet completely solves the overfitting problem of Imagic. In this paper, we will elaborate the workflow of Forgedit editing stage with examples. We will show how to tune the hyper-parameters in an efficient way to obtain ideal editing results.

Comments: Codes are available at https://github.com/witcherofresearch/Forgedit/
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:1211.2881 [cs.CV] (Published 2012-11-13, updated 2012-11-28)
Deep Attribute Networks
arXiv:1905.03556 [cs.CV] (Published 2019-05-09)
Cycle-IR: Deep Cyclic Image Retargeting
arXiv:2005.14600 [cs.CV] (Published 2020-05-29)
Fixed-size Objects Encoding for Visual Relationship Detection