arXiv:2107.04481 [cs.CV]

Semantic and Geometric Unfolding of StyleGAN Latent Space

Mustafa Shukor, Xu Yao, Bharath Bhushan Damodaran, Pierre Hellier

Published 2021-07-09, Version 1

Generative adversarial networks (GANs) have proven to be surprisingly efficient for image editing by inverting and manipulating the latent code corresponding to a natural image. This property emerges from the disentangled nature of the latent space. In this paper, we identify two geometric limitations of this latent space: (a) Euclidean distances differ from perceptual distances between images, and (b) disentanglement is not optimal, and separating facial attributes with a linear model is a limiting hypothesis. We thus propose a new method that learns a proxy latent representation using normalizing flows to remedy these limitations, and show that this leads to a more efficient space for face image editing.
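As a rough illustration of what "normalizing flow" means here, the following minimal sketch implements a single affine coupling layer, a common building block of such flows. It is not the architecture from the paper: the function names, the toy `WEIGHTS`, and the 4-dimensional "latent code" are all assumptions made for this example. The point it shows is only that the mapping is exactly invertible, so a code moved into the proxy space can be mapped back to the original latent space.

```python
import math

def _scale_and_shift(x1, weights):
    # Hypothetical stand-in for a learned network: a fixed linear map,
    # with tanh bounding the log-scale so exp() stays well-behaved.
    s = [math.tanh(sum(w * v for w, v in zip(row, x1))) for row in weights["s"]]
    t = [sum(w * v for w, v in zip(row, x1)) for row in weights["t"]]
    return s, t

def coupling_forward(x, weights):
    """Map x to the proxy space; returns (y, log|det Jacobian|)."""
    half = len(x) // 2
    x1, x2 = x[:half], x[half:]
    s, t = _scale_and_shift(x1, weights)
    # First half passes through unchanged; second half is scaled and shifted.
    y2 = [b * math.exp(si) + ti for b, si, ti in zip(x2, s, t)]
    return x1 + y2, sum(s)

def coupling_inverse(y, weights):
    """Exact inverse of coupling_forward."""
    half = len(y) // 2
    y1, y2 = y[:half], y[half:]
    s, t = _scale_and_shift(y1, weights)
    x2 = [(b - ti) * math.exp(-si) for b, si, ti in zip(y2, s, t)]
    return y1 + x2

# Toy parameters and a toy 4-dimensional latent code (illustrative only).
WEIGHTS = {"s": [[0.5, -0.3], [0.1, 0.8]], "t": [[1.0, 0.2], [-0.4, 0.6]]}
w = [0.3, -1.2, 0.7, 2.0]
z, log_det = coupling_forward(w, WEIGHTS)
w_back = coupling_inverse(z, WEIGHTS)
```

A practical flow would stack many such layers with learned parameters and alternate which half is transformed. This sketch only demonstrates the invertibility that makes a proxy latent space usable for editing.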

Comments: 16 pages

Categories: cs.CV

Keywords: stylegan latent space, geometric unfolding, proxy latent representation, facial attribute separation, image perceptual distance

Related articles:

arXiv:2206.14892 [cs.CV] (Published 2022-06-29)

Semantic Unfolding of StyleGAN Latent Space

Mustafa Shukor, Xu Yao, Bharath Bhushan Damodaran, Pierre Hellier

arXiv:1904.03189 [cs.CV] (Published 2019-04-05)

Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?

Rameen Abdal, Yipeng Qin, Peter Wonka

arXiv:2205.06102 [cs.CV] (Published 2022-05-12)

Tensor-based Emotion Editing in the StyleGAN Latent Space