arXiv Analytics

Sign in

arXiv:2107.04481 [cs.CV]AbstractReferencesReviewsResources

Semantic and Geometric Unfolding of StyleGAN Latent Space

Mustafa Shukor, Xu Yao, Bharath Bhushan Damodaran, Pierre Hellier

Published 2021-07-09Version 1

Generative adversarial networks (GANs) have proven to be surprisingly efficient for image editing by inverting and manipulating the latent code corresponding to a natural image. This property emerges from the disentangled nature of the latent space. In this paper, we identify two geometric limitations of such latent space: (a) euclidean distances differ from image perceptual distance, and (b) disentanglement is not optimal and facial attribute separation using linear model is a limiting hypothesis. We thus propose a new method to learn a proxy latent representation using normalizing flows to remedy these limitations, and show that this leads to a more efficient space for face image editing.

Related articles: Most relevant | Search more
arXiv:2206.14892 [cs.CV] (Published 2022-06-29)
Semantic Unfolding of StyleGAN Latent Space
arXiv:1904.03189 [cs.CV] (Published 2019-04-05)
Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?
arXiv:2205.06102 [cs.CV] (Published 2022-05-12)
Tensor-based Emotion Editing in the StyleGAN Latent Space