arXiv:2011.00954 Abstract | arXiv Analytics

arXiv:2011.00954 [cs.CV]Abstract References Reviews Resources

Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation

Kumar Shubham, Gopalakrishnan Venkatesh, Reijul Sachdev, Akshi, Dinesh Babu Jayagopi, G. Srinivasaraghavan

Published 2020-11-02Version 1

Learning a disentangled representation of the latent space has become one of the most fundamental problems studied in computer vision. Recently, many generative adversarial networks (GANs) have shown promising results in generating high fidelity images. However, studies to understand the semantic layout of the latent space of pre-trained models are still limited. Several works train conditional GANs to generate faces with required semantic attributes. Unfortunately, in these attempts often the generated output is not as photo-realistic as the state of the art models. Besides, they also require large computational resources and specific datasets to generate high fidelity images. In our work, we have formulated a Markov Decision Process (MDP) over the rich latent space of a pre-trained GAN model to learn a conditional policy for semantic manipulation along specific attributes under defined identity bounds. Further, we have defined a semantic age manipulation scheme using a locally linear approximation over the latent space. Results show that our learned policy can sample high fidelity images with required age variations, while at the same time preserve the identity of the person.

Comments: 12 pages, 8 images

Categories: cs.CV

Keywords: latent space, deep reinforcement learning policy, semantic age manipulation, pre-trained gan, generate high fidelity images

Related articles: Most relevant | Search more

arXiv:1907.10786 [cs.CV] (Published 2019-07-25)

Interpreting the Latent Space of GANs for Semantic Face Editing

Yujun Shen, Jinjin Gu, Xiaoou Tang, Bolei Zhou

arXiv:2206.13078 [cs.CV] (Published 2022-06-27)

Video2StyleGAN: Encoding Video in Latent Space for Manipulation

Jiyang Yu, Jingen Liu, Jing Huang, Wei Zhang, Tao Mei

arXiv:2202.12929 [cs.CV] (Published 2022-02-25)

OptGAN: Optimizing and Interpreting the Latent Space of the Conditional Text-to-Image GANs