arXiv Analytics

Sign in

arXiv:2011.00954 [cs.CV]AbstractReferencesReviewsResources

Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation

Kumar Shubham, Gopalakrishnan Venkatesh, Reijul Sachdev, Akshi, Dinesh Babu Jayagopi, G. Srinivasaraghavan

Published 2020-11-02Version 1

Learning a disentangled representation of the latent space has become one of the most fundamental problems studied in computer vision. Recently, many generative adversarial networks (GANs) have shown promising results in generating high fidelity images. However, studies to understand the semantic layout of the latent space of pre-trained models are still limited. Several works train conditional GANs to generate faces with required semantic attributes. Unfortunately, in these attempts often the generated output is not as photo-realistic as the state of the art models. Besides, they also require large computational resources and specific datasets to generate high fidelity images. In our work, we have formulated a Markov Decision Process (MDP) over the rich latent space of a pre-trained GAN model to learn a conditional policy for semantic manipulation along specific attributes under defined identity bounds. Further, we have defined a semantic age manipulation scheme using a locally linear approximation over the latent space. Results show that our learned policy can sample high fidelity images with required age variations, while at the same time preserve the identity of the person.

Related articles: Most relevant | Search more
arXiv:1907.10786 [cs.CV] (Published 2019-07-25)
Interpreting the Latent Space of GANs for Semantic Face Editing
arXiv:2206.13078 [cs.CV] (Published 2022-06-27)
Video2StyleGAN: Encoding Video in Latent Space for Manipulation
arXiv:2202.12929 [cs.CV] (Published 2022-02-25)
OptGAN: Optimizing and Interpreting the Latent Space of the Conditional Text-to-Image GANs