arXiv Analytics

Sign in

arXiv:2402.19186 [cs.CV]AbstractReferencesReviewsResources

Disentangling representations of retinal images with generative models

Sarah Müller, Lisa M. Koch, Hendrik P. A. Lensch, Philipp Berens

Published 2024-02-29, updated 2024-09-20Version 2

Retinal fundus images play a crucial role in the early detection of eye diseases. However, the impact of technical factors on these images can pose challenges for reliable AI applications in ophthalmology. For example, large fundus cohorts are often confounded by factors like camera type, bearing the risk of learning shortcuts rather than the causal relationships behind the image generation process. Here, we introduce a population model for retinal fundus images that effectively disentangles patient attributes from camera effects, enabling controllable and highly realistic image generation. To achieve this, we propose a disentanglement loss based on distance correlation. Through qualitative and quantitative analyses, we show that our models encode desired information in disentangled subspaces and enable controllable image generation based on the learned subspaces, demonstrating the effectiveness of our disentanglement loss. The project's code is publicly available: https://github.com/berenslab/disentangling-retinal-images.

Related articles: Most relevant | Search more
arXiv:1805.06605 [cs.CV] (Published 2018-05-17)
Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models
arXiv:1912.04564 [cs.CV] (Published 2019-12-10)
Towards Latent Space Optimality for Auto-Encoder Based Generative Models
arXiv:2210.06188 [cs.CV] (Published 2022-10-12)
Anomaly Detection using Generative Models and Sum-Product Networks in Mammography Scans