arXiv:2304.07658 [stat.ML]

Dimensionality Reduction as Probabilistic Inference

Aditya Ravuri, Francisco Vargas, Vidhi Lalchand, Neil D. Lawrence

Published 2023-04-15 (Version 1)

Dimensionality reduction (DR) algorithms compress high-dimensional data into a lower-dimensional representation while preserving important features of the data. DR is a critical step in many analysis pipelines, as it enables visualisation, noise reduction, and efficient downstream processing of the data. In this work, we introduce ProbDR, a variational framework that interprets a wide range of classical DR algorithms as probabilistic inference algorithms. ProbDR encompasses PCA, CMDS, LLE, LE, MVU, diffusion maps, kPCA, Isomap, (t-)SNE, and UMAP. In our framework, a low-dimensional latent variable is used to construct a covariance, precision, or graph Laplacian matrix, which serves as part of a generative model for the data. Inference is performed by optimising an evidence lower bound. We demonstrate the internal consistency of our framework and show that it enables the use of probabilistic programming languages (PPLs) for DR. Additionally, we illustrate that the framework facilitates reasoning about unseen data and argue that our generative models approximate Gaussian processes (GPs) on manifolds. By providing a unified view of DR, our framework facilitates communication, reasoning about uncertainties, model composition, and extensions, particularly when domain knowledge is present.
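To make the construction concrete, below is a minimal sketch of the simplest instance of this idea, corresponding to the PCA (dual probabilistic PCA) case: a latent matrix Z parameterises a covariance K = Z Z^T + sigma^2 I, each of the d data dimensions is modelled as a zero-mean Gaussian with covariance K, and Z is found by maximising the evidence (for this Gaussian case the bound is tight). This is an illustrative reading of the abstract, not the authors' implementation; the use of PyTorch, the function name fit_probdr_pca, and all hyperparameter values are assumptions.

import torch

def fit_probdr_pca(Y, latent_dim=2, n_steps=500, lr=1e-2):
    """Y: (n, d) centred data tensor. Returns an (n, latent_dim) embedding.
    Illustrative sketch of the ProbDR idea for the PCA case; not the paper's code."""
    n, d = Y.shape
    Z = torch.randn(n, latent_dim, requires_grad=True)      # latent coordinates
    log_noise = torch.zeros((), requires_grad=True)         # log of the noise std
    opt = torch.optim.Adam([Z, log_noise], lr=lr)
    for _ in range(n_steps):
        opt.zero_grad()
        # the latent variable parameterises the covariance of the generative model,
        # with a small jitter term added for numerical stability
        K = Z @ Z.T + (torch.exp(2 * log_noise) + 1e-6) * torch.eye(n)
        mvn = torch.distributions.MultivariateNormal(
            torch.zeros(n), covariance_matrix=K)
        # each of the d data dimensions is an i.i.d. draw from N(0, K);
        # minimising the negative log-evidence maximises the (tight) ELBO here
        loss = -mvn.log_prob(Y.T).sum()
        loss.backward()
        opt.step()
    return Z.detach()

Given a data matrix Y of shape (n, d), calling fit_probdr_pca(Y - Y.mean(0)) would recover a two-dimensional embedding. Presumably, other members of the family swap in a different construction for the covariance or precision (e.g. a graph Laplacian) while keeping the same inference scheme.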
