arXiv Analytics

Sign in

arXiv:2105.04504 [stat.ML]AbstractReferencesReviewsResources

Deep Neural Networks as Point Estimates for Deep Gaussian Processes

Vincent Dutordoir, James Hensman, Mark van der Wilk, Carl Henrik Ek, Zoubin Ghahramani, Nicolas Durrande

Published 2021-05-10Version 1

Deep Gaussian processes (DGPs) have struggled for relevance in applications due to the challenges and cost associated with Bayesian inference. In this paper we propose a sparse variational approximation for DGPs for which the approximate posterior mean has the same mathematical structure as a Deep Neural Network (DNN). We make the forward pass through a DGP equivalent to a ReLU DNN by finding an interdomain transformation that represents the GP posterior mean as a sum of ReLU basis functions. This unification enables the initialisation and training of the DGP as a neural network, leveraging the well established practice in the deep learning community, and so greatly aiding the inference task. The experiments demonstrate improved accuracy and faster training compared to current DGP methods, while retaining favourable predictive uncertainties.

Related articles: Most relevant | Search more
arXiv:1907.02177 [stat.ML] (Published 2019-07-04)
Adaptive Approximation and Estimation of Deep Neural Network to Intrinsic Dimensionality
arXiv:1211.0358 [stat.ML] (Published 2012-11-02, updated 2013-03-23)
Deep Gaussian Processes
arXiv:2410.11113 [stat.ML] (Published 2024-10-14)
Statistical Properties of Deep Neural Networks with Dependent Data