arXiv Analytics

Sign in

arXiv:1402.4306 [stat.ML]AbstractReferencesReviewsResources

Student-t Processes as Alternatives to Gaussian Processes

Amar Shah, Andrew Gordon Wilson, Zoubin Ghahramani

Published 2014-02-18, updated 2014-02-19Version 2

We investigate the Student-t process as an alternative to the Gaussian process as a nonparametric prior over functions. We derive closed form expressions for the marginal likelihood and predictive distribution of a Student-t process, by integrating away an inverse Wishart process prior over the covariance kernel of a Gaussian process model. We show surprising equivalences between different hierarchical Gaussian process models leading to Student-t processes, and derive a new sampling scheme for the inverse Wishart process, which helps elucidate these equivalences. Overall, we show that a Student-t process can retain the attractive properties of a Gaussian process -- a nonparametric representation, analytic marginal and predictive distributions, and easy model selection through covariance kernels -- but has enhanced flexibility, and predictive covariances that, unlike a Gaussian process, explicitly depend on the values of training observations. We verify empirically that a Student-t process is especially useful in situations where there are changes in covariance structure, or in applications like Bayesian optimization, where accurate predictive covariances are critical for good performance. These advantages come at no additional computational cost over Gaussian processes.

Comments: 13 pages, 6 figures, 1 table. To appear in "The Seventeenth International Conference on Artificial Intelligence and Statistics (AISTATS), 2014."
Categories: stat.ML, cs.AI, cs.LG, stat.ME
Related articles: Most relevant | Search more
arXiv:2312.00742 [stat.ML] (Published 2023-12-01)
Scalable Meta-Learning with Gaussian Processes
arXiv:1805.08463 [stat.ML] (Published 2018-05-22)
Variational Learning on Aggregate Outputs with Gaussian Processes
arXiv:2506.17366 [stat.ML] (Published 2025-06-20)
Gaussian Processes and Reproducing Kernels: Connections and Equivalences