arXiv:2305.16703 Abstract | arXiv Analytics

arXiv:2305.16703 [stat.ML]Abstract References Reviews Resources

Sources of Uncertainty in Machine Learning -- A Statisticians' View

Cornelia Gruber, Patrick Oliver Schenk, Malte Schierholz, Frauke Kreuter, Göran Kauermann

Published 2023-05-26Version 1

Machine Learning and Deep Learning have achieved an impressive standard today, enabling us to answer questions that were inconceivable a few years ago. Besides these successes, it becomes clear, that beyond pure prediction, which is the primary strength of most supervised machine learning algorithms, the quantification of uncertainty is relevant and necessary as well. While first concepts and ideas in this direction have emerged in recent years, this paper adopts a conceptual perspective and examines possible sources of uncertainty. By adopting the viewpoint of a statistician, we discuss the concepts of aleatoric and epistemic uncertainty, which are more commonly associated with machine learning. The paper aims to formalize the two types of uncertainty and demonstrates that sources of uncertainty are miscellaneous and can not always be decomposed into aleatoric and epistemic. Drawing parallels between statistical concepts and uncertainty in machine learning, we also demonstrate the role of data and their influence on uncertainty.

Categories: stat.ML, cs.LG

Keywords: statistician, pure prediction, primary strength, supervised machine learning algorithms, demonstrate

Related articles: Most relevant | Search more

arXiv:2007.14528 [stat.ML] (Published 2020-07-28)

Surrogate Locally-Interpretable Models with Supervised Machine Learning Algorithms

Linwei Hu, Jie Chen, Vijayan N. Nair, Agus Sudjianto

arXiv:2204.12868 [stat.ML] (Published 2022-04-27)

Performance and Interpretability Comparisons of Supervised Machine Learning Algorithms: An Empirical Study

Alice J. Liu, Linwei Hu, Jie Chen, Vijayan Nair

arXiv:1802.10510 [stat.ML] (Published 2018-02-28)

Decision functions from supervised machine learning algorithms as collective variables for accelerating molecular simulations

Mohammad M. Sultan, Vijay S. Pande