arXiv Analytics

Sign in

arXiv:1708.01611 [cs.LG]AbstractReferencesReviewsResources

Identification of Probabilities

Paul M. B. Vitanyi, Nick Chater

Published 2017-08-04Version 1

Within psychology, neuroscience and artificial intelligence, there has been increasing interest in the proposal that the brain builds probabilistic models of sensory and linguistic input: that is, to infer a probabilistic model from a sample. The practical problems of such inference are substantial: the brain has limited data and restricted computational resources. But there is a more fundamental question: is the problem of inferring a probabilistic model from a sample possible even in principle? We explore this question and find some surprisingly positive and general results. First, for a broad class of probability distributions characterised by computability restrictions, we specify a learning algorithm that will almost surely identify a probability distribution in the limit given a finite i.i.d. sample of sufficient but unknown length. This is similarly shown to hold for sequences generated by a broad class of Markov chains, subject to computability assumptions. The technical tool is the strong law of large numbers. Second, for a large class of dependent sequences, we specify an algorithm which identifies in the limit a computable measure for which the sequence is typical, in the sense of Martin-Lof (there may be more than one such measure). The technical tool is the theory of Kolmogorov complexity. We analyse the associated predictions in both cases. We also briefly consider special cases, including language learning, and wider theoretical implications for psychology.

Comments: 31 pages LaTeX. arXiv admin note: substantial text overlap with arXiv:1311.7385
Journal: Journal of Mathematical Psychology 51, 135-163 (2007)
Categories: cs.LG, cs.AI
Related articles: Most relevant | Search more
arXiv:2003.03778 [cs.LG] (Published 2020-03-08)
Adversarial Attacks on Probabilistic Autoregressive Forecasting Models
arXiv:2102.04593 [cs.LG] (Published 2021-02-09)
Regularized Generative Adversarial Network
arXiv:1905.08513 [cs.LG] (Published 2019-05-21)
Stochastic Inverse Reinforcement Learning