arXiv Analytics

Sign in

arXiv:math/0211020 [math.PR]AbstractReferencesReviewsResources

Entropy and the Law of Small Numbers

Ioannis Kontoyiannis, Peter Harremoes, Oliver Johnson

Published 2002-11-01, updated 2004-11-17Version 2

Two new information-theoretic methods are introduced for establishing Poisson approximation inequalities. First, using only elementary information-theoretic techniques it is shown that, when $S_n=\sum_{i=1}^nX_i$ is the sum of the (possibly dependent) binary random variables $X_1,X_2,...,X_n$, with $E(X_i)=p_i$ and $E(S_n)=\la$, then \ben D(P_{S_n}\|\Pol)\leq \sum_{i=1}^n p_i^2 + \Big[\sum_{i=1}^nH(X_i) - H(X_1,X_2,..., X_n)\Big], \een where $D(P_{S_n}\|{Po}(\la))$ is the relative entropy between the distribution of $S_n$ and the Poisson($\la$) distribution. The first term in this bound measures the individual smallness of the $X_i$ and the second term measures their dependence. A general method is outlined for obtaining corresponding bounds when approximating the distribution of a sum of general discrete random variables by an infinitely divisible distribution. Second, in the particular case when the $X_i$ are independent, the following sharper bound is established, \ben D(P_{S_n}\|\Pol)\leq \frac{1}{\lambda} \sum_{i=1}^n \frac{p_i^3}{1-p_i}, % \label{eq:abs2} \een and it is also generalized to the case when the $X_i$ are general integer-valued random variables. Its proof is based on the derivation of a subadditivity property for a new discrete version of the Fisher information, and uses a recent logarithmic Sobolev inequality for the Poisson distribution.

Comments: 15 pages. To appear, IEEE Trans Inform Theory
Journal: IEEE Transactions on Information Theory, Vol 51/2, 2005, pages 466-472
Categories: math.PR
Related articles: Most relevant | Search more
arXiv:0903.4373 [math.PR] (Published 2009-03-25, updated 2009-03-26)
A note on the distribution of the maximum of a set of Poisson random variables
arXiv:1507.07576 [math.PR] (Published 2015-07-27)
Grand Lebesgue norm estimation for binary random variables, with applications
arXiv:1111.0151 [math.PR] (Published 2011-11-01)
The Distribution of Mixing Times in Markov Chains