{ "id": "2311.13548", "version": "v1", "published": "2023-11-22T17:44:18.000Z", "updated": "2023-11-22T17:44:18.000Z", "title": "Efficient Numerical Integration in Reproducing Kernel Hilbert Spaces via Leverage Scores Sampling", "authors": [ "Antoine Chatalic", "Nicolas Schreuder", "Ernesto De Vito", "Lorenzo Rosasco" ], "comment": "46 pages, 5 figures. Submitted to JMLR", "categories": [ "stat.ML", "cs.LG", "cs.NA", "math.NA" ], "abstract": "In this work we consider the problem of numerical integration, i.e., approximating integrals with respect to a target probability measure using only pointwise evaluations of the integrand. We focus on the setting in which the target distribution is only accessible through a set of $n$ i.i.d. observations, and the integrand belongs to a reproducing kernel Hilbert space. We propose an efficient procedure which exploits a small i.i.d. random subset of $m