arXiv:0711.1452 Abstract | arXiv Analytics

arXiv:0711.1452 [cond-mat.dis-nn]Abstract References Reviews Resources

(Un)detectable cluster structure in sparse networks

Published 2007-11-09Version 1

We study the problem of recovering a known cluster structure in a sparse network, also known as the planted partitioning problem, by means of statistical mechanics. We find a sharp transition from un-recoverable to recoverable structure as a function of the separation of the clusters. For multivariate data, such transitions have been observed frequently, but always as a function of the number of data points provided, i.e. given a large enough data set, two point clouds can always be recognized as different clusters, as long as their separation is non-zero. In contrast, for the sparse networks studied here, a cluster structure remains undetectable even in an infinitely large network if a critical separation is not exceeded. We give analytic formulas for this critical separation as a function of the degree distribution of the network and calculate the shape of the recoverability-transition. Our findings have implications for unsupervised learning and data-mining in relational data bases and provide bounds on the achievable performance of graph clustering algorithms.

Comments: 4 Pages, 2 Figures

Journal: Phys. Rev. Lett. 101, 078701 (2008)

DOI: 10.1103/PhysRevLett.101.078701

Categories: cond-mat.dis-nn, cond-mat.stat-mech

Subjects: 89.75.Hc, 05.50.+q, 89.65.-s

Keywords: sparse network, detectable cluster structure, relational data bases, critical separation, data points

Tags: journal article

Related articles:

arXiv:2204.06466 [cond-mat.dis-nn] (Published 2022-04-13)

Grand canonical ensembles of sparse networks and Bayesian inference

Ginestra Bianconi

arXiv:cond-mat/0201256 (Published 2002-01-15)