{ "id": "2005.10039", "version": "v1", "published": "2020-05-20T13:36:09.000Z", "updated": "2020-05-20T13:36:09.000Z", "title": "The Effects of Randomness on the Stability of Node Embeddings", "authors": [ "Tobias Schumacher", "Hinrikus Wolf", "Martin Ritzert", "Florian Lemmerich", "Jan Bachmann", "Florian Frantzen", "Max Klabunde", "Martin Grohe", "Markus Strohmaier" ], "categories": [ "cs.LG", "cs.SI", "stat.ML" ], "abstract": "We systematically evaluate the (in-)stability of state-of-the-art node embedding algorithms due to randomness, i.e., the random variation of their outcomes given identical algorithms and graphs. We apply five node embedding algorithms---HOPE, LINE, node2vec, SDNE, and GraphSAGE---to synthetic and empirical graphs and assess their stability under randomness with respect to (i) the geometry of embedding spaces as well as (ii) their performance in downstream tasks. We find significant instabilities in the geometry of embedding spaces independent of the centrality of a node. In the evaluation of downstream tasks, we find that the accuracy of node classification seems to be unaffected by random seeding while the actual classification of nodes can vary significantly. This suggests that instability effects need to be taken into account when working with node embeddings. Our work is relevant for researchers and engineers interested in the effectiveness, reliability, and reproducibility of node embedding approaches.", "revisions": [ { "version": "v1", "updated": "2020-05-20T13:36:09.000Z" } ], "analyses": { "keywords": [ "randomness", "downstream tasks", "state-of-the-art node embedding algorithms", "node embeddings algorithms-hope", "node embedding approaches" ], "note": { "typesetting": "TeX", "pages": 0, "language": "en", "license": "arXiv", "status": "editable" } } }