arXiv Analytics

Sign in

arXiv:2308.15000 [math.CO]AbstractReferencesReviewsResources

Optimal Data Distribution for Big-Data All-to-All Comparison using Finite Projective and Affine Planes

Joanne L. Hall, Wayne Kelly, Yu-Chu Tian

Published 2023-08-29Version 1

An All-to-All Comparison problem is where every element of a data set is compared with every other element. This is analogous to projective planes and affine planes where every pair of points share a common line. For large data sets, the comparison computations can be distributed across a cluster of computers. All-to-All Comparison does not fit the highly successful Map-Reduce pattern, so a new distributed computing framework is required. The principal challenge is to distribute the data in such a way that computations can be scheduled where the data already lies. This paper uses projective planes, affine planes and balanced incomplete block designs to design data distributions and schedule computations. The data distributions based on these geometric and combinatorial structures achieve minimal data replication whilst balancing the computational load across the cluster.

Related articles: Most relevant | Search more
arXiv:1604.04945 [math.CO] (Published 2016-04-18)
Affine planes, ternary rings, and examples of non-Desarguesian planes
arXiv:2411.19202 [math.CO] (Published 2024-11-28)
Points below a parabola in affine planes of prime order
arXiv:2210.11961 [math.CO] (Published 2022-10-21)
Sets of mutually orthogoval projective and affine planes