arXiv Analytics

Sign in

arXiv:1602.03619 [cs.LG]AbstractReferencesReviewsResources

Optimality of Belief Propagation for Crowdsourced Classification

Jungseul Ok, Sewoong Oh, Jinwoo Shin, Yung Yi

Published 2016-02-11Version 1

Crowdsourcing systems are popular for solving large-scale labelling tasks with low-paid (or even non-paid) workers. We study the problem of recovering the true labels from the possibly erroneous crowdsourced labels under the popular Dawid-Skene model. To address this inference problem, several algorithms have recently been proposed, but the best known guarantee is still significantly larger than the fundamental limit. We close this gap under a simple but canonical scenario where each worker is assigned at most two tasks. In particular, we introduce a tighter lower bound on the fundamental limit and prove that Belief Propagation (BP) exactly matches this lower bound. The guaranteed optimality of BP is the strongest in the sense that it is information-theoretically impossible for any other algorithm to correctly label a larger fraction of the tasks. In the general setting, when more than two tasks are assigned to each worker, we establish the dominance result on BP that it outperforms all existing algorithms with provable guarantees. Experimental results suggest that BP is close to optimal for all regimes considered, while all other algorithms show suboptimal performances in certain regimes.

Related articles: Most relevant | Search more
arXiv:1907.05444 [cs.LG] (Published 2019-07-11)
On the Optimality of Trees Generated by ID3
arXiv:1206.5291 [cs.LG] (Published 2012-06-20)
Improved Dynamic Schedules for Belief Propagation
arXiv:2111.00734 [cs.LG] (Published 2021-11-01, updated 2022-02-24)
Robust Deep Learning from Crowds with Belief Propagation