arXiv Analytics

Sign in

arXiv:1104.1872 [math.OC]AbstractReferencesReviewsResources

Convex and Network Flow Optimization for Structured Sparsity

Julien Mairal, Rodolphe Jenatton, Guillaume Obozinski, Francis Bach

Published 2011-04-11, updated 2011-09-16Version 3

We consider a class of learning problems regularized by a structured sparsity-inducing norm defined as the sum of l_2- or l_infinity-norms over groups of variables. Whereas much effort has been put in developing fast optimization techniques when the groups are disjoint or embedded in a hierarchy, we address here the case of general overlapping groups. To this end, we present two different strategies: On the one hand, we show that the proximal operator associated with a sum of l_infinity-norms can be computed exactly in polynomial time by solving a quadratic min-cost flow problem, allowing the use of accelerated proximal gradient methods. On the other hand, we use proximal splitting techniques, and address an equivalent formulation with non-overlapping groups, but in higher dimension and with additional constraints. We propose efficient and scalable algorithms exploiting these two strategies, which are significantly faster than alternative approaches. We illustrate these methods with several problems such as CUR matrix factorization, multi-task learning of tree-structured dictionaries, background subtraction in video sequences, image denoising with wavelets, and topographic dictionary learning of natural image patches.

Comments: to appear in the Journal of Machine Learning Research (JMLR)
Journal: Journal of Machine Learning Research 12 (2011) 2681?2720
Categories: math.OC, cs.LG, stat.ML
Related articles: Most relevant | Search more
arXiv:1105.0728 [math.OC] (Published 2011-05-04, updated 2011-12-15)
Structured Sparsity via Alternating Direction Methods
arXiv:1508.04096 [math.OC] (Published 2015-08-17)
Distributed SDDM Solvers: Theory & Applications
arXiv:math/0611498 [math.OC] (Published 2006-11-16)
A note on the representation of positive polynomials with structured sparsity