arXiv:1611.04982 [math.OC]

Oracle Complexity of Second-Order Methods for Finite-Sum Problems

Yossi Arjevani, Ohad Shamir

Published 2016-11-15 (Version 1)

Finite-sum optimization problems are ubiquitous in machine learning, and are commonly solved using first-order methods, which rely on gradient computations. Recently, there has been growing interest in \emph{second-order} methods, which rely on both gradients and Hessians. In principle, second-order methods can require far fewer iterations than first-order methods, and hold the promise of more efficient algorithms. Although computing and manipulating Hessians is prohibitive for high-dimensional problems in general, the Hessians of the individual functions in finite-sum problems can often be computed efficiently, e.g., because they possess a low-rank structure. Can second-order information indeed be used to solve such problems more efficiently? In this paper, we provide evidence that the answer -- perhaps surprisingly -- is negative, at least in terms of worst-case guarantees. However, we also discuss what additional assumptions and algorithmic approaches might potentially circumvent this negative result.
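To make the low-rank point concrete, here is a minimal sketch (an illustration, not taken from the paper) of a finite-sum problem, regularized logistic regression, where each individual Hessian is a rank-one matrix plus a scaled identity and can therefore be formed and applied cheaply. The function names, data, and the single Newton step below are all assumptions for illustration, not the authors' construction.

    import numpy as np

    def component_grad_hess(x, a_i, b_i, lam=0.0):
        # f_i(x) = log(1 + exp(-b_i * <a_i, x>)) + (lam/2) * ||x||^2
        # Its Hessian is s * a_i a_i^T + lam * I with s in [0, 1/4]:
        # rank-one up to regularization, so storing (s, a_i) suffices.
        margin = b_i * a_i.dot(x)
        sigma = 1.0 / (1.0 + np.exp(margin))      # = sigmoid(-margin)
        grad = -sigma * b_i * a_i + lam * x
        s = sigma * (1.0 - sigma)
        hess = s * np.outer(a_i, a_i) + lam * np.eye(x.size)
        return grad, hess

    # The full gradient and Hessian of F(x) = (1/n) * sum_i f_i(x)
    # are simply averages of the component quantities.
    rng = np.random.default_rng(0)
    n, d = 50, 5
    A = rng.standard_normal((n, d))               # one data vector a_i per component
    b = rng.choice([-1.0, 1.0], size=n)           # binary labels
    x = np.zeros(d)

    grads, hessians = zip(*(component_grad_hess(x, A[i], b[i], lam=0.1)
                            for i in range(n)))
    full_grad = np.mean(grads, axis=0)
    full_hess = np.mean(hessians, axis=0)

    # One Newton step: the kind of second-order update whose worst-case
    # advantage over first-order methods the paper calls into question.
    x = x - np.linalg.solve(full_hess, full_grad)

Note that by keeping each Hessian in factored form (the scalar s and the vector a_i), a Hessian-vector product costs only O(d) rather than O(d^2); this is the kind of efficient access to second-order information the abstract alludes to.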

Related articles:
arXiv:1705.07260 [math.OC] (Published 2017-05-20)
Oracle Complexity of Second-Order Methods for Smooth Convex Optimization
arXiv:2103.08280 [math.OC] (Published 2021-03-15)
Lower Complexity Bounds of Finite-Sum Optimization Problems: The Results and Construction
arXiv:1602.02915 [math.OC] (Published 2016-02-09)
Calculus of the exponent of Kurdyka-Ɓojasiewicz inequality and its applications to linear convergence of first-order methods