arXiv Analytics

Sign in

arXiv:2112.09002 [math.OC]AbstractReferencesReviewsResources

On the Finite-Time Complexity and Practical Computation of Approximate Stationarity Concepts of Lipschitz Functions

Lai Tian, Kaiwen Zhou, Anthony Man-Cho So

Published 2021-12-16, updated 2022-08-01Version 2

We report a practical finite-time algorithmic scheme to compute approximately stationary points for nonconvex nonsmooth Lipschitz functions. In particular, we are interested in two kinds of approximate stationarity notions for nonconvex nonsmooth problems, i.e., Goldstein approximate stationarity (GAS) and near-approximate stationarity (NAS). For GAS, our scheme removes the unrealistic subgradient selection oracle assumption in (Zhang et al., 2020, Assumption 1) and computes GAS with the same finite-time complexity. For NAS, Davis & Drusvyatskiy (2019) showed that $\rho$-weakly convex functions admit finite-time computation, while Tian & So (2021) provided the matching impossibility results of dimension-free finite-time complexity for first-order methods. Complement to these developments, in this paper, we isolate a new class of functions that could be Clarke irregular (and thus not weakly convex anymore) and show that our new algorithmic scheme can compute NAS points for functions in that class within finite time. To demonstrate the wide applicability of our new theoretical framework, we show that $\rho$-margin SVM, $1$-layer, and $2$-layer ReLU neural networks, all being Clarke irregular, satisfy our new conditions.

Related articles: Most relevant | Search more
arXiv:2005.09760 [math.OC] (Published 2020-05-19)
Some remarks on a coupling method for the practical computation of homogenized coefficients
arXiv:2406.19723 [math.OC] (Published 2024-06-28)
LIPO+: Frugal Global Optimization for Lipschitz Functions
arXiv:2410.03023 [math.OC] (Published 2024-10-03, updated 2024-10-14)
$γ$-Competitiveness: An Approach to Multi-Objective Optimization with High Computation Costs in Lipschitz Functions