arXiv:2003.00617 Abstract | arXiv Analytics

arXiv:2003.00617 [stat.ML]Abstract References Reviews Resources

Approximate Cross-validation: Guarantees for Model Assessment and Selection

Ashia Wilson, Maximilian Kasy, Lester Mackey

Published 2020-03-02Version 1

Cross-validation (CV) is a popular approach for assessing and selecting predictive models. However, when the number of folds is large, CV suffers from a need to repeatedly refit a learning procedure on a large number of training datasets. Recent work in empirical risk minimization (ERM) approximates the expensive refitting with a single Newton step warm-started from the full training set optimizer. While this can greatly reduce runtime, several open questions remain including whether these approximations lead to faithful model selection and whether they are suitable for non-smooth objectives. We address these questions with three main contributions: (i) we provide uniform non-asymptotic, deterministic model assessment guarantees for approximate CV; (ii) we show that (roughly) the same conditions also guarantee model selection performance comparable to CV; (iii) we provide a proximal Newton extension of the approximate CV framework for non-smooth prediction problems and develop improved assessment guarantees for problems such as l1-regularized ERM.

Comments: 8 pages, 3 figures

Categories: stat.ML, cs.LG

Keywords: approximate cross-validation, deterministic model assessment guarantees, full training set optimizer, approximate cv framework, guarantee model selection performance comparable

Related articles:

arXiv:2006.12669 [stat.ML] (Published 2020-06-23)

Approximate Cross-Validation for Structured Models

Soumya Ghosh, William T. Stephenson, Tin D. Nguyen, Sameer K. Deshpande, Tamara Broderick

arXiv:2008.10547 [stat.ML] (Published 2020-08-24)

Approximate Cross-Validation with Low-Rank Data in High Dimensions

William T. Stephenson, Madeleine Udell, Tamara Broderick

arXiv Analytics

arXiv:2003.00617 [stat.ML]Abstract References Reviews Resources

Approximate Cross-validation: Guarantees for Model Assessment and Selection

Links

Toolbox

arXiv:2003.00617 [stat.ML]AbstractReferencesReviewsResources

Approximate Cross-validation: Guarantees for Model Assessment and Selection

Links

Toolbox

arXiv:2003.00617 [stat.ML]Abstract References Reviews Resources