arXiv:1811.09409 Abstract | arXiv Analytics

arXiv:1811.09409 [stat.ML]Abstract References Reviews Resources

Learning Multiple Defaults for Machine Learning Algorithms

Florian Pfisterer, Jan N. van Rijn, Philipp Probst, Andreas Müller, Bernd Bischl

Published 2018-11-23Version 1

The performance of modern machine learning methods highly depends on their hyperparameter configurations. One simple way of selecting a configuration is to use default settings, often proposed along with the publication and implementation of a new algorithm. Those default values are usually chosen in an ad-hoc manner to work good enough on a wide variety of datasets. To address this problem, different automatic hyperparameter configuration algorithms have been proposed, which select an optimal configuration per dataset. This principled approach usually improves performance, but adds additional algorithmic complexity and computational costs to the training procedure. As an alternative to this, we propose learning a set of complementary default values from a large database of prior empirical results. Selecting an appropriate configuration on a new dataset then requires only a simple, efficient and embarrassingly parallel search over this set. We demonstrate the effectiveness and efficiency of the approach we propose in comparison to random search and Bayesian Optimization.

Categories: stat.ML, cs.LG

Keywords: machine learning algorithms, learning multiple defaults, machine learning methods, adds additional algorithmic complexity, automatic hyperparameter configuration algorithms

Related articles: Most relevant | Search more

arXiv:1206.2944 [stat.ML] (Published 2012-06-13, updated 2012-08-29)

Practical Bayesian Optimization of Machine Learning Algorithms

Jasper Snoek, Hugo Larochelle, Ryan P. Adams

arXiv:2303.07139 [stat.ML] (Published 2023-03-13, updated 2024-06-06)

Comparing statistical and machine learning methods for time series forecasting in data-driven logistics -- A simulation study

Lena Schmid, Moritz Roidl, Markus Pauly

arXiv:2106.09512 [stat.ML] (Published 2021-06-17)

Machine learning methods for postprocessing ensemble forecasts of wind gusts: A systematic comparison

Benedikt Schulz, Sebastian Lerch