arXiv Analytics

Sign in

arXiv:2010.14907 [cs.LG]AbstractReferencesReviewsResources

Online feature selection for rapid, low-overhead learning in networked systems

Xiaoxuan Wang, Forough Shahab Samani, Rolf Stadler

Published 2020-10-28Version 1

Data-driven functions for operation and management often require measurements collected through monitoring for model training and prediction. The number of data sources can be very large, which requires a significant communication and computing overhead to continuously extract and collect this data, as well as to train and update the machine-learning models. We present an online algorithm, called OSFS, that selects a small feature set from a large number of available data sources, which allows for rapid, low-overhead, and effective learning and prediction. OSFS is instantiated with a feature ranking algorithm and applies the concept of a stable feature set, which we introduce in the paper. We perform extensive, experimental evaluation of our method on data from an in-house testbed. We find that OSFS requires several hundreds measurements to reduce the number of data sources by two orders of magnitude, from which models are trained with acceptable prediction accuracy. While our method is heuristic and can be improved in many ways, the results clearly suggests that many learning tasks do not require a lengthy monitoring phase and expensive offline training.

Comments: A short version of this paper has been published at IFIP/IEEE 16th International Conference on Network and Service Management, 2-6 November 2020
Categories: cs.LG
Subjects: I.2.6
Related articles: Most relevant | Search more
arXiv:1908.06134 [cs.LG] (Published 2019-08-16)
Online Feature Selection for Activity Recognition using Reinforcement Learning with Multiple Feedback
arXiv:1902.01729 [cs.LG] (Published 2019-02-05)
Robust Regression via Online Feature Selection under Adversarial Data Corruption
arXiv:2309.05682 [cs.LG] (Published 2023-09-10)
A compendium of data sources for data science, machine learning, and artificial intelligence