arXiv:2007.03742 Abstract | arXiv Analytics

arXiv:2007.03742 [cs.LG]Abstract References Reviews Resources

Meta-active Learning in Probabilistically-Safe Optimization

Mariah L. Schrum, Mark Connolly, Eric Cole, Mihir Ghetiya, Robert Gross, Matthew C. Gombolay

Published 2020-07-07Version 1

Learning to control a safety-critical system with latent dynamics (e.g. for deep brain stimulation) requires taking calculated risks to gain information as efficiently as possible. To address this problem, we present a probabilistically-safe, meta-active learning approach to efficiently learn system dynamics and optimal configurations. We cast this problem as meta-learning an acquisition function, which is represented by a Long-Short Term Memory Network (LSTM) encoding sampling history. This acquisition function is meta-learned offline to learn high quality sampling strategies. We employ a mixed-integer linear program as our policy with the final, linearized layers of our LSTM acquisition function directly encoded into the objective to trade off expected information gain (e.g., improvement in the accuracy of the model of system dynamics) with the likelihood of safe control. We set a new state-of-the-art in active learning for control of a high-dimensional system with altered dynamics (i.e., a damaged aircraft), achieving a 46% increase in information gain and a 20% speedup in computation time over baselines. Furthermore, we demonstrate our system's ability to learn the optimal parameter settings for deep brain stimulation in a rat's brain while avoiding unwanted side effects (i.e., triggering seizures), outperforming prior state-of-the-art approaches with a 58% increase in information gain. Additionally, our algorithm achieves a 97% likelihood of terminating in a safe state while losing only 15% of information gain.

Comments: 9 pages

Categories: cs.LG, stat.ML

Keywords: information gain, probabilistically-safe optimization, meta-active learning, deep brain stimulation, learn high quality sampling strategies

Related articles: Most relevant | Search more

arXiv:2401.03947 [cs.LG] (Published 2024-01-08)

Guiding drones by information gain

Alouette van Hove, Kristoffer Aalstad, Norbert Pirk

arXiv:2001.08677 [cs.LG] (Published 2020-01-23)

Towards Automatic Clustering Analysis using Traces of Information Gain: The InfoGuide Method

Paulo Rocha, Diego Pinheiro, Martin Cadeiras, Carmelo Bastos-Filho

arXiv:2501.19073 [cs.LG] (Published 2025-01-31)

Pareto-frontier Entropy Search with Variational Lower Bound Maximization