arXiv Analytics

Sign in

arXiv:2210.08886 [math.OC]AbstractReferencesReviewsResources

Regret Bounds for Learning Decentralized Linear Quadratic Regulator with Partially Nested Information Structure

Lintao Ye, Ming Chi, Vijay Gupta

Published 2022-10-17Version 1

We study the problem of learning decentralized linear quadratic regulator under a partially nested information constraint, when the system model is unknown a priori. We propose an online learning algorithm that adaptively designs a control policy as new data samples from a single system trajectory become available. Our algorithm design uses a disturbance-feedback representation of state-feedback controllers coupled with online convex optimization with memory and delayed feedback. We show that our online algorithm yields a controller that satisfies the desired information constraint and enjoys an expected regret that scales as $\sqrt{T}$ with the time horizon $T$.

Related articles: Most relevant | Search more
arXiv:2110.07112 [math.OC] (Published 2021-10-14, updated 2022-05-27)
On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure
arXiv:1907.12125 [math.OC] (Published 2019-07-28)
Decentralized Stochastic Control in Partially Nested Information Structures
arXiv:2310.06194 [math.OC] (Published 2023-10-09)
Stability and Regret bounds on Distributed Truncated Predictive Control for Networked Dynamical Systems