Title
Finite Time Analysis of Optimal Adaptive Policies for Linear-Quadratic Systems.
Abstract
We consider the classical problem of control of linear systems with quadratic cost. When the true system dynamics are unknown, an adaptive policy is required for learning the model parameters and planning a control policy simultaneously. Addressing this trade-off between accurate estimation and good control represents the main challenge in the area of adaptive control. Another important issue is to prevent the system becoming destabilized due to lack of knowledge of its dynamics. Asymptotically optimal approaches have been extensively studied in the literature, but there are very few non-asymptotic results which also do not provide a comprehensive treatment of the problem. In this work, we establish finite time high probability regret bounds that are optimal up to logarithmic factors. We also provide high probability guarantees for a stabilization algorithm based on random linear feedbacks. The results are obtained under very mild assumptions, requiring: (i) stabilizability of the matrices encoding the systemu0027s dynamics, and (ii) degree of heaviness of the noise distribution. To derive our results, we also introduce a number of new concepts and technical tools.
Year
Venue
Field
2017
arXiv: Systems and Control
Mathematical optimization,Linear system,Regret,Matrix (mathematics),Control theory,System dynamics,Adaptive control,Logarithm,Asymptotically optimal algorithm,Mathematics,Encoding (memory)
DocType
Volume
Citations 
Journal
abs/1711.07230
4
PageRank 
References 
Authors
0.47
2
3
Name
Order
Citations
PageRank
Mohamad Kazem Shirani Faradonbeh1245.96
Ambuj Tewari2137199.22
George Michailidis3495.88