Abstract | ||
---|---|---|
We present the first computationally-efficient algorithm with $widetilde O(sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics. By that, we resolve an open question of Abbasi-Yadkori and Szepesvu0027ari (2011) and Dean, Mania, Matni, Recht, and Tu (2018). |
Year | Venue | DocType |
---|---|---|
2019 | arXiv: Learning | Journal |
Volume | Citations | PageRank |
abs/1902.06223 | 0 | 0.34 |
References | Authors | |
14 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alon Cohen | 1 | 11 | 5.28 |
Tomer Koren | 2 | 182 | 22.99 |
Yishay Mansour | 3 | 6211 | 745.95 |