Robust policy search with applications to safe vehicle navigation. - Citegraph

Paper Info

Title
Robust policy search with applications to safe vehicle navigation.

Abstract
This work studies the design of reliable control laws of robotic systems operating in uncertain environments. We introduce a new approach to stochastic policy optimization based on probably approximately correct (PAC) bounds on the expected performance of control policies. An algorithm is constructed which directly minimizes an upper confidence bound on the expected cost of trajectories instead of employing a standard approach based on the expected cost itself. This algorithm thus has built-in robustness to uncertainty, since the bound can be regarded as a certificate for guaranteed future performance. The approach is evaluated on two challenging robot control scenarios in simulation: a car with side slip and a quadrotor navigating through obstacle-ridden environments. We show that the bound accurately predicts future performance and results in improved robustness measured by lower average cost and lower probability of collision. The performance of the technique is studied empirically and compared to several existing policy search algorithms.

Year	DOI	Venue
2017	10.1109/ICRA.2017.7989271	ICRA
Field	DocType	Volume
Robot control,Approximation algorithm,Mathematical optimization,Search algorithm,Probably approximately correct learning,Computer science,Control theory,Stochastic process,Robustness (computer science),Average cost,Control engineering,Collision	Conference	2017
Issue	Citations	PageRank
1	1	0.36
References	Authors
13	3

Authors (3 rows)

Cited by (1 rows)

References (13 rows)

Name	Order	Citations	PageRank
Matthew Sheckells	1	3	1.78
Gowtham Garimella	2	3	1.10
Marin Kobilarov	3	103	14.28

1