Safe Exploration for Optimization with Gaussian Processes - Citegraph

Paper Info

Title
Safe Exploration for Optimization with Gaussian Processes

Abstract
We consider sequential decision problems under uncertainty, where we seek to optimize an unknown function from noisy samples. This requires balancing exploration (learning about the objective) and exploitation (localizing the maximum), a problem well-studied in the multiarmed bandit literature. In many applications, however, we require that the sampled function values exceed some prespecified \"safety\" threshold, a requirement that existing algorithms fail to meet. Examples include medical applications where patient comfort must be guaranteed, recommender systems aiming to avoid user dissatisfaction, and robotic control, where one seeks to avoid controls causing physical harm to the platform. We tackle this novel, yet rich, set of problems under the assumption that the unknown function satisfies regularity conditions expressed via a Gaussian process prior. We develop an efficient algorithm called SAFEOPT, and theoretically guarantee its convergence to a natural notion of optimum reachable under safety constraints. We evaluate SAFEOPT on synthetic data, as well as two real applications: movie recommendation, and therapeutic spinal cord stimulation.

Year	Venue	Field
2015	International Conference on Machine Learning	Recommender system,Convergence (routing),Decision problem,Computer science,Robotic control,Safety constraints,Synthetic data,Artificial intelligence,Gaussian process,Spinal cord stimulation,Machine learning
DocType	Citations	PageRank
Conference	20	1.18
References	Authors
12	4

Authors (4 rows)

Cited by (20 rows)

References (12 rows)

Name	Order	Citations	PageRank
Yanan Sui	1	30	5.85
Alkis Gotovos	2	56	3.55
Burdick, J.W.	3	2988	516.87
Andreas Krause	4	5822	368.37

1