Title
Safe Exploration for Active Learning with Gaussian Processes
Abstract
In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.
Year
DOI
Venue
2015
10.1007/978-3-319-23461-8_9
ECML/PKDD
Field
DocType
Volume
Mathematical optimization,Active learning,Upper and lower bounds,Computer science,Robustness (computer science),Gaussian process,Artificial intelligence,Differential entropy,Classifier (linguistics),Pendulum,Decision boundary,Machine learning
Conference
9286
ISSN
Citations 
PageRank 
0302-9743
14
0.87
References 
Authors
13
6
Name
Order
Citations
PageRank
Jens Schreiter1211.83
duy nguyentuong243826.22
Mona Eberts3140.87
Bastian Bischoff415410.64
Heiner Markert5535.97
marc toussaint6129997.23