Safe Exploration for Active Learning with Gaussian Processes - Citegraph

Paper Info

Title
Safe Exploration for Active Learning with Gaussian Processes

Abstract
In this paper, the problem of safe exploration in the active learning context is considered. Safe exploration is especially important for data sampling from technical and industrial systems, e.g. combustion engines and gas turbines, where critical and unsafe measurements need to be avoided. The objective is to learn data-based regression models from such technical systems using a limited budget of measured, i.e. labelled, points while ensuring that critical regions of the considered systems are avoided during measurements. We propose an approach for learning such models and exploring new data regions based on Gaussian processes (GP's). In particular, we employ a problem specific GP classifier to identify safe and unsafe regions, while using a differential entropy criterion for exploring relevant data regions. A theoretical analysis is shown for the proposed algorithm, where we provide an upper bound for the probability of failure. To demonstrate the efficiency and robustness of our safe exploration scheme in the active learning setting, we test the approach on a policy exploration task for the inverse pendulum hold up problem.

Year	DOI	Venue
2015	10.1007/978-3-319-23461-8_9	ECML/PKDD
Field	DocType	Volume
Mathematical optimization,Active learning,Upper and lower bounds,Computer science,Robustness (computer science),Gaussian process,Artificial intelligence,Differential entropy,Classifier (linguistics),Pendulum,Decision boundary,Machine learning	Conference	9286
ISSN	Citations	PageRank
0302-9743	14	0.87
References	Authors
13	6

Authors (6 rows)

Cited by (14 rows)

References (13 rows)

Name	Order	Citations	PageRank
Jens Schreiter	1	21	1.83
duy nguyentuong	2	438	26.22
Mona Eberts	3	14	0.87
Bastian Bischoff	4	154	10.64
Heiner Markert	5	53	5.97
marc toussaint	6	1299	97.23

1