Title
Joint Model Choice and Hyperparameter Optimization with Factorized Multilayer Perceptrons
Abstract
Recent work has demonstrated that hyperparameter optimization within the sequential model-based optimization (SMBO) framework is generally possible. This approach replaces the expensive-to-evaluate function that maps hyperparameters to the performance of a learned model on validation data by a surrogate model which is much cheaper to evaluate. The current state of the art in hyperparameter optimization learns these surrogate models across a variety of solved data sets where a grid search has already been employed. In this way, surrogate models are learned across data sets, and thus able to generalize better. However, meta features that describe characteristics of a data set are usually needed in order for the surrogate model to differentiate between same hyperparameter configurations on different data sets. Another research area that is closely related focuses on model choice, i.e. picking the right model for a given task, which is also a problem that many practitioners face in machine learning. In this paper, we aim to solve both of these problems with a unified surrogate model that learns across different data sets, different classifiers and their respective hyperparameters. We employ factorized multilayer perceptrons, a surrogate model that consists of a multilayer perceptron architecture, but offers the prediction of a factorization machine in the first layer. In this way, data sets, models and hyperparameters are being represented in a joint lower dimensional latent feature space. Experiments on a publicly available meta data set containing 59 individual data sets and 19 prediction models demonstrate the efficiency of our approach.
Year
DOI
Venue
2015
10.1109/ICTAI.2015.24
IEEE International Conference on Tools with Artificial Intelligence
Keywords
Field
DocType
Hyperparameter Optimization, Model Choice, Sequential Model-Based Optimization
Hyperparameter optimization,Data modeling,Data set,Feature vector,Hyperparameter,Pattern recognition,Computer science,Surrogate model,Multilayer perceptron,Artificial intelligence,Perceptron,Machine learning
Conference
ISSN
Citations 
PageRank 
1082-3409
0
0.34
References 
Authors
20
4
Name
Order
Citations
PageRank
Nicolas Schilling1999.24
Martin Wistuba215419.66
Lucas Drumond339524.27
Lars Schmidt-Thieme43802216.58