Weighted Likelihood Policy Search with Model Selection. - Citegraph

Paper Info

Title
Weighted Likelihood Policy Search with Model Selection.

Abstract
Reinforcement learning (RL) methods based on direct policy search (DPS) have been actively discussed to achieve an efficient approach to complicated Markov decision processes (MDPs). Although they have brought much progress in practical applications of RL, there still remains an unsolved problem in DPS related to model selection for the policy. In this paper, we propose a novel DPS method, {\it weighted likelihood policy search (WLPS)}, where a policy is efficiently learned through the weighted likelihood estimation. WLPS naturally connects DPS to the statistical inference problem and thus various sophisticated techniques in statistics can be applied to DPS problems directly. Hence, by following the idea of the {\it information criterion}, we develop a new measurement for model comparison in DPS based on the weighted log-likelihood.

Year	Venue	Field
2012	NIPS	Computer science,Model selection,Markov decision process,Statistical inference,Artificial intelligence,Machine learning,Reinforcement learning
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
8	4

Authors (4 rows)

Cited by (0 rows)

References (8 rows)

Name	Order	Citations	PageRank
Tsuyoshi Ueno	1	14	4.37
Hayashi, Kohei	2	159	15.31
Takashi Washio	3	1775	190.58
Kawahara, Yoshinobu	4	317	31.30

1