Bayesian Nonparametric Methods for Partially-Observable Reinforcement Learning. - Citegraph

Paper Info

Title
Bayesian Nonparametric Methods for Partially-Observable Reinforcement Learning.

Abstract
Making intelligent decisions from incomplete information is critical in many applications: for example, robots must choose actions based on imperfect sensors, and speech-based interfaces must infer a user's needs from noisy microphone inputs. What makes these tasks hard is that often we do not have a natural representation with which to model the domain and use for choosing actions; we must learn about the domain's properties while simultaneously performing the task. Learning a representation also involves trade-offs between modeling the data that we have seen previously and being able to make predictions about new data. This article explores learning representations of stochastic systems using Bayesian nonparametric statistics. Bayesian nonparametric methods allow the sophistication of a representation to scale gracefully with the complexity in the data. Our main contribution is a careful empirical evaluation of how representations learned using Bayesian nonparametric methods compare to other standard learning approaches, especially in support of planning and control. We show that the Bayesian aspects of the methods result in achieving state-of-the-art performance in decision making with relatively few samples, while the nonparametric aspects often result in fewer computations. These results hold across a variety of different techniques for choosing actions given a representation.

Year	DOI	Venue
2015	535AB9F9-6117-4ACE-9583-6FE422F7AA6C	IEEE transactions on pattern analysis and machine intelligence
Keywords	Field	DocType
reinforcement learning,artificial intelligence,partially-observable markov decision process,hdp-hmm,pomdp,hierarchial dirichlet process hidden markov model,machine learning,history,hidden markov models,learning artificial intelligence,markov processes,computational modeling,partially observable markov decision process,knowledge representation	Knowledge representation and reasoning,Markov process,Pattern recognition,Computer science,Partially observable Markov decision process,Nonparametric statistics,Artificial intelligence,Hidden Markov model,Machine learning,Complete information,Reinforcement learning,Bayesian probability	Journal
Volume	Issue	ISSN
37	2	1939-3539
Citations	PageRank	References
10	0.53	27
Authors
4

Authors (4 rows)

Cited by (10 rows)

References (27 rows)

Name	Order	Citations	PageRank
finale doshivelez	1	574	51.99
Pfau, David	2	80	6.76
Frank D. Wood	3	10	0.53
Nicholas Roy	4	3644	288.27

1