Abstract |
---|
In this paper we present a technique for estimating policies that combines instance-based learning and reinforcement learning algorithms in Markovian environments. The approach was developed to speed up the convergence of adaptive intelligent agents that use reinforcement learning algorithms. Speeding up the learning of an intelligent agent is a complex task, since an inadequate choice of updating technique may delay the learning process or even induce an unexpected acceleration that leads the agent to a non-satisfactory policy. Experimental results in real-world scenarios show that the proposed technique speeds up the convergence of the agents while achieving optimal policies, overcoming problems of classical reinforcement learning approaches. |
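
The abstract does not spell out how the instance-based and reinforcement-learning components are combined, so the sketch below is only one plausible illustration of the general idea rather than the authors' method: a tabular Q-learning agent (Q-learning and Markov decision processes appear among the keywords) that seeds the estimate of a not-yet-visited state from its nearest stored instance, which is one way instance-based learning can shorten the early exploration phase. The class name, the numeric-tuple state encoding, and the Euclidean distance are hypothetical choices, not taken from the paper.

```python
import math
import random
from collections import defaultdict

class InstanceBasedQLearner:
    """Tabular Q-learning whose estimates for unvisited states are seeded
    from the nearest previously visited state (instance-based bootstrap).
    States are assumed to be numeric tuples; this is an illustrative sketch."""

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.actions = actions            # list of action identifiers
        self.alpha = alpha                # learning rate
        self.gamma = gamma                # discount factor
        self.epsilon = epsilon            # exploration probability
        self.q = defaultdict(lambda: defaultdict(float))  # Q[state][action]
        self.instances = []               # states already visited, kept as instances

    def _nearest_instance(self, state):
        # Euclidean distance over numeric state vectors (hypothetical encoding).
        if not self.instances:
            return None
        return min(self.instances, key=lambda s: math.dist(state, s))

    def q_value(self, state, action):
        if state in self.q:
            return self.q[state][action]
        # Unseen state: borrow the estimate of the closest stored instance.
        neighbour = self._nearest_instance(state)
        return self.q[neighbour][action] if neighbour is not None else 0.0

    def choose_action(self, state):
        # Epsilon-greedy selection over the (possibly bootstrapped) Q-values.
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q_value(state, a))

    def update(self, state, action, reward, next_state):
        # Standard Q-learning target; q_value() falls back to the nearest
        # instance when next_state has not been visited yet.
        best_next = max(self.q_value(next_state, a) for a in self.actions)
        target = reward + self.gamma * best_next
        current = self.q_value(state, action)
        self.q[state][action] = current + self.alpha * (target - current)
        if state not in self.instances:
            self.instances.append(state)
```

A caller would instantiate the learner with the environment's action set, call `choose_action` each step, and feed the observed transition back through `update`; only the nearest-neighbour bootstrap in `q_value` distinguishes this sketch from plain Q-learning.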
Year | DOI | Venue
---|---|---|
2012 | 10.1007/978-3-642-40654-6_11 | Lecture Notes in Business Information Processing

Keywords | Field | DocType
---|---|---|
Reinforcement learning, Dynamic environments, Adaptive agents | Intelligent agent, Markov process, Instance-based learning, Active learning (machine learning), Computer science, Markov decision process, Q-learning, Algorithm, Unsupervised learning, Artificial intelligence, Machine learning, Reinforcement learning | Conference

Volume | ISSN | Citations
---|---|---|
141 | 1865-1348 | 0

PageRank | References | Authors
---|---|---|
0.34 | 25 | 5

Name | Order | Citations | PageRank
---|---|---|---|
Richardson Ribeiro | 1 | 43 | 11.12 |
Fábio Favarim | 2 | 8 | 1.95 |
Marco A. C. Barbosa | 3 | 2 | 2.06 |
Alessandro L. Koerich | 4 | 525 | 39.59 |
Fabrício Enembreck | 5 | 274 | 38.42 |