Title
Acceleration of game learning with prediction-based reinforcement learning: toward the emergence of planning behavior
Abstract
When humans solve a problem, it is unlikely that they use only the current state of the problem to decide upon an action. It is difficult to explain the human action decision strategy by means of the state to action model, which is the major method used in conventional reinforcement learning (RL). On the contrary, humans appear to predict a future state through the use of past experience and decide upon an action based on that predicted state. In this paper, we propose a prediction-based RL model (PRLmodel). In the PRL model, a state prediction module and an action memory module are added to an actor-critic type RL, and the system predicts and evaluates a future state from a current one based on an expected value table. Then, the system chooses a point of action decision in order to perform the appropriate action. To evaluate the proposed model, we perform a computer simulation using a simple ping pong game. We also discuss the possibility that the PRL model may represent an evolutional change in conventional RL as well as a step toward modeling of hmuan planning behavior, because state prediction and its evaluation are the basic elements of planning in symbolic AI.
Year
DOI
Venue
2003
10.1007/3-540-44989-2_94
ICANN
Keywords
Field
DocType
action memory module,state prediction module,human action decision strategy,appropriate action,action model,future state,prediction-based reinforcement learning,prl model,state prediction,action decision,current state,computer simulation,expected value,reinforcement learning
State prediction,Computer science,Decision strategy,Expected value,Artificial intelligence,Acceleration,Machine learning,Memory module,Reinforcement learning,Ping pong
Conference
Volume
ISSN
ISBN
2714
0302-9743
3-540-40408-2
Citations 
PageRank 
References 
1
0.36
3
Authors
4
Name
Order
Citations
PageRank
Yu Ohigashi110.70
Takashi Omori221.41
Koji Morikawa3133.67
Natsuki Oka44114.18