Title
Unsupervised Predictive Memory in a Goal-Directed Agent.
Abstract
Animals execute goal-directed behaviours despite the limited range and scope of their sensors. To cope, they explore environments and store memories maintaining estimates of important information that is not presently available. Recently, progress has been made with artificial intelligence (AI) agents that learn to perform tasks from sensory input, even at a human level, by merging reinforcement learning (RL) algorithms with deep neural networks, and the excitement surrounding these results has led to the pursuit of related ideas as explanations of non-human animal learning. However, we demonstrate that contemporary RL algorithms struggle to solve simple tasks when enough information is concealed from the sensors of the agent, a property called observability. An obvious requirement for handling partially observed tasks is access to extensive memory, but we show memory is not enough; it is critical that the right information be stored in the right format. We develop a model, the Memory, RL, and Inference Network (MERLIN), in which memory formation is guided by a process of predictive modeling. MERLIN facilitates the solution of tasks in 3D virtual reality environments for which partial observability is severe and memories must be maintained over long durations. Our model demonstrates a single learning agent architecture that can solve canonical behavioural tasks in psychology and neurobiology without strong simplifying assumptions about the dimensionality of sensory input or the duration of experiences.
Year
Venue
Field
2018
arXiv: Learning
Observability,Architecture,Virtual reality,Inference,Curse of dimensionality,Artificial intelligence,Merge (version control),Deep neural networks,Mathematics,Machine learning,Reinforcement learning
DocType
Volume
Citations 
Journal
abs/1803.10760
13
PageRank 
References 
Authors
0.87
7
24
Name
Order
Citations
PageRank
Greg Wayne159231.86
Chia-Chun Hung2403.35
Amos David3615.40
Mehdi Mirza4170386.14
Arun Ahuja5727.45
Agnieszka Grabska-Barwińska627210.12
Jack Rae7758.77
Piotr W. Mirowski817813.09
Leibo, Joel Z.929921.41
Adam Santoro1043820.37
Mevlana Gemici11231.87
Malcolm Reynolds12131.21
Tim Harley1378427.43
Josh Abramson14161.58
Shakir Mohamed15153871.62
Danilo Jimenez Rezende16156781.67
David Saxton171125.81
Adam Cain18212.29
Chloe Hillier191504.77
David Silver208252363.86
Koray Kavukcuoglu2110189504.11
Matthew M Botvinick2249425.34
Demis Hassabis234924191.12
Timothy P. Lillicrap244377170.65