Modelling Individual Differences In The Form Of Pavlovian Conditioned Approach Responses: A Dual Learning Systems Approach With Factored Representations - Citegraph

Paper Info

Title
Modelling Individual Differences In The Form Of Pavlovian Conditioned Approach Responses: A Dual Learning Systems Approach With Factored Representations

Abstract
Reinforcement Learning has greatly influenced models of conditioning, providing powerful explanations of acquired behaviour and underlying physiological observations. However, in recent autoshaping experiments in rats, variation in the form of Pavlovian conditioned responses (CRs) and associated dopamine activity, have questioned the classical hypothesis that phasic dopamine activity corresponds to a reward prediction error-like signal arising from a classical Model-Free system, necessary for Pavlovian conditioning. Over the course of Pavlovian conditioning using food as the unconditioned stimulus (US), some rats (sign-trackers) come to approach and engage the conditioned stimulus (CS) itself - a lever - more and more avidly, whereas other rats (goal-trackers) learn to approach the location of food delivery upon CS presentation. Importantly, although both sign-trackers and goal-trackers learn the CS-US association equally well, only in sign-trackers does phasic dopamine activity show classical reward prediction error-like bursts. Furthermore, neither the acquisition nor the expression of a goal-tracking CR is dopamine-dependent. Here we present a computational model that can account for such individual variations. We show that a combination of a Model-Based system and a revised Model-Free system can account for the development of distinct CRs in rats. Moreover, we show that revising a classical Model-Free system to individually process stimuli by using factored representations can explain why classical dopaminergic patterns may be observed for some rats and not for others depending on the CR they develop. In addition, the model can account for other behavioural and pharmacological results obtained using the same, or similar, autoshaping procedures. Finally, the model makes it possible to draw a set of experimental predictions that may be verified in a modified experimental protocol. We suggest that further investigation of factored representations in computational neuroscience studies may be useful.Author SummaryAcquisition of responses towards full predictors of rewards, namely Pavlovian conditioning, has long been explained using the reinforcement learning theory. This theory formalizes learning processes that, by attributing values to situations and actions, makes it possible to direct behaviours towards rewarding objectives. Interestingly, the implied mechanisms rely on a reinforcement signal that parallels the activity of dopamine neurons in such experiments. However, recent studies challenged the classical view of explaining Pavlovian conditioning with a single process. When presented with a lever whose retraction preceded the delivery of food, some rats started to chew and bite the food magazine whereas others chew and bite the lever, even if no interactions were necessary to get the food. These differences were also visible in brain activity and when tested with drugs, suggesting the coexistence of multiple systems. We present a computational model that extends the classical theory to account for these data. Interestingly, we can draw predictions from this model that may be experimentally verified. Inspired by mechanisms used to model instrumental behaviours, where actions are required to get rewards, and advanced Pavlovian behaviours (such as overexpectation, negative patterning), it offers an entry point to start modelling the strong interactions observed between them.

Year	DOI	Venue
2014	10.1371/journal.pcbi.1003466	PLOS COMPUTATIONAL BIOLOGY
Keywords	Field	DocType
bioinformatics,biomedical research	Computational neuroscience,Neuroscience,Biology,Shaping,Dopaminergic,Conditioning,Artificial intelligence,Stimulus (physiology),Bioinformatics,PVLV,Classical conditioning,Reinforcement learning	Journal
Volume	Issue	ISSN
10	2	1553-7358
Citations	PageRank	References
3	0.45	15
Authors
5

Authors (5 rows)

Cited by (3 rows)

References (15 rows)

Name	Order	Citations	PageRank
Florian Lesaint	1	3	0.45
Olivier Sigaud	2	539	53.35
Shelly B Flagel	3	3	0.45
Terry E Robinson	4	3	1.13
Khamassi Mehdi	5	112	16.51

1