TU Berlin

Neuronale InformationsverarbeitungMaschinelles Lernen

Neuronale Informationsverarbeitung


zur Navigation

Maschnielles Lernen


Neural Systems for Choice and Valuation with Counterfactual Learning Signals
Zitatschlüssel Tobia14b
Autor Tobia, M. J. and Guo, R. and Schwarze, U. and Böhmer, W. and Gläscher, J. and Finckh, B. and Marschner, A. and Büchel, C. and Obermayer, K. and Sommer, T.
Seiten 57-69
Jahr 2014
ISBN 978-3-319-23527-1, 978-3-319-23528-8
DOI 10.1007/978-3-319-23528-8_8
Journal NeuroImage
Jahrgang 89
Verlag Elsevier
Serie Lecture Notes in Computer Science
Zusammenfassung The purpose of this experiment was to test a computational model of reinforcement learning with and without fictive prediction error (FPE) signals to investigate how counterfactual consequences contribute to acquired representations of action-specific expected value, and to determine the functional neuroanatomy and neuromodulator systems that are involved. 80 male participants underwent dietary depletion of either tryptophan or tyrosine/phenylalanine to manipulate serotonin (5HT) and dopamine (DA), respectively. They completed 80 rounds (240 trials) of a strategic sequential investment task that required accepting interim losses in order to access a lucrative state and maximize long-term gains, while being scanned. We extended the standard Q-learning model by incorporating both counterfactual gains and losses into separate error signals. The FPE model explained the participants' data significantly better than a model that did not include counterfactual learning signals. Expected value from the FPE model was significantly correlated with BOLD signal change in the ventromedial prefrontal cortex (vmPFC) and posterior orbitofrontal cortex (OFC), whereas expected value from the standard model did not predict changes in neural activity. The depletion procedure revealed significantly different neural responses to expected value in the vmPFC, caudate, and dopaminergic midbrain in the vicinity of the substantia nigra (SN). Differences in neural activity were not evident in the standard Q-learning computational model. These findings demonstrate that FPE signals are an important component of valuation for decision making, and that the neural representation of expected value incorporates cortical and subcortical structures via interactions among serotonergic and dopaminergic modulator systems.
Link zur Publikation Download Bibtex Eintrag



Schnellnavigation zur Seite über Nummerneingabe