Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Exploration and recency as the main proximate causes of probability matching: a reinforcement learning analysis

Full text
Author(s):
da Silva, Carolina Feher [1] ; Victorino, Camila Gomes [2] ; Caticha, Nestor [1] ; Baldo, Marcus Vinicius Chrysostomo [2]
Total Authors: 4
Affiliation:
[1] Univ Sao Paulo, Inst Phys, Dept Gen Phys, Rua Matao 1371, Cidade Univ, BR-05508090 Sao Paulo, SP - Brazil
[2] Univ Sao Paulo, Inst Biomed Sci, Dept Physiol & Biophys, Av Prof Lineu Prestes 1524, ICB 1, Cidade Univ, BR-05508000 Sao Paulo, SP - Brazil
Total Affiliations: 2
Document type: Journal article
Source: SCIENTIFIC REPORTS; v. 7, NOV 10 2017.
Web of Science Citations: 0
Abstract

Research has not yet reached a consensus on why humans match probabilities instead of maximise in a probability learning task. The most influential explanation is that they search for patterns in the random sequence of outcomes. Other explanations, such as expectation matching, are plausible, but do not consider how reinforcement learning shapes people's choices. We aimed to quantify how human performance in a probability learning task is affected by pattern search and reinforcement learning. We collected behavioural data from 84 young adult participants who performed a probability learning task wherein the majority outcome was rewarded with 0.7 probability, and analysed the data using a reinforcement learning model that searches for patterns. Model simulations indicated that pattern search, exploration, recency (discounting early experiences), and forgetting may impair performance. Our analysis estimated that 85% (95% HDI {[}76, 94]) of participants searched for patterns and believed that each trial outcome depended on one or two previous ones. The estimated impact of pattern search on performance was, however, only 6%, while those of exploration and recency were 19% and 13% respectively. This suggests that probability matching is caused by uncertainty about how outcomes are generated, which leads to pattern search, exploration, and recency. (AU)

FAPESP's process: 13/10694-0 - The role of reward and the basal ganglia in decision making
Grantee:Carolina Feher da Silva
Support Opportunities: Scholarships in Brazil - Post-Doctoral
FAPESP's process: 13/13352-2 - Repeated binary choices as a model for decision-making in non-spatial perceptual dimensions
Grantee:Marcus Vinícius Chrysóstomo Baldo
Support Opportunities: Scholarships abroad - Research