UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Mapping the unknown: The spatially correlated multi-armed bandit

Wu, CM; Schulz, E; Speekenbrink, M; Nelson, JD; Meder, B; (2017) Mapping the unknown: The spatially correlated multi-armed bandit. In: Proceedings of the 39th annual meeting of the Cognitive Science Society 2017. (pp. pp. 1357-1362). Cognitive Science Society: London,UK. Green open access

[thumbnail of wu2017mapping.pdf]
Preview
Text
wu2017mapping.pdf - Published Version

Download (7MB) | Preview

Abstract

We introduce the spatially correlated multi-armed bandit as a task coupling function learning with the explorationexploitation trade-off. Participants interacted with bi-variate reward functions on a two-dimensional grid, with the goal of either gaining the largest average score or finding the largest payoff. By providing an opportunity to learn the underlying reward function through spatial correlations, we model to what extent people form beliefs about unexplored payoffs and how that guides search behavior. Participants adapted to assigned payoff conditions, performed better in smooth than in rough environments, and—surprisingly—sometimes performed equally well in short as in long search horizons. Our modeling results indicate a preference for local search options, which when accounted for, still suggests participants were best-described as forming local inferences about unexplored regions, combined with a search strategy that directly traded off between exploiting high expected rewards and exploring to reduce uncertainty about the spatial structure of rewards.

Type: Proceedings paper
Title: Mapping the unknown: The spatially correlated multi-armed bandit
Event: Annual meeting of the Cognitive Science Society 2017
ISBN: 978-0-9911967-6-0
Open access status: An open access version is available from UCL Discovery
Publisher version: https://cognitivesciencesociety.org/wp-content/upl...
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Exploration-exploitation; Multi-armed bandits; Active Learning; Gaussian Processes;
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Experimental Psychology
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10140320
Downloads since deposit
144Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item