142
Views
1
CrossRef citations to date
0
Altmetric
Articles

A cognitive satisficing strategy for bandit problems

&
Pages 232-242 | Received 14 Feb 2015, Accepted 18 Jul 2015, Published online: 02 Sep 2015
 

Abstract

When learning and test phases are not separated, there is a trade-off between speed and accuracy. This is a universal problem for agents acting under uncertainty. To address this trade-off, we employ a strategy called satisficing, which looks for actions that are satisfactory with respect to a given reference level. In this study, we introduce a satisficing value function, the loosely symmetric model with variable reference (LSVR) which is an extension of the loosely symmetric model inspired by some causal and perceptual human properties. We tested the performance of the LSVR in K-armed bandit problems that deal with the trade-off in the simplest possible way. Our results show that the LSVR enables effective online optimisation through satisficing.

Notes

No potential conflict of interest was reported by the authors.

This study was presented in part at the 6th International Conference on Soft Computing and Intelligent Systems–the 13th International Symposium on Advanced Intelligent Systems (SCIS-ISIS 2012), Kobe, Japan, 20–24 November 2012, and the 12th International Conference of Numerical Analysis and Applied Mathematics (ICNAAM 2014), Rhodes, Greece, 22–28 September 2014.

Additional information

Funding

Part of this work was carried out with the support of JSPS KAKENHI, [grant number 26–10453] (awarded to Y.K.), [grant number 25730150] (to T.T.); the Cooperative Research Project Program [H25/A12] (to T.T.) of the Research Institute of Electrical Communication, Tohoku University.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 763.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.