
A cognitive satisficing strategy for bandit problems

Pages 232-242 | Received 14 Feb 2015, Accepted 18 Jul 2015, Published online: 02 Sep 2015
 

Abstract

When learning and test phases are not separated, there is a trade-off between speed and accuracy. This is a universal problem for agents acting under uncertainty. To address this trade-off, we employ a strategy called satisficing, which looks for actions that are satisfactory with respect to a given reference level. In this study, we introduce a satisficing value function, the loosely symmetric model with variable reference (LSVR), which is an extension of the loosely symmetric model inspired by some causal and perceptual human properties. We tested the performance of the LSVR in K-armed bandit problems, which deal with the trade-off in the simplest possible way. Our results show that the LSVR enables effective online optimisation through satisficing.
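As a rough illustration of satisficing in a K-armed bandit, the sketch below exploits an arm once its empirical mean reward reaches a fixed reference level and explores uniformly at random otherwise. It is not the LSVR, whose value function is not given in the abstract. The function name `satisficing_bandit`, the Bernoulli reward model, the uniform exploration rule, and all parameters are assumptions made for illustration only.

```python
import random


def satisficing_bandit(probs, reference, steps, seed=0):
    """Toy satisficing policy on a Bernoulli K-armed bandit (NOT the LSVR).

    probs     -- assumed true reward probability of each arm (hypothetical)
    reference -- reference level an arm's empirical mean must reach
    steps     -- number of pulls to simulate
    """
    rng = random.Random(seed)
    k = len(probs)
    pulls = [0] * k   # how often each arm was chosen
    wins = [0] * k    # how often each arm paid out
    total_reward = 0
    for _ in range(steps):
        # Empirical mean reward per arm; untried arms are scored 0.0 here.
        means = [wins[i] / pulls[i] if pulls[i] else 0.0 for i in range(k)]
        best = max(range(k), key=lambda i: means[i])
        if pulls[best] > 0 and means[best] >= reference:
            arm = best               # a satisfactory arm exists: exploit it
        else:
            arm = rng.randrange(k)   # nothing satisfactory yet: explore
        reward = 1 if rng.random() < probs[arm] else 0
        pulls[arm] += 1
        wins[arm] += reward
        total_reward += reward
    return pulls, total_reward


if __name__ == "__main__":
    pulls, total = satisficing_bandit([0.2, 0.5, 0.8], reference=0.7, steps=1000)
    print(pulls, total)
```

With a reference level between the best and second-best reward probabilities, this toy policy tends to settle on the best arm; a reference level set too low can let it settle on a merely adequate arm, which is the speed-accuracy trade-off the abstract refers to.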

Notes

No potential conflict of interest was reported by the authors.

This study was presented in part at the 6th International Conference on Soft Computing and Intelligent Systems–the 13th International Symposium on Advanced Intelligent Systems (SCIS-ISIS 2012), Kobe, Japan, 20–24 November 2012, and the 12th International Conference of Numerical Analysis and Applied Mathematics (ICNAAM 2014), Rhodes, Greece, 22–28 September 2014.

Additional information

Funding

Part of this work was supported by JSPS KAKENHI [grant number 26–10453] (awarded to Y.K.) and [grant number 25730150] (to T.T.), and by the Cooperative Research Project Program [H25/A12] (to T.T.) of the Research Institute of Electrical Communication, Tohoku University.
