Abstract
When the learning and test phases are not separated, an agent faces a trade-off between speed and accuracy. This is a universal problem for agents acting under uncertainty. To address this trade-off, we employ a strategy called satisficing, which seeks actions that are satisfactory with respect to a given reference level. In this study, we introduce a satisficing value function, the loosely symmetric model with variable reference (LSVR), which extends the loosely symmetric model inspired by properties of human causal and perceptual cognition. We tested the performance of the LSVR in K-armed bandit problems, which embody the trade-off in its simplest form. Our results show that the LSVR enables effective online optimisation through satisficing.
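To make the idea of satisficing in a bandit setting concrete, the following is a minimal sketch of a *generic* reference-level policy, not the LSVR itself: the agent exploits any arm whose empirical value meets an aspiration (reference) level and otherwise keeps exploring. The function name, the `aspiration` parameter, and the arm probabilities are all illustrative assumptions, not taken from the paper.

```python
import random

def satisficing_bandit(true_means, aspiration, steps=10000, seed=0):
    """Run a simple satisficing policy on a K-armed Bernoulli bandit.

    Illustrative sketch only (not the authors' LSVR): the agent keeps
    an empirical mean for each arm, exploits the best arm whose
    estimate meets the aspiration (reference) level, and explores in
    round-robin order when no arm is currently satisfactory.
    """
    rng = random.Random(seed)
    k = len(true_means)
    counts = [0] * k      # pulls per arm
    values = [0.0] * k    # empirical mean reward per arm
    total = 0.0
    for t in range(steps):
        satisfactory = [a for a in range(k)
                        if counts[a] > 0 and values[a] >= aspiration]
        if satisfactory:
            arm = max(satisfactory, key=lambda a: values[a])
        else:
            arm = t % k  # no satisfactory arm yet: keep exploring
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]
        total += reward
    return total / steps

# With the aspiration set between the best and second-best arm,
# the agent should settle on the optimal arm without exhaustive
# exploration of the alternatives.
rate = satisficing_bandit([0.2, 0.5, 0.8], aspiration=0.65)
```

Setting the reference level is the crux: too low and the agent settles on a suboptimal arm, too high and it never stops exploring, which is why the LSVR's variable reference is the paper's focus.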
Notes
No potential conflict of interest was reported by the authors.
This study was presented in part at the 6th International Conference on Soft Computing and Intelligent Systems–13th International Symposium on Advanced Intelligent Systems (SCIS-ISIS 2012), Kobe, Japan, 20–24 November 2012, and at the 12th International Conference on Numerical Analysis and Applied Mathematics (ICNAAM 2014), Rhodes, Greece, 22–28 September 2014.