Original Articles

Dual Control Strategies for Discrete State Markov Processes Part II

Pages 317-330 | Received 12 Sep 1967, Published online: 27 Feb 2007
 

ABSTRACT

This paper constitutes the second part of an investigation of the dual control of long-duration stationary ergodic discrete Markov processes. In it the concept of decision space is introduced as a theoretical framework within which the performances of different control strategies may be compared. The evolution of a strategy is described in terms of a decision trajectory which converges by descending a ‘hill of uncertainty’. A study of the trajectory of the ideal strategy leads to a generalization of the definition of optimality. The existence of a whole class of sub-optimal strategies is demonstrated; it is shown that the optimal strategy of Part I is a member of the class, and that any other member may be optimal also under certain conditions. Two examples of sub-optimal strategies are presented, one involving the estimation of confidence intervals, the other using a learning reinforcement technique.

Notes

†Communicated by the Author.
