Estimation and control in discounted stochastic dynamic programming

Pages 51-71 | Published online: 15 Jun 2010
