Mathematische Operationsforschung und Statistik. Series Optimization Volume 15, 1984 - Issue 3

Original Articles

On iterative optimization of structured Markov decision processes with discounted rewards

Marce Hendrikx Dept. Math. & Comp. Sci., University of Technology, Eindhoven

Jo van Nunen Graduate School of Management Delft, Delft, P.A, 2612

Jaap Wessels Dept. Math. & Comp. Sci., University of Technology, Eindhoven

Pages 439-459 | Received 01 May 1982, Published online: 04 Mar 2011

https://doi.org/10.1080/02331938408842960

References

Bartmann, D. 1979. A method of bisection for discounted Markov decision problems. Zeitschrift für Oper. Res., 23: 275–287.
Bellman, R. 1957. A Markovian decision process. J. Math. Mech., 6: 679–684.
De Ghellinck, G.T. and Eppen, G.D. 1967. Linear programming solutions for separable Markov decision problems. Management Sci., 6: 371–394.
Blackwell, D. 1965. Discounted dynamic programming. Ann. Math. Statist., 36: 226–235.
Denardo, E.V. 1968. Separable Markovian decision problems. Management Sci., 14: 279–289.
D'Epenoux, F. 1960. Sur un problème de production et de stockage dans l'aléatoire. Rev. Franc. Rech. Opér., 14: 3–16.
Hastings, N.A.J. 1968. Some notes on dynamic programming and replacement. Oper. Res. Q., 19: 453–464.
Hastings, N.A.J. 1976. A test for nonoptimal actions in undiscounted finite Markov decision chains. Management Sci., 23: 87–91.
Hastings, N.A.J. and Van Nunen, J.A.E.E. 1977. “The action elimination algorithm for Markov decision processes”. In Markov decision theory, Edited by: Tijms, H. and Wessels, J. 161–170. Amsterdam: Mathematical Centre. MC-tract 93.
Howard, R.A. 1960. Dynamic programming and Markov decision processes, Cambridge, Mass.: M.I.T. Press.
Hübner, G. Improved procedures for eliminating suboptimal actions in Markov programming by the use of contraction properties. Transactions of the 7th Prague Conference on Information Theory, Statistical Decision Functions and Random Processes. pp. 257–263. Prague: Academia.
Johansen, S.G. and Stidham, S. Jr. 1980. Control of Arrivals to a Stochastic Input-Output System. Adv. Appl. Prob., 12: 972–999.
MacQueen, J. 1966. A modified dynamic programming method for Markovian decision problems. J. Math. Anal. Appl., 14: 38–43.
MacQueen, J. 1967. A test for suboptimal actions in Markovian decision problems. Oper. Res., 15: 559–561.
Manne, A.S. 1960. Linear programming and sequential decisions. Management Sci., 6: 259–267.
Morton, T.E. and Wecker, W. 1977. Discounting, ergodicity and convergence for Markov decision processes. Management Sci., 23: 890–900.
Morton, T.E. 1971. Undiscounted Markov renewal programming via modified successive approximations. Oper. Res., 19: 1081–1089.
Van Nunen, J.A.E.E. 1976. Contracting Markov decision processes, Amsterdam: Mathematical Centre. (Mathematical Centre Tract 71).
Van Nunen, J.A.E.E. 1976. A set of successive approximation methods for discounted Markovian decision problems. Zeitschrift für Oper. Res., 20: 203–208.
Van Nunen, J.A.E.E. 1981. Action dependent stopping times and Markov decision processes with unbounded rewards. O.R.-Spectrum, 8: 145–152.
Van Nunen, J.A.E.E. 1983. On computing optimal policies for G/M/S Queuing systems. Management Sci., 29: 725–734.
Porteus, E.L. 1971. Some bounds for discounted sequential decision processes. Management Sci., 18: 7–11.
Porteus, E.L. 1975. Bounds and transformations for discounted finite Markov decision chains. Oper. Res., 23: 161–184.
Porteus, E.L. 1978. Improved iterative computation of the expected discounted return in Markov and semi-Markov chains, Stanford University. Research paper no. 344.
Puterman, M.L. and Shin, M.C. 1978. Modified policy iteration algorithms for discounted Markov decision problems. Management Sci., 24: 1127–1137.
Reetz, D. 1973. Solution of a Markovian decision problem by overrelaxation. Z. Oper. Res., 17: 28–32.
Scarf, H. 1960. “The Optimality of (S, s) Policies in the Dynamic Inventory Problem”. In Mathematical Methods in the Social Sciences, Edited by: Arrow, K.J., Karlin, S. and Suppes, P. Stanford: Stanford University Press. Chap. 13.
Sebastian, H.J. and Sieber, N. 1981. Diskrete Dynamische Optimierung, Leipzig: Akademische Verlagsgesellschaft Geest & Portig K.-G.
Stidham, S. Jr. 1981. On the Convergence of Successive Approximations in Dynamic Programming with Non-Zero Terminal Reward. Z. für Opns. Res., 25: 57–77.
Stidham, S. and Wijngaard, J. 1983. Skip free Markov decision processes, Report, North Carolina State University. To appear.
Wessels, J. 1977. Stopping times and Markov programming. Transactions of the 7th Prague Conference on Information Theory, Statistical Decision Functions, Random Processes. pp. 575–585. Prague: Academia.
Wessels, J. Markov decision processes: Implementation aspects. Memorandum COSOR 80-14, Eindhoven: Eindhoven University of Technology. Department of Mathematics.
Geilleit, R.A.A.M. “A heuristic method for an inventory problem with two stages and two final products”. In Department of Industrial Engineering, Eindhoven University of Technology. Research Report (1981) (to appear).
Wijngaard, J. Aug 1972. An inventory problem with constrained order capacity, Eindhoven University of Technology. TH-report 72-wsk-03.
