Search in:

Advanced search

IIE Transactions Volume 36, 2004 - Issue 4

Submit an article Journal homepage

365

Views

CrossRef citations to date

Altmetric

Original Articles

A reinforcement learning approach to stochastic business games

KIRAN KUMAR RAVULAPATI Delta Technology, Atlanta, GA, 30354, USA E-mail: irankumar@delt

JAIDEEP RAO Pilgrim Software, Tampa, FL, 33618, USA E-mail: aoj@pilgrimsof

TAPAS K. DAS Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL, 33620, USA E-mail: [email protected]

Pages 373-385 | Received 01 Feb 2001, Accepted 01 Nov 2003, Published online: 17 Aug 2010

Cite this article
https://doi.org/10.1080/07408170490278698

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions

References

Abounadi , J. , Bertsekas , D. and Borkar , V. S. 1998 . Learning algorithms for Markov decision processes with average cost report. LIDS-P-2434 , Cambridge, MA : Laboratory for Information and Decision Systems, MIT .
Google Scholar
Anupindi , R. , Bassok , Y. and Zemel , E. 2001 . A general framework for the study of decentralized distribution systems . Journal of Manufacturing and Service Operations Management , : 4
Google Scholar
Bellman , R. E. 1957 . Dynamic Programming , Princeton, NJ : Princeton University Press .
Google Scholar
Bertsekas , D. and Tsitsiklis , J. 1996 . Neurodynamic Programming , Belmont, MA : Athena Scientific .
Google Scholar
Darken , C. , Chang , J. and Moody , J. 1992 . “ Learning rate schedules for faster stochastic gradient search ” . In Neural Networks for Signal Processing 2—Proceedings of the 1992 IEEE Workshop , Edited by: White , D. A. and Sofge , D. A. Piscataway, NJ : IEEE Press .
Google Scholar
Das , T. K. , Gosavi , A. , Mahadevan , S. and Marchalleck , N. 1999 . Solving semi-Markov decision problems using average reward reinforcement learning . Management Science , 45 ( 4 ) : 560 – 574 .
Web of Science ®Google Scholar
Erev , I. and Roth , A. E. 1998 . Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria . The American Economic Review , 88 ( 4 ) : 848 – 881 .
Web of Science ®Google Scholar
Filar , J. and Vrieze , K. 1997 . Competitive Markov Decision Processes , New York, NY : Springer-Verlag .
Google Scholar
Gosavi , A. 2004 . “ Reinforcement learning for long-run average cost ” . In European Journal of Operations Research to appear
Google Scholar
Gosavi , A. , Bandla , N. and Das , T. K. 2002 . A reinforcement learning approach to airline seat allocation for multiple fare classes with overbooking . IIE Transactions , 34 ( 9 ) : 729 – 742 .
Web of Science ®Google Scholar
Hu , J. and Wellman , M. P. 1998 . “ Multi-agent reinforcement learning: theoretical framework and an algorithm ” . In Proceedings of the 15th International Conference on Machine Learning 242 – 250 .
Google Scholar
Li , J. and Das , T. K. 2003 . Learning Nash equilibrium for average reward irreducible stochastic games , Tampa, FL : University of South Florida . Working paper, Department of Industrial and Management Systems Engineering
Google Scholar
Littman , M. L. 1994 . “ Markov games as a framework for multi-agent reinforcement learning ” . In Proceedings of the 11th International Conference on Machine Learning 157 – 163 .
Google Scholar
Nash , J. F. 1951 . Non-cooperative games . Annals of Mathematics , 54 : 286 – 295 .
Web of Science ®Google Scholar
Owen , G. 1975 . On the core of linear production games . Mathamatical Programming , 9 : 358 – 370 .
Web of Science ®Google Scholar
Paternina , C. D. and Das , T. K. 2000 . Intelligent dynamic control policies for serial production lines . IIE Transactions , 33 ( 1 ) : 65 – 77 .
Web of Science ®Google Scholar
Puterman , M. L. 1994 . Markov Decision Processes , New York, NY : Wiley .
Google Scholar
Ripley , B. D. 1996 . Pattern Recognition and Neural Networks , Oxford, UK : Cambridge University Press .
Google Scholar
Robbins , H. and Monro , S. 1951 . A stochastic approximation method . Annals of Mathematical and Statistics , 22 : 400 – 407 .
Google Scholar
Shapley , L. and Shubik , M. 1975 . Competitive outcomes in the core of market games Technical report R-1692-NSF, The Rand Corporation
Google Scholar
Sutton , R. S. and Barto , A. 1998 . Reinforcement Learning , Cambridge, MA : MIT Press .
Google Scholar
Van der Lann , G. , Talman , A. J. J. and Van der Heyden , L. 1987 . “ Simplicial variable dimension algorithms for solving the nonlinear complimentary problem on a product of unit simplices using a general labeling ” . In Mathematics of Operations Research 377 – 397 .
Google Scholar
Van Roy , B. 1998 . Learning and value function approximation in complex decision processes , Cambridge, MA : Laboratory for Information and Decision Systems, MIT . Ph.D. thesis
Google Scholar
Watkins , C. J. C. H. 1989 . Learning from delayed rewards , Cambridge, UK : Cambridge University . Ph.D. thesis
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

A reinforcement learning approach to stochastic business games

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

A reinforcement learning approach to stochastic business games

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date