640
Views
22
CrossRef citations to date
0
Altmetric
Original Articles

Intelligent dynamic control of stochastic economic lot scheduling by agent-based reinforcement learning

, &
Pages 4381-4395 | Received 12 Oct 2010, Accepted 27 Apr 2011, Published online: 07 Jul 2011

References

  • Anupindi , R and Tayur , S . 1998 . Managing stochastic multiproduct systems: Model, measures, and analysis . Operations Research , 46 ( 3 ) : s98 – s111 .
  • Bianchi , RAC , Ribeiro , CHC and Costa , AHR . 2004 . “ Heuristically accelerated Q-learning: A new approach to speed up reinforcement learning ” . In Lecture notes in computer science , Berlin : Springer .
  • Bomberger , E . 1966 . A dynamic programming approach to a lot size scheduling problem . Management Science , 12 ( 1 ) : 778 – 784 .
  • Bourland , KE and Yano , CA . 1994 . The strategic use of capacity slack in the economic lot scheduling problem with random demands . Management Science , 40 ( 12 ) : 181 – 200 .
  • Cascon , A , Leachman , RC and Lefrancois , P . 1994 . Multi-item, single-machine scheduling problem with stochastic demands: A comparison of heuristics . International Journal of Production Research , 32 ( 3 ) : 583 – 596 .
  • Crasman , SE , Olsen , TL and Birge , JR . 2008 . Setting basestock levels in multiproduct systems with setups and random yield . IIE Transactions , 40 ( 12 ) : 1158 – 1170 .
  • Das , TK . 1999 . Solving semi-markov decision problems using average reward reinforcement learning . Management Science , 45 ( 4 ) : 560 – 574 .
  • Delporte , C and Thomas , L . 1978 . Lot sizing and sequencing for N products on one facility . Management Science , 23 ( 10 ) : 1070 – 1079 .
  • Elmaghraby , SE . 1978 . The economic lot scheduling problem (ELSP): Review and extensions . Management Science , 24 ( 6 ) : 587 – 598 .
  • Erkip , N , Gullu , R and Kocabiyikogl , A . 2000 . A quasi-birth-and-death model to evaluate fixed cycle time policies for stochastic muti-item production/inventory problem . Proceedings of MSOM conference . 2000 . Ann Harbor
  • Federgruen , A and Katalan , Z . 1994 . Approximating queue size and waiting time distributions in general polling systems . Queueing Systems , 18 ( 3–4 ) : 353 – 386 .
  • Federgruen , A and Katalan , Z . 1998 . Determining production schedules under base-stock policies in single facility multi-item production systems . Operations Research , 46 ( 6 ) : 883 – 893 .
  • Feldmann , M and Biskup , D . 2003 . Single-machine scheduling for minimizing earliness and tardiness penalties by meta-heuristic approaches . Computers and Industrial Engineering , 44 ( 2 ) : 307 – 323 .
  • Fransoo , JC , Sridharan , V and Bertrand , JWM . 1995 . A hierarchical approach for capacity coordination in multiple products single-machine production systems with stationary stochastic demands . European Journal of Operational Research , 86 ( 1 ) : 57 – 72 .
  • Gallego , G . 1990a . An extension to the class of easy economic lot scheduling problem easy? . IIE Transactions , 22 ( 2 ) : 189 – 190 .
  • Gallego , G . 1990b . Scheduling the production of several items with random demands in a single facility . Management Science , 36 ( 12 ) : 1579 – 1592 .
  • Gallego , G . 1994 . When is a base stock policy optimal in recovering disrupted cyclic schedules? . Naval Research Logistics , 41 ( 3 ) : 317 – 333 .
  • Gallego , G and Roundy , R . 1992 . The economic lot scheduling problem with finite back-order costs . Naval Research Logistics Quarterly , 39 ( 5 ) : 729 – 739 .
  • Giezenaar, R.B.L.M., 1997. Ontwerp voor een productie- en voorraadstrategie voor deproductieplant Laurox bij AKZO Nobel Chemicals te Deventer. Thesis (Masters), University of Twente
  • Gosavi , A . 2004 . Reinforcement learning for long-run average cost . European Journal of Operations Research , 155 ( 3 ) : 654 – 674 .
  • Harrison , JM . 1988 . “ Brownian models of queuing networks with heterogeneous customer populations ” . In Stochastic differential systems, stochastic control theory and applications , Edited by: Fleming , W. and Lions , P.L. 147 – 186 . New York : Springer .
  • Hsu , W . 1983 . On the general feasibility test of scheduling lot sizes for several products on one machine . Management Science , 29 ( 1 ) : 93 – 105 .
  • Jones , P and Inman , R . 1989 . When is the economic lot scheduling problem easy? . IIE Transactions , 21 ( 1 ) : 11 – 20 .
  • Kaelbling , LP , Littman , ML and Moore , AP . 1996 . Reinforcement learning: A survey . Journal of Artificial Intelligence Research , 4 ( 1 ) : 237 – 285 .
  • Khouja , M , Michalewicz , Z and Wilmot , M . 1998 . The use of genetic algoritms to solve the economic lot size scheduling problem . European Journal of Operational Research , 110 ( 3 ) : 509 – 524 .
  • Leachman , RC and Gascon , A . 1988 . A heuristic scheduling policy for multi-item, single-machine production systems with time-varying, stochastic demands . Management Science , 34 ( 3 ) : 377 – 390 .
  • Markowitz, D.M., Reiman, M.I., and Wein, L.M., 1995. The stochastic economic lot scheduling problem: Heavy traffic analysis of dynamic cyclic policies. Working paper 3863-95-MSA, Sloan School of Management, MIT, Cambridge, MA
  • Maxwell , WL . 1964 . The scheduling of economic lot sizes . Naval Research Logistics Quarterly , 11 ( 2/3 ) : 89 – 124 .
  • Paternina-Arboleda , CD and Das , TK . 2001 . Intelligent dynamic control of single-product serial production lines . IIE Transactions , 33 ( 1 ) : 65 – 77 .
  • Paternina-Arboleda , CD and Das , TK . 2005 . A multi-agent reinforcement learning approach to obtaining dynamic control policies for stochastic lot scheduling problem . Simulation Modelling Practice and Theory , 13 ( 5 ) : 389 – 406 .
  • Qiu , J and Loulou , R . 1995 . Multiproduct production/inventory control under random demands . IEEE Transactions on Automatic Control , 40 ( 2 ) : 350 – 356 .
  • Raza , SA , Akgunduz , A and Chen , MY . 2006 . A tabu search algorithm for solving economic lot scheduling problem . Journal of Heuristics , 12 ( 6 ) : 413 – 426 .
  • Salomon , M . 1991 . Deterministic lot sizing models for production planning (Lecture Notes in Economics and Mathematical Systems) , Berlin : Springer .
  • Sox , CR and Muckstadt , JA . 1997 . Optimization-based planning for the stochastic lot scheduling problem . IIE Transactions , 29 ( 5 ) : 349 – 357 .
  • Sox , CR . 1999 . A review of the stochastic lot scheduling problem . International Journal of Production Economics , 62 ( 3 ) : 181 – 200 .
  • Wagner , BJ and Davis , DJ . 2002 . A search heuristic for the sequence-dependent economic lot scheduling . European Journal of Operational Research , 141 ( 1 ) : 133 – 146 .
  • Wagner , M and Smits , SR . 2004 . A local search algorithm for the optimization of the stochastic economic lot scheduling problem . International Journal of Production Economics , 90 ( 3 ) : 391 – 402 .
  • Watkins , CJ . 1989 . Learning from delayed rewards. Thesis (PhD) , Cambridge : Kings College .
  • Wein , ML . 1992 . Dynamic scheduling of a multiclass make-to-stock queue . Operations Research , 40 ( 4 ) : 724 – 735 .
  • Winands , E , Adan , I and van Houtum , GJ . 2011 . The stochastic economic lot scheduling problem: A survey . European Journal of Operational Research , 201 ( 1 ) : 1 – 9 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.