Original Articles

A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking

Pages 729-742 | Received 01 Oct 2000, Accepted 01 Dec 2001, Published online: 17 Apr 2007

References

  • Abounadi, J. (1998) Stochastic approximation for non-expansive maps: application to Q-learning algorithms. PhD thesis, MIT, Cambridge, MA.
  • Bandla, N. (1998) Airline yield management using a reinforcement learning approach. Unpublished Master's thesis, Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL.
  • Bellman, R. (1954) The theory of dynamic programming. Bulletin of the American Mathematical Society, 60, 503–516.
  • Belobaba, P. P. (1989) Application of a probabilistic decision model to airline seat inventory control. Operations Research, 37, 183–197.
  • Bertsekas, D. P. (1995) Dynamic Programming and Optimal Control, Vol. II, Athena Scientific, Belmont, MA.
  • Bertsekas, D. and Tsitsiklis, J. (1996) Neuro-Dynamic Programming, Athena Scientific, Belmont, MA.
  • Brumelle, S. L. and McGill, J. I. (1993) Airline seat allocation with multiple nested fare classes. Operations Research, 41, 127–137.
  • Chatwin, R. E. (1998) Multiperiod airline overbooking with a single fare class. Operations Research, 46(6), 805–819.
  • Curry, R. E. (1990) Optimal airline seat allocation with fare classes nested by origins and destinations. Transportation Science, 24, 193–204.
  • Darken, C., Chang, J. and Moody, J. (1992) Learning rate schedules for faster stochastic gradient search, in Neural Networks for Signal Processing 2 - Proceedings of the 1992 IEEE Workshop, White, D. A. and Sofge, D. A. (eds), IEEE Press, Piscataway, NJ.
  • Das, T. K., Gosavi, A., Mahadevan, S. and Marchalleck, N. (1999) Solving semi-Markov decision problems using average reward reinforcement learning. Management Science, 45(4), 560–574.
  • Davis, P. (1994) Airlines tie profitability to yield management. SIAM News, 27(5).
  • Glover, F., Glover, R., Lorenzo, J. and McMillan, C. (1982) The passenger-mix problem in the scheduled airlines. Interfaces, 12, 73–79.
  • Gosavi, A. (1999) An algorithm for solving semi-Markov decision problems using reinforcement learning: convergence analysis and numerical results. PhD thesis, University of South Florida, Tampa, FL.
  • Higle, J. L. and Sen, S. (1991) Stochastic decomposition: an algorithm for two-stage linear programs with recourse. Mathematics of Operations Research, 16, 650–669.
  • Howard, R. (1960) Dynamic Programming and Markov Processes, MIT Press, Cambridge, MA.
  • Howard, R. (1971) Dynamic Probabilistic Systems, Volume II: Semi-Markov and Decision Processes, John Wiley and Sons, New York, NY, p. 976 onwards.
  • Littlewood, K. (1972) Forecasting and control of passenger bookings, in Proceedings of the 12th AGIFORS Symposium, Nathanya, Israel, pp. 95–117.
  • Martinez, R. and Sanchez, M. (1970) Automatic booking level control, in Proceedings of the 10th AGIFORS Symposium, Terrigal, Australia, pp. 1–20.
  • McGill, J. I. and Van Ryzin, G. J. (1999) Revenue management: research overview and prospects. Transportation Science, 33(2), 233–256.
  • Puterman, M. L. (1994) Markov Decision Processes, Wiley Interscience, New York, NY.
  • Robbins, H. and Monro, S. (1951) A stochastic approximation method. Annals of Mathematical Statistics, 22, 400–407.
  • Robinson, L. W. (1995) Optimal and approximate control policies for airline booking with sequential non monotonic fare classes. Operations Research, 43, 252–263.
  • Shapiro, A. (2000) Stochastic programming by Monte Carlo methods. Preprint, Georgia Institute of Technology, Atlanta, GA.
  • Smith, B. C., Leimkuhler, J. F. and Darrow, R. M. (1992) Yield management at American Airlines. Interfaces, 22, 8–31.
  • Subramanian, J., Stidham, S., Jr. and Lautenbacher, C. J. (1999) Airline yield management with overbooking, cancellations and no-shows. Transportation Science, 33(2), 147–167.
  • Sutton, R. S. (1988) Learning to predict by the methods of temporal differences. Machine Learning, 3, 9–44.
  • Sutton, R. and Barto, A. G. (1998) Reinforcement Learning, The MIT Press, Cambridge, MA.
  • Talluri, K. T. and Van Ryzin, G. J. (1999) Bid-price controls for network revenue management. Management Science, 44, 1577–1593.
  • Thomson, H. R. (1961) Statistical problems in airline reservation control. Operational Research Quarterly, 12, 167–185.
  • Van Ryzin, G. J. and McGill, J. I. (2000) Revenue management without forecasting or optimization: an adaptive algorithm for determining seat protection levels. Management Science, 46(6), 760–775.
  • Watkins, C. J. (1989) Learning from delayed rewards. PhD thesis, King's College, Cambridge, UK.
  • Wheeler, R. and Narendra, K. (1986) Decentralized learning in finite Markov chains. IEEE Transactions on Automatic Control, 31(6), 373–376.
  • Wollmer, R. D. (1992) An airline seat management model for a single leg route when lower fare classes book first. Operations Research, 40, 26–37.
