IIE Transactions Volume 34, 2002 - Issue 9

Original Articles

A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking

ABHIJIT GOSAVI College of Engineering, Industrial Engineering Program, University of Southern Colorado, Pueblo, CO, 81001, USA E-mail: [email protected]

NAVEEN BANDLA Sabre Technologies Inc, Southlake, TX, 76092, USA E-mail: [email protected]

TAPAS K. DAS Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL, 33620, USA E-mail: [email protected]

Pages 729-742 | Received 01 Oct 2000, Accepted 01 Dec 2001, Published online: 17 Apr 2007

Cite this article
https://doi.org/10.1080/07408170208928908

References

Abounadi, J. (1998) Stochastic approximation for non-expansive maps: application to Q-learning algorithms. PhD thesis, MIT, Cambridge, MA.
Bandla, N. (1998) Airline yield management using a reinforcement learning approach. Unpublished Master's thesis, Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL.
Bellman, R. (1954) The theory of dynamic programming. Bulletin of the American Mathematical Society, 60, 503–516.
Belobaba, P. P. (1989) Application of a probabilistic decision model to airline seat inventory control. Operations Research, 37, 183–197.
Bertsekas, D. P. (1995) Dynamic Programming and Optimal Control, Vol. II, Athena Scientific, Belmont, MA.
Bertsekas, D. and Tsitsiklis, J. (1996) Neuro-Dynamic Programming, Athena Scientific, Belmont, MA.
Brumelle, S. L. and McGill, J. I. (1993) Airline seat allocation with multiple nested fare classes. Operations Research, 41, 127–137.
Chatwin, R. E. (1998) Multiperiod airline overbooking with a single fare class. Operations Research, 46(6), 805–819.
Curry, R. E. (1990) Optimal airline seat allocation with fare classes nested by origins and destinations. Transportation Science, 24, 193–204.
Darken, C., Chang, J. and Moody, J. (1992) Learning rate schedules for faster stochastic gradient search, in Neural Networks for Signal Processing 2 - Proceedings of the 1992 IEEE Workshop, White, D. A. and Sofge, D. A. (eds), IEEE Press, Piscataway, NJ.
Das, T. K., Gosavi, A., Mahadevan, S. and Marchalleck, N. (1999) Solving semi-Markov decision problems using average reward reinforcement learning. Management Science, 45(4), 560–574.
Davis, P. (1994) Airline ties profitability yield to management. SIAM News, 27(5).
Glover, F., Glover, R., Lorenzo, J. and McMillan, C. (1982) The passenger-mix problem in the scheduled airlines. Interfaces, 12, 73–79.
Gosavi, A. (1999) An algorithm for solving semi-Markov decision problems using reinforcement learning: convergence analysis and numerical results. PhD thesis, University of South Florida, Tampa, FL.
Higle, J. L. and Sen, S. (1991) Stochastic decomposition: an algorithm for two-stage linear programs with recourse. Mathematics of Operations Research, 16, 650–669.
Howard, R. (1960) Dynamic Programming and Markov Processes, MIT Press, Cambridge, MA.
Howard, R. (1971) Dynamic Probabilistic Systems, Volume II: Semi-Markov and Decision Processes, John Wiley and Sons, New York, NY, p. 976 onwards.
Littlewood, K. (1972) Forecasting and control of passenger bookings, in Proceedings of the 12th AGIFORS Symposium, Nathanya, Israel, pp. 95–117.
Martinez, R. and Sanchez, M. (1970) Automatic booking level control, in Proceedings of the 10th AGIFORS Symposium, Terrigal, Australia, pp. 1–20.
McGill, J. I. and Van Ryzin, G. J. (1999) Revenue management: research overview and prospects. Transportation Science, 33(2), 233–256.
Puterman, M. L. (1994) Markov Decision Processes, Wiley Interscience, New York, NY.
Robbins, H. and Monro, S. (1951) A stochastic approximation method. Annals of Mathematical Statistics, 22, 400–407.
Robinson, L. W. (1995) Optimal and approximate control policies for airline booking with sequential non monotonic fare classes. Operations Research, 43, 252–263.
Shapiro, A. (2000) Stochastic programming by Monte Carlo methods. Preprint, Georgia Institute of Technology, Atlanta, GA.
Smith, B. C., Leimkuhler, J. F. and Darrow, R. M. (1992) Yield management at American Airlines. Interfaces, 22, 8–31.
Subramanian, J., Stidham, Jr, S. and Lautenbacher, C. J. (1999) Airline yield management with overbooking, cancellations and no-shows. Transportation Science, 33(2), 147–167.
Sutton, R. S. (1988) Learning to predict by the methods of temporal differences. Machine Learning, 3, 9–44.
Sutton, R. and Barto, A. G. (1998) Reinforcement Learning, The MIT Press, Cambridge, MA.
Talluri, K. T. and Van Ryzin, G. J. (1999) Bid-price controls for network revenue management. Management Science, 44, 1577–1593.
Thomson, H. R. (1961) Statistical problems in airline reservation control. Operational Research Quarterly, 12, 167–185.
Van Ryzin, G. J. and McGill, J. I. (2000) Revenue management without forecasting or optimization: an adaptive algorithm for determining seat protection levels. Management Science, 46(6), 760–775.
Watkins, C. J. (1989) Learning from delayed rewards. PhD thesis, King's College, Cambridge, UK.
Wheeler, R. and Narendra, K. (1986) Decentralized learning in finite Markov chains. IEEE Transactions on Automatic Control, 31(6), 373–376.
Wollmer, R. D. (1992) An airline seat management model for a single leg route when lower fare classes book first. Operations Research, 40, 26–37.
