429
Views
13
CrossRef citations to date
0
Altmetric
General Paper

Comparing reinforcement learning approaches for solving game theoretic models: a dynamic airline pricing game example

&
Pages 1165-1173 | Received 01 Dec 2009, Accepted 01 Jun 2011, Published online: 21 Dec 2017

References

  • AhmedAHPoojariCAAn overview of the issues in the airline industry and the role of optimization models and algorithmsJ Opl Res Soc20085926727710.1057/palgrave.jors.2602350
  • AxelrodRThe Complexity of Cooperation: Agent-based Models of Competition and Collaboration1997
  • BellmanROn the theory of dynamic programmingProc Natl Acad Sci USA19523871671910.1073/pnas.38.8.716
  • BertsekasDPTsitsiklisJNNeuro-dynamic Programming1996
  • BinmoreKEssays on the Foundations of Game Theory1990
  • BinmoreKFun and Games: A Text on Game Theory1992
  • Boyd EA (2007). Perspectives on the future of pricing. Presented at 7th Annual INFORMS Pricing and Revenue Management Conference, Barcelona, Spain.
  • BridleJSTraining stochastic model recognition algorithms as networks can lead to maximum mutual information estimation of parametersAdvances in Neural Information Processing Systems: Proceedings of the 1989 Conference1990211217
  • BryantJWDrama theory: Dispelling the mythsJ Opl Res Soc20075860261310.1057/palgrave.jors.2602239
  • Chen X and Deng X (2005). Settling the complexity of 2-player Nash-Equilibrium. In Electronic Colloquium on Computational Complexity, 140(TR05), http://eccc.hpi-web.de/report/2005/140/ accessed 1 June 2011.
  • ChenXZhanFBAgent-based modelling and simulation of urban evacuation: Relative effectiveness of simultaneous and staged evacuation strategiesJ Opl Res Soc200859253310.1057/palgrave.jors.2602321
  • Collins AJ (2009). Evaluating reinforcement learning for game theory application: Learning to price airline seats under competition. Ph.D. thesis, University of Southampton.
  • Collins AJ, Pullum F and Kenyon L (2003). Applications of game theory in defence project: Year one report. Dstl/CR07880, Defence Science and Technology Laboratories, Ministry of Defence, United Kingdom (UNCLASSIFIED).
  • CurrieCChengRCHSmithHKDynamic pricing of airline tickets with competitionJ Opl Res Soc2008591026103710.1057/palgrave.jors.2602425
  • EatwellJMilgateMNewmanPThe New Palgrave: Game Theory1987
  • FudenbergDLevineDKThe Theory of Learning in Games1998
  • GosaviASimulation-based Optimization: Parametric Optimization Techniques and Reinforcement Learning. Operations Research/Computer Science Interfaces Series2003
  • HarsanyiJCSeltenRA General Theory of Equilibrium Selection in Games1988
  • KimJSKwakTCGame theoretic analysis of the bargaining process over a long-term replenishment contractJ Opl Res Soc20075876977810.1057/palgrave.jors.2602183
  • KolmogorovANSulla determinazione empirica di una legge di distribuzioneGiornale dell’Istituto Italiano degli Attuari193348391
  • LemkeCEHowsonJJTEquilibrium points of bimatrix gamesSIAM J Appl Math19641241342310.1137/0112033
  • LeslieAMLearning: Association or computation? Introduction to a special sectionCurr Dir Psychol Sci200110412412710.1111/1467-8721.00131
  • LeslieDSCollinsEJIndividual q-learning in normal form gamesSIAM J Control Optim20054449541410.1137/S0363012903437976
  • LuceRDIndividual Choice Behaviour1959
  • MannorSShammaJMulti-agent learning for engineersArtificial Intelligence200717141742210.1016/j.artint.2007.01.003
  • ManskiCFThe structure of random utility modelsTheory and Decision1977822925410.1007/BF00133443
  • McKelvey RD, McLennan AM and Turocy TL (2007). Gambit: Software tools for Game Theory – Version 0.2007.01.30, http://econweb.tamu.edu/gambit, accessed 10 September 2008.
  • MichieDChambersRABOXES: An experiment in adaptive controlMachine Intelligence1968137152
  • Minsky ML (1954). Theory of neural-analog reinforcement systems and its application to the brain-model problem. Ph.D. dissertation, Princeton University.
  • NashJNon-cooperative gamesAnn Math19515428629510.2307/1969529
  • NorthMJMacalCMManaging Business Complexity: Discovering Strategic Solutions with Agent-based Modeling and Simulation2007
  • PiddMTools for Thinking: Modelling in Management Science1996
  • Rummery GA (1995). Problem solving with reinforcement learning. Ph.D. thesis, Cambridge University.
  • Shoham Y, Powers R and Grenager T (2004). Multi-agent reinforcement learning: A critical survey. Presented at the AAAI Fall Symposium on Artificial Multi-agent Learning. Washington DC, USA.
  • SuttonRSBartoAGReinforcement Learning: An Introduction1998
  • TalluriKTvan RyzinGJTheory and Practice of Revenue Management2004
  • ThorndikeELAnimal Intelligence1911
  • von StackelbergHFMarktform und Gleichgewicht1934
  • von StengelBComputing equilibria for two-person gamesHandbook of Game Theory with Economic Applications200217231759
  • Watkins CJCH (1989). Learning from delayed rewards. Ph.D. thesis, Cambridge University.
  • WautersTVerbeeckKVanden BergheGDe CausmaeckerPLearning agents for the multi-mode project scheduling problemJ Opl Res Soc20116228129010.1057/jors.2010.101

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.