
A parallel framework for Bayesian reinforcement learning

Pages 7–23 | Received 01 Sep 2013, Accepted 19 Nov 2013, Published online: 13 Mar 2014

References

  • Baird, L. (1995, July 9–12). Residual algorithms: Reinforcement learning with function approximation. Proceedings of the 12th international conference on machine learning, Tahoe City, CA.
  • Chu, C.-T., Kim, S. K., Lin, Y.-A., Yu, Y. Y., Bradski, G., Ng, A. Y., et al. (2007). Map-reduce for machine learning on multicore. In B. Schölkopf, J. C. Platt, & T. Hoffman (Eds.), Advances in neural information processing systems (pp. 281–288). Cambridge, MA: MIT Press.
  • Dearden, R., Friedman, N., & Andre, D. (1999, July). Model based Bayesian exploration. Proceedings of the fifteenth conference on uncertainty in artificial intelligence (pp. 150–159). Stockholm, Sweden.
  • Doshi, P., Goodwin, R., Akkiraju, R., & Verma, K. (2005). Dynamic workflow composition using Markov decision processes. International Journal of Web Services Research, 2, 1–17. doi: 10.4018/jwsr.2005010101
  • Dutreilh, X., Rivierre, N., Moreau, A., Malenfant, J., & Truck, I. (2010). From data center resource allocation to control theory and back. In S. S. Yau & L.-J. Zhang (Eds.), 2010 IEEE 3rd international conference on cloud computing (CLOUD) (pp. 410–417). Miami, FL: IEEE.
  • Friedman, N., & Singer, Y. (1999). Efficient Bayesian parameter estimation in large discrete domains. In Advances in neural information processing systems 11: Proceedings of the 1998 conference (p. 417). Cambridge, MA: MIT Press. Retrieved from http://books.google.co.in/books?id=bMuzXPzlkG0C&printsec=frontcover&source=gbs_ge_summary_r&cad=0#v=onepage&q&f=false
  • Grounds, M., & Kudenko, D. (2008, May 12–16). Parallel reinforcement learning with linear function approximation. In Adaptive agents and multi-agent systems III: Adaptation and multi-agent learning. Estoril, Portugal.
  • Grounds, M., & Kudenko, D. (2009). Learning shaping rewards in model-based reinforcement learning. Proceedings of the AAMAS 2009 workshop on adaptive learning agents, Budapest, Hungary.
  • Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237–285.
  • Kawaguchi, K., & Araya, M. (2013). A greedy approximation of Bayesian reinforcement learning with probably optimistic transition model. Proceedings of the AAMAS 2013 workshop on adaptive learning agents, Saint Paul, MN.
  • Kretchmar, R. M. (2002, July 14–18). Parallel reinforcement learning. The 6th world conference on systemics, cybernetics, and informatics, Orlando, FL.
  • Kushida, M., Takahashi, K., Ueda, H., & Miyahara, T. (2006, December). A comparative study of parallel reinforcement learning methods with a PC cluster system. Proceedings of the IEEE/WIC/ACM international conference on intelligent agent technology (pp. 18–22). Hong Kong, China.
  • Li, Y., & Schuurmans, D. (2012). MapReduce for parallel reinforcement learning. In S. Sanner & M. Hutter (Eds.), Recent advances in reinforcement learning (pp. 309–320). Berlin, Heidelberg: Springer.
  • Littman, M. L. (1994, July 10–13). Markov games as a framework for multi-agent reinforcement learning. Proceedings of the 11th international conference on machine learning, New Brunswick, NJ.
  • Melo, F. S., Meyn, S. P., & Ribeiro, M. I. (2008, July 5–9). An analysis of reinforcement learning with function approximation. Proceedings of the 25th international conference on machine learning, Helsinki, Finland.
  • Nau, D., Ghallab, M., & Traverso, P. (2004). Automated planning: Theory & practice. San Francisco, CA: Morgan Kaufmann.
  • Poupart, P., Vlassis, N., Hoey, J., & Regan, K. (2006). An analytic solution to discrete Bayesian reinforcement learning. In W. W. Cohen & A. Moore (Eds.), Proceedings of the 23rd international conference on machine learning (pp. 697–704). New York, NY: ACM.
  • Russell, S. J., Norvig, P., Canny, J. F., Malik, J. M., & Edwards, D. D. (1995). Artificial intelligence: A modern approach (Vol. 2). Englewood Cliffs, NJ: Prentice Hall.
  • Spiegelhalter, D. J., Dawid, A. P., Lauritzen, S. L., & Cowell, R. G. (1993). Bayesian analysis in expert systems. Statistical Science, 8, 219–247. doi: 10.1214/ss/1177010888
  • Strens, M. (2000). A Bayesian framework for reinforcement learning. Proceedings of the 17th international conference on machine learning (pp. 943–950). Stanford, CA.
  • Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction (1st ed.). Cambridge, MA: MIT Press.
  • Sutton, R. S., McAllester, D. A., Singh, S. P., & Mansour, Y. (1999). Policy gradient methods for reinforcement learning with function approximation. In Advances in neural information processing systems (Vol. 12, pp. 1057–1063). Denver, CO. Retrieved from http://webdocs.cs.ualberta.ca/sutton/papers/SMSM-NIPS99.pdf
  • Watkins, C. (1989). Learning from delayed rewards (Doctoral dissertation). University of Cambridge, Cambridge, England.
  • Zinkevich, M., Weimer, M., Smola, A., & Li, L. (2010). Parallelized stochastic gradient descent. In Advances in neural information processing systems (Vol. 23, pp. 2595–2603).
