International Journal of Systems Science Volume 51, 2020 - Issue 15

Regular papers

Understanding the mechanism of human–computer game: a distributed reinforcement learning perspective

Zhinan Peng, School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, People's Republic of China

Jiangping Hu, School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, People's Republic of China (ORCID: https://orcid.org/0000-0002-7559-8604)

Yiyi Zhao, School of Business Administration, Southwestern University of Finance and Economics, Chengdu, People's Republic of China. Correspondence: [email protected]

Bijoy K. Ghosh, Department of Mathematics and Statistics, Texas Tech University, Lubbock, TX, USA

Pages 2837-2848 | Received 14 Apr 2019, Accepted 26 Jul 2020, Published online: 13 Aug 2020

https://doi.org/10.1080/00207721.2020.1803436

References

Abouheaf, M. I., Lewis, F. L., Vamvoudakis, K. G., Haesaert, S., & Babuska, R. (2014). Multi-agent discrete-time graphical games and reinforcement learning solutions. Automatica, 50(12), 3038–3053. https://doi.org/10.1016/j.automatica.2014.10.047
Al-Tamimi, A., Lewis, F. L., & Abu-Khalaf, M. (2008). Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 38(4), 943–949. https://doi.org/10.1109/TSMCB.2008.926614
Altafini, C. (2013). Consensus problems on networks with antagonistic interactions. IEEE Transactions on Automatic Control, 58(4), 935–946. https://doi.org/10.1109/TAC.2012.2224251
Bansal, T., Pachocki, J., Sidor, S., Sutskever, I., & Mordatch, I. (2017). Emergent complexity via multi-agent competition. arXiv:1710.03748.
Bao, H., Wuyun, Q., & Banzhaf, W. (2018). Evolution of cooperation through genetic collective learning and imitation in multiagent societies. In Artificial Life Conference Proceedings (pp. 436–443). MIT Press.
Camerer, C. F. (2011). Behavioral game theory: Experiments in strategic interaction. Princeton University Press.
Gibney, E. (2016). Go players react to computer defeat. Nature News. https://doi.org/10.1038/nature.2016.19255
Gleave, A., Dennis, M., Wild, C., Kant, N., Levine, S., & Russell, S. (2019). Adversarial policies: Attacking deep reinforcement learning. arXiv:1905.10615.
Haim, G., Gal, Y., An, B., & Kraus, S. (2017). Human-computer negotiation in a three player market setting. Artificial Intelligence, 246, 34–52. https://doi.org/10.1016/j.artint.2017.01.003
Hassabis, D. (2017). Artificial intelligence: Chess match of the century. Nature, 544(7651), 413–414. https://doi.org/10.1038/544413a
Hu, J., Wu, Y., Li, T., & Ghosh, B. K. (2019). Consensus control of general linear multi-agent systems with antagonistic interactions and communication noises. IEEE Transactions on Automatic Control, 64(5), 2122–2127. https://doi.org/10.1109/TAC.2018.2872197
Hu, J., & Zheng, W. X. (2014). Emergent collective behaviors on coopetition networks. Physics Letters A, 378(26–27), 1787–1796. https://doi.org/10.1016/j.physleta.2014.04.070
Lewis, F. L., & Liu, D. (2013). Reinforcement learning and approximate dynamic programming for feedback control. Wiley.
Lima, S. L. (2002). Putting predators back into behavioral predator-prey interactions. Trends in Ecology and Evolution, 17(2), 70–75. https://doi.org/10.1016/S0169-5347(01)02393-X
Liu, D. R., & Wei, Q. L. (2014). Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems. IEEE Transactions on Neural Networks and Learning Systems, 25(3), 621–634. https://doi.org/10.1109/TNNLS.2013.2281663
Ma, H. W., Liu, D. R., Wang, D., & Luo, B. (2016). Bipartite output consensus in networked multi-agent systems of high-order power integrators with signed digraph and input noises. International Journal of Systems Science, 47(13), 3116–3131. https://doi.org/10.1080/00207721.2015.1090039
Murray, J. J., Cox, C. J., Lendaris, G. G., & Saeks, R. (2002). Adaptive dynamic programming. IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 32(2), 140–153. https://doi.org/10.1109/TSMCC.2002.801727
von Neumann, J., & Morgenstern, O. (2007). Theory of games and economic behavior. Princeton University Press.
Peng, Z., Hu, J., & Ghosh, B. K. (2020). Data-driven containment control of discrete-time multi-agent systems via value iteration. Science China Information Sciences, 63(8), 189205. https://doi.org/10.1007/s11432-018-9671-2
Peng, Z., Zhao, Y., Hu, J., & Ghosh, B. K. (2019). Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm. Information Sciences, 481, 189–202. https://doi.org/10.1016/j.ins.2018.12.079
Si, J., & Wang, Y. T. (2001). Online learning control by association and reinforcement. IEEE Transactions on Neural Networks, 12(2), 264–276. https://doi.org/10.1109/72.914523
Ware, A. (2009). The dynamics of two-party politics: Party structures and the management of competition, comparative politics. Oxford University Press.
Zhang, H. P., Yue, D., Dou, C. X., Zhao, W., & Xie, X. P. (2019). Data-driven distributed optimal consensus control for unknown multiagent systems with input-delay. IEEE Transactions on Cybernetics, 49(6), 2095–2105. https://doi.org/10.1109/TCYB.2018.2819695
Zhao, D. B., Xia, Z. P., & Wang, D. (2015). Model-free optimal control for affine nonlinear systems with convergence analysis. IEEE Transactions on Automation Science and Engineering, 12(4), 1461–1468. https://doi.org/10.1109/TASE.2014.2348991
Zhong, X. N., & He, H. B. (2020). GrHDP solution for optimal consensus control of multiagent discrete-time systems. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 50(7), 2362–2374. https://doi.org/10.1109/TSMC.2018.2814018
