References
- Abouheaf, M. I., Lewis, F. L., Vamvoudakis, K. G., Haesaert, S., & Babuska, R. (2014). Multi-agent discrete-time graphical games and reinforcement learning solutions. Automatica, 50(12), 3038–3053. https://doi.org/10.1016/j.automatica.2014.10.047
- Al-Tamimi, A., Lewis, F. L., & Abu-Khalaf, M. (2008). Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 38(4), 943–949. https://doi.org/10.1109/TSMCB.2008.926614
- Altafini, C. (2013). Consensus problems on networks with antagonistic interactions. IEEE Transactions on Automatic Control, 58(4), 935–946. https://doi.org/10.1109/TAC.2012.2224251
- Bansal, T., Pachocki, J., Sidor, S., Sutskever, I., & Mordatch, I. (2017). Emergent complexity via multi-agent competition. arXiv:1710.03748.
- Bao, H., Wuyun, Q., & Banzhaf, W. (2018). Evolution of cooperation through genetic collective learning and imitation in multiagent societies. In Artificial Life Conference Proceedings (pp. 436–443). MIT Press.
- Camerer, C. F. (2011). Behavioral game theory: Experiments in strategic interaction. Princeton University Press.
- Gibney, E. (2016). Go players react to computer defeat. Nature News. https://doi.org/10.1038/nature.2016.19255
- Gleave, A., Dennis, M., Wild, C., Kant, N., Levine, S., & Russell, S. (2019). Adversarial policies: Attacking deep reinforcement learning. arXiv:1905.10615.
- Haim, G., Gal, Y., An, B., & Kraus, S. (2017). Human-computer negotiation in a three player market setting. Artificial Intelligence, 246, 34–52. https://doi.org/10.1016/j.artint.2017.01.003
- Hassabis, D. (2017). Artificial intelligence: Chess match of the century. Nature, 544(7651), 413–414. https://doi.org/10.1038/544413a
- Hu, J., Wu, Y., Li, T., & Ghosh, B. K. (2019). Consensus control of general linear multi-agent systems with antagonistic interactions and communication noises. IEEE Transactions on Automatic Control, 64(5), 2122–2127. https://doi.org/10.1109/TAC.2018.2872197
- Hu, J., & Zheng, W. X. (2014). Emergent collective behaviors on coopetition networks. Physics Letters A, 378(26–27), 1787–1796. https://doi.org/10.1016/j.physleta.2014.04.070
- Lewis, F. L., & Liu, D. (2013). Reinforcement learning and approximate dynamic programming for feedback control. Wiley.
- Lima, S. L. (2002). Putting predators back into behavioral predator-prey interactions. Trends in Ecology and Evolution, 17(2), 70–75. https://doi.org/10.1016/S0169-5347(01)02393-X
- Liu, D. R., & Wei, Q. L. (2014). Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems. IEEE Transactions on Neural Networks and Learning Systems, 25(3), 621–634. https://doi.org/10.1109/TNNLS.2013.2281663
- Ma, H. W., Liu, D. R., Wang, D., & Luo, B. (2016). Bipartite output consensus in networked multi-agent systems of high-order power integrators with signed digraph and input noises. International Journal of Systems Science, 47(13), 3116–3131. https://doi.org/10.1080/00207721.2015.1090039
- Murray, J. J., Cox, C. J., Lendaris, G. G., & Saeks, R. (2002). Adaptive dynamic programming. IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 32(2), 140–153. https://doi.org/10.1109/TSMCC.2002.801727
- Neumann, J. von, & Morgenstern, O. (2007). Theory of games and economic behavior. Princeton University Press.
- Peng, Z., Hu, J., & Ghosh, B. K. (2020). Data-driven containment control of discrete-time multi-agent systems via value iteration. Science China Information Sciences, 63(8), 189205. https://doi.org/10.1007/s11432-018-9671-2
- Peng, Z., Zhao, Y., Hu, J., & Ghosh, B. K. (2019). Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm. Information Sciences, 481, 189–202. https://doi.org/10.1016/j.ins.2018.12.079
- Si, J., & Wang, Y. T. (2001). Online learning control by association and reinforcement. IEEE Transactions on Neural Networks, 12(2), 264–276. https://doi.org/10.1109/72.914523
- Ware, A. (2009). The dynamics of two-party politics: Party structures and the management of competition, comparative politics. Oxford University Press.
- Zhang, H. P., Yue, D., Dou, C. X., Zhao, W., & Xie, X. P. (2019). Data-driven distributed optimal consensus control for unknown multiagent systems with input-delay. IEEE Transactions on Cybernetics, 49(6), 2095–2105. https://doi.org/10.1109/TCYB.2018.2819695
- Zhao, D. B., Xia, Z. P., & Wang, D. (2015). Model-free optimal control for affine nonlinear systems with convergence analysis. IEEE Transactions on Automation Science and Engineering, 12(4), 1461–1468. https://doi.org/10.1109/TASE.2014.2348991
- Zhong, X. N., & He, H. B. (2020). GrHDP solution for optimal consensus control of multiagent discrete-time systems. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 50(7), 2362–2374. https://doi.org/10.1109/TSMC.2018.2814018