Original Article

The impact of sociality regimes on heterogeneous cooperative-competitive multi-agent reinforcement learning: a study with the predator-prey game

Received 30 Jan 2023, Accepted 25 May 2024, Published online: 12 Jun 2024
 

ABSTRACT

Performance in multi-agent reinforcement learning (MARL) scenarios has usually been analysed in homogeneous teams with only a few choices of sociality regime (selfish, egalitarian, or altruistic). In this paper we analyse both homogeneous and heterogeneous teams across a range of sociality regimes in the predator-prey game, using a novel normalisation of the weights so that the sum of all rewards is independent of the sociality regime. We find that the selfish regime is advantageous for both predator and prey teams, and for both homogeneous and heterogeneous teams. In particular, rewards for the predator team are about 100% higher when switching from the egalitarian to the selfish regime, and more than 400% higher when switching from the altruistic regime. For the prey, the increases are around 40% and 100% respectively. The results are similar for homogeneous and heterogeneous situations. The takeaway message is that any study of homogeneous and heterogeneous cooperative-competitive multi-agent reinforcement learning teams should take the sociality regime into account before drawing conclusions about which algorithm is preferable.
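To make the idea of a reward normalisation that is independent of the sociality regime more concrete, the following is a minimal sketch and not the weighting scheme used in the article, which the abstract does not spell out. It assumes, purely for illustration, that each agent's shaped reward is a weighted sum of its team's raw rewards and that every column of the weight matrix sums to 1, so the team total is preserved. The names `sociality_weights` and `shaped_rewards` and the three weight matrices are hypothetical.

```python
import numpy as np


def sociality_weights(n_agents: int, regime: str) -> np.ndarray:
    """Hypothetical weights: W[i, j] is the share of agent j's raw reward given to agent i.

    Every column sums to 1, so redistributing rewards never changes the team total.
    """
    if regime == "selfish":
        # Each agent keeps all of its own reward.
        return np.eye(n_agents)
    if regime == "egalitarian":
        # Every reward is split equally among all team members.
        return np.full((n_agents, n_agents), 1.0 / n_agents)
    if regime == "altruistic":
        if n_agents < 2:
            raise ValueError("the altruistic regime needs at least two agents")
        # Each agent gives its whole reward away, split equally among the others.
        return (np.ones((n_agents, n_agents)) - np.eye(n_agents)) / (n_agents - 1)
    raise ValueError(f"unknown regime: {regime}")


def shaped_rewards(raw, regime: str) -> np.ndarray:
    """Apply the assumed sociality weights to a vector of raw team rewards."""
    raw = np.asarray(raw, dtype=float)
    return sociality_weights(len(raw), regime) @ raw


if __name__ == "__main__":
    raw = [3.0, 0.0, 1.0]  # illustrative raw rewards for a three-agent team
    for regime in ("selfish", "egalitarian", "altruistic"):
        shaped = shaped_rewards(raw, regime)
        print(regime, shaped, "team total:", shaped.sum())
```

Under this assumption the team total is the same in all three regimes, so any difference in learned performance between regimes cannot be attributed to one regime simply handing out more reward.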

Acknowledgement

We thank the anonymous reviewers for their comments.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Reproducibility and results data availability

Here we give some further details about reproducibility and result data.

The hardware and software configuration on which we ran the environment is as follows:

  • Processor: Intel(R) Xeon(R) Gold 5215 CPU @ 2.50 GHz

  • Operating system: Ubuntu 18.04

  • CPU cores: 40

  • Memory: 125 GB

  • Available memory: 109 GB

In the 2-predators × 2-prey game, there are 100 matches; in the 3-predators × 3-prey game, there are 400 matches. The sample numbers here are 2×100=20 and 2×400=40. In total, this gives 20×9+40×9=540 experiments across the three sociality regimes. With this hardware, a 2-predators × 2-prey game takes around 50 minutes and a 3-predators × 3-prey game around 70 minutes. We ran 4 to 8 processes simultaneously. If things go well, these experiments take around three weeks with this hardware (see Note 1), which corresponds to around 100 kW in total.
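The experiment count above can be checked with a few lines of Python. This is a back-of-the-envelope sketch: the sample sizes, the factor of 9, and the per-game durations are taken from the text, while the assumption that each of the 540 experiments costs exactly one game duration is ours. The constant and function names are hypothetical.

```python
# Figures quoted in the text.
SAMPLES = {"2v2": 20, "3v3": 40}          # sample numbers per game size
CONFIGS_PER_SAMPLE = 9                    # the factor 9 in 20×9 + 40×9
MINUTES_PER_GAME = {"2v2": 50, "3v3": 70}  # approximate duration on the listed hardware


def total_experiments() -> int:
    """Number of experiments: 20×9 + 40×9."""
    return sum(n * CONFIGS_PER_SAMPLE for n in SAMPLES.values())


def serial_hours() -> float:
    """Aggregate single-process time, ASSUMING one game duration per experiment."""
    minutes = sum(
        SAMPLES[size] * CONFIGS_PER_SAMPLE * MINUTES_PER_GAME[size]
        for size in SAMPLES
    )
    return minutes / 60


if __name__ == "__main__":
    print(total_experiments())  # 540
    print(serial_hours())       # 570.0
```

The second figure is aggregate single-process time only; it does not model the 4 to 8 concurrent processes or any slowdown they cause, so it should not be read as a wall-clock estimate.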

In compliance with the recommendations of the Science paper (Burnell et al., 2023), we include all the results at the instance level in the appendix and further results can be found at: https://github.com/EvaluationResearch/SocialityMultiagent.

Supplementary data

Supplemental data for this article can be accessed online at https://doi.org/10.1080/0952813X.2024.2361408

Notes

1. GPUs did not speed up computations with TensorFlow in this environment, so we only used CPUs in the end.

Additional information

Funding

This work was funded by the EU (FEDER) and Spanish grant RTI2018-094403-B-C32 funded by MCIN/AEI/10.13039/501100011033 and by ‘ERDF A way of making Europe’, Generalitat Valenciana under CIPROM/2022/6 (FASSLOW) and IDIFEDER/2021/05 (CLUSTERIA), EU’s Horizon 2020 research and innovation programme under grant agreement No. 952215 (TAILOR) and Spanish grant PID2021-122830OB-C42 (SFERA) funded by MCIN/AEI/10.13039/501100011033 and ‘ERDF A way of making Europe’ and China Scholarship Council (CSC) scholarship (No. 202006290201).
