Universal effect of dynamical reinforcement learning mechanism in   spatial evolutionary games

Zhang Hai Feng<sup>*</sup>; Wu Zhi Xi; Wang Bing Hong

doi:10.1088/1742-5468/2012/06/P06005

摘要

One of the prototypical mechanisms in understanding the ubiquitous cooperation in social dilemma situations is the win-stay, lose-shift rule. In this work, a generalized win-stay, lose-shift learning model-a reinforcement learning model with dynamic aspiration level-is proposed to describe how humans adapt their social behaviors based on their social experiences. In the model, the players incorporate the information of the outcomes in previous rounds with time-dependent aspiration payoffs to regulate the probability of choosing cooperation. By investigating such a reinforcement learning rule in the spatial prisoner's dilemma game and public goods game, a most noteworthy viewpoint is that moderate greediness (i.e. moderate aspiration level) favors best the development and organization of collective cooperation. The generality of this observation is tested against different regulation strengths and different types of network of interaction as well. We also make comparisons with two recently proposed models to highlight the importance of the mechanism of adaptive aspiration level in supporting cooperation in structured populations.

出版日期2012-6
单位安徽大学; 兰州大学; 中国科学技术大学

全文

访问全文

收藏分享被引(3) 浏览

更新时间：2023-10-18 18:15

Universal effect of dynamical reinforcement learning mechanism in spatial evolutionary games

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友