Universal effect of dynamical reinforcement learning mechanism in spatial evolutionary games

作者:Zhang Hai Feng*; Wu Zhi Xi; Wang Bing Hong
来源:Journal of Statistical Mechanics: Theory and Experiment , 2012, P06005.
DOI:10.1088/1742-5468/2012/06/P06005

摘要

One of the prototypical mechanisms in understanding the ubiquitous cooperation in social dilemma situations is the win-stay, lose-shift rule. In this work, a generalized win-stay, lose-shift learning model-a reinforcement learning model with dynamic aspiration level-is proposed to describe how humans adapt their social behaviors based on their social experiences. In the model, the players incorporate the information of the outcomes in previous rounds with time-dependent aspiration payoffs to regulate the probability of choosing cooperation. By investigating such a reinforcement learning rule in the spatial prisoner's dilemma game and public goods game, a most noteworthy viewpoint is that moderate greediness (i.e. moderate aspiration level) favors best the development and organization of collective cooperation. The generality of this observation is tested against different regulation strengths and different types of network of interaction as well. We also make comparisons with two recently proposed models to highlight the importance of the mechanism of adaptive aspiration level in supporting cooperation in structured populations.