Decentralized Indirect Methods for Learning Automata Games

Tilak Omkar<sup>*</sup>; Martin Ryan; Mukhopadhyay Snehasis

doi:10.1109/TSMCB.2011.2118749

摘要

We discuss the application of indirect learning methods in zero-sum and identical payoff learning automata games. We propose a novel decentralized version of the well-known pursuit learning algorithm. Such a decentralized algorithm has significant computational advantages over its centralized counterpart. The theoretical study of such a decentralized algorithm requires the analysis to be carried out in a nonstationary environment. We use a novel bootstrapping argument to prove the convergence of the algorithm. To our knowledge, this is the first time that such analysis has been carried out for zero-sum and identical payoff games. Extensive simulation studies are reported, which demonstrate the proposed algorithm's fast and accurate convergence in a variety of game scenarios. We also introduce the framework of partial communication in the context of identical payoff games of learning automata. In such games, the automata may not communicate with each other or may communicate selectively. This comprehensive framework has the capability to model both centralized and decentralized games discussed in this paper.

出版日期2011-10

全文

访问全文

收藏分享被引(6) 浏览

更新时间：2017-06-26 10:55

Decentralized Indirect Methods for Learning Automata Games

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友