Adding memory condition to learning classifier systems to solve partially observable environments

Zang, Zhao Xiang*; Li, De Hua; Wang, Jun Ying

doi:10.1504/IJCAT.2013.053425

Abstract

Among learning classifier systems, the extended classifier system (XCS) stands out. However, the original XCS has no memory mechanism and can learn an optimal policy only in Markovian environments, where the optimal action is determined solely by the current sensory input. In practice, most environments are only partially observable from the agent's perspective, and they belong to the most general class of environments: non-Markovian environments. In these environments XCS, being memoryless, either fails completely or develops only a suboptimal policy. In this paper, we develop a new learning classifier system based on XCS, named 'XCSMM', which adds an internal message to XCS as an internal memory and extends each classifier with a memory condition that senses this internal memory. The memory mechanism of XCSMM is simple and clear, and it is easy to understand and implement. XCSMM was tested on four sets of maze problems of differing complexity. Experimental results show that XCSMM is able to evolve optimal or suboptimal solutions in most non-Markovian environments.
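The idea of a memory condition can be illustrated with a minimal sketch. It is not the authors' implementation: the ternary alphabet {0, 1, #}, the one-bit memory, and all names (`Classifier`, `env_condition`, `memory_condition`, `match_set`) are assumptions made for illustration. The abstract does not say how the internal message is written or updated, so the sketch covers matching only.

```python
from dataclasses import dataclass

WILDCARD = "#"  # assumed "don't care" symbol, as in the usual ternary alphabet


def ternary_match(condition: str, bits: str) -> bool:
    """Return True if every non-wildcard symbol equals the corresponding bit."""
    return len(condition) == len(bits) and all(
        c == WILDCARD or c == b for c, b in zip(condition, bits)
    )


@dataclass
class Classifier:
    env_condition: str     # matched against the current sensory input
    memory_condition: str  # matched against the internal memory (hypothetical name)
    action: int

    def matches(self, sensory_input: str, internal_memory: str) -> bool:
        # A classifier fires only if BOTH conditions are satisfied.
        return ternary_match(self.env_condition, sensory_input) and ternary_match(
            self.memory_condition, internal_memory
        )


def match_set(population, sensory_input, internal_memory):
    """Collect the classifiers that match the input and the memory content."""
    return [cl for cl in population if cl.matches(sensory_input, internal_memory)]


# Two classifiers see the same sensory input but differ in their memory
# condition, so the memory content decides which action is proposed.
go_east = Classifier(env_condition="01#0", memory_condition="1", action=2)
go_west = Classifier(env_condition="01#0", memory_condition="0", action=6)
population = [go_east, go_west]

actions_with_memory_1 = [cl.action for cl in match_set(population, "0110", "1")]
actions_with_memory_0 = [cl.action for cl in match_set(population, "0110", "0")]
```

In this sketch, two maze cells that look identical to the sensors (an aliased state) can still lead to different actions, because the memory content selects a different classifier in each case.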

Publication date: 2013
Affiliations: China Three Gorges University; Huazhong University of Science and Technology
