Abstract

In this article we propose a framework for performing embodied evolution with a limited number of robots, by utilizing time sharing in subpopulations of virtual agents hosted in each robot. Within this framework, we explore the combination of within-generation learning of basic survival behaviors by reinforcement learning, and evolutionary adaptation over the generations of the basic behavior selection policy, the reward functions, and the metaparameters for reinforcement learning. We apply a biologically inspired selection scheme, in which there is no explicit communication of the individuals' fitness information. Individuals can produce offspring only by mating (a pairwise exchange of genotypes), and the probability that an individual produces offspring in its own subpopulation depends on the individual's "health," that is, its energy level, at the mating occasion. We validate the proposed method by comparing it with evolution using standard centralized selection, in simulation, and by transferring the obtained solutions to hardware using two real robots.

  • Publication date: 2011-4
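
The following is a minimal sketch of the energy-dependent, pairwise mating scheme described in the abstract. All names (`Agent`, `reproduction_probability`, `mate`, `crossover`), the linear energy-to-probability mapping, and the uniform crossover operator are illustrative assumptions, not the authors' actual implementation; the abstract only states that reproduction probability depends on an individual's energy level at the mating occasion.

```python
import random

class Agent:
    """One virtual agent in a robot's subpopulation."""
    def __init__(self, genotype, energy):
        self.genotype = genotype  # evolved parameters (behavior selection policy, rewards, metaparameters)
        self.energy = energy      # "health" accumulated during within-generation learning

def reproduction_probability(energy, energy_scale=100.0):
    # Assumed mapping: probability grows linearly with energy, clipped to [0, 1].
    return min(1.0, max(0.0, energy / energy_scale))

def crossover(g1, g2, rng):
    # Illustrative uniform crossover over a real-valued genotype vector.
    return [a if rng.random() < 0.5 else b for a, b in zip(g1, g2)]

def mate(parent_a, parent_b, rng=random):
    """Pairwise exchange of genotypes: each parent independently places an
    offspring in its own subpopulation with a probability given by its energy.
    No fitness values are communicated between the robots."""
    child_a = crossover(parent_a.genotype, parent_b.genotype, rng) \
        if rng.random() < reproduction_probability(parent_a.energy) else None
    child_b = crossover(parent_b.genotype, parent_a.genotype, rng) \
        if rng.random() < reproduction_probability(parent_b.energy) else None
    return child_a, child_b

# Example: two agents meet; the higher-energy agent is more likely to reproduce.
a = Agent(genotype=[0.2, 0.8, 0.5], energy=80.0)
b = Agent(genotype=[0.9, 0.1, 0.4], energy=20.0)
print(mate(a, b))
```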