Abstract

In this paper, we present a distributed reinforcement learning strategy for morphology-independent lifelong gait learning for modular robots. All modules run identical controllers that locally and independently optimize their action selection based on the robot's velocity as a global, shared reward signal. We evaluate the strategy experimentally, mainly on simulated but also on physical modular robots. We find that the strategy: (i) for six of seven configurations (3-12 modules) converges in 96% of the trials to the best known action-based gaits within 15 min, on average, (ii) can be transferred to physical robots with comparable performance, (iii) can be applied to learn simple gait control tables for both M-TRAN and ATRON robots, (iv) enables an 8-module robot to adapt to faults and changes in its morphology, and (v) can learn gaits for robots of up to 60 modules, although a divergence effect becomes substantial from 20-30 modules. These experiments demonstrate the advantages of a distributed learning strategy for modular robots, such as simplicity of implementation, low resource requirements, morphology independence, reconfigurability, and fault tolerance.
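
The core idea described above, identical and independent per-module learners that share the robot's velocity as a common reward, could be sketched roughly as follows. This is a minimal, hypothetical illustration only: the epsilon-greedy selection, learning rate, action set size, and the `simulate_gait` placeholder are assumptions for illustration, not the paper's exact algorithm or parameters.

```python
import random


class ModuleLearner:
    """Hypothetical per-module learner (illustrative sketch, not the paper's exact rule).

    Each module keeps a value estimate for each of its candidate actions
    (e.g. entries of a simple gait control table) and updates the estimate
    of the action it last executed using the shared global reward.
    """

    def __init__(self, num_actions, epsilon=0.1, alpha=0.2):
        self.values = [0.0] * num_actions  # per-action value estimates
        self.epsilon = epsilon             # exploration rate (assumed)
        self.alpha = alpha                 # learning rate (assumed)
        self.last_action = None

    def select_action(self):
        # Epsilon-greedy choice over this module's own actions.
        if random.random() < self.epsilon:
            self.last_action = random.randrange(len(self.values))
        else:
            self.last_action = max(range(len(self.values)),
                                   key=lambda a: self.values[a])
        return self.last_action

    def update(self, reward):
        # Every module receives the same global reward: the robot's velocity.
        a = self.last_action
        self.values[a] += self.alpha * (reward - self.values[a])


def simulate_gait(actions):
    # Placeholder for measuring the robot's velocity after one gait cycle;
    # in the paper this would come from simulation or the physical robot.
    return sum(actions) / (len(actions) or 1)  # dummy value for illustration


# Usage: identical, independent learners, one per module of an 8-module robot.
modules = [ModuleLearner(num_actions=4) for _ in range(8)]
for episode in range(100):
    actions = [m.select_action() for m in modules]  # each module acts locally
    velocity = simulate_gait(actions)               # global performance measure
    for m in modules:
        m.update(velocity)                          # shared reward signal
```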

  • Publication date: 2013-9