摘要

Smart agents are equipped with sensors that enable them to be sensitive to their surrounding environment. However, the mapping of multiple raw streams of sensory data to the appropriate actions is not an easy problem, especially when multiple conflicting objectives are involved. This type of multi-sensor fusion problem through the domain of power management for smart mobile devices was investigated in this study. In this application domain, the objective is to keep the user's mobile devices in an "on" state as long as possible such that just-in-time services (such as reminder announcements) can be provided. However, this conflicts with the mobile device's inherent goal - to turn off to conserve power. Due to the stochastic nature of human behavior, a hand-coded fixed strategy may not be the best solution. A learning control approach to the problem is presented in this paper. The experimental results show that the approach learns the appropriate mapping from multiple streams of raw sensory data to power conserving actions that can out-perform the hand-crafted policies. The learned policies are also shown to be more robust in handling unscheduled events.