A Dynamic-Bayesian Network framework for modeling and evaluating learning from observation

作者:Ontanon Santiago*; Montana Jose L; Gonzalez Avelino J
来源:Expert Systems with Applications, 2014, 41(11): 5212-5226.
DOI:10.1016/j.eswa.2014.02.049

摘要

Learning from observation (LfO), also known as learning from demonstration, studies how computers can learn to perform complex tasks by observing and thereafter imitating the performance of a human actor. Although there has been a significant amount of research in this area, there is no agreement on a unified terminology or evaluation procedure. In this paper, we present a theoretical framework based on Dynamic-Bayesian Networks (DBNs) for the quantitative modeling and evaluation of LfO tasks. Additionally, we provide evidence showing that: (1) the information captured through the observation of agent behaviors occurs as the realization of a stochastic process (and often not just as a sample of a state-to-action map); (2) learning can be simplified by introducing dynamic Bayesian models with hidden states for which the learning and model evaluation tasks can be reduced to minimization and estimation of some stochastic similarity measures such as crossed entropy.

  • 出版日期2014-9-1