A Dynamic-Bayesian Network framework for modeling and evaluating learning from observation

Ontanon Santiago<sup>*</sup>; Montana Jose L; Gonzalez Avelino J

doi:10.1016/j.eswa.2014.02.049

摘要

Learning from observation (LfO), also known as learning from demonstration, studies how computers can learn to perform complex tasks by observing and thereafter imitating the performance of a human actor. Although there has been a significant amount of research in this area, there is no agreement on a unified terminology or evaluation procedure. In this paper, we present a theoretical framework based on Dynamic-Bayesian Networks (DBNs) for the quantitative modeling and evaluation of LfO tasks. Additionally, we provide evidence showing that: (1) the information captured through the observation of agent behaviors occurs as the realization of a stochastic process (and often not just as a sample of a state-to-action map); (2) learning can be simplified by introducing dynamic Bayesian models with hidden states for which the learning and model evaluation tasks can be reduced to minimization and estimation of some stochastic similarity measures such as crossed entropy.

出版日期2014-9-1

全文

访问全文

收藏分享被引(18) 浏览

更新时间：2024-04-11 18:04

A Dynamic-Bayesian Network framework for modeling and evaluating learning from observation

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友