A Compact Representation of Visual Speech Data Using Latent Variables

Zhou Ziheng*; Hong Xiaopeng; Zhao Guoying; Pietikainen Matti

doi:10.1109/TPAMI.2013.173

Source: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(1): 181-187.

Abstract

The problem of visual speech recognition involves the decoding of the video dynamics of a talking mouth in a high-dimensional visual space. In this paper, we propose a generative latent variable model to provide a compact representation of visual speech data. The model uses latent variables to separately represent the interspeaker variations of visual appearances and those caused by uttering within images, and incorporates the structural information of the visual data through placing priors of the latent variables along a curve embedded within a path graph.
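The abstract's phrase "a curve embedded within a path graph" can be illustrated with a small sketch. It assumes that each video frame is a node of a path graph and that the curve is built from the closed-form eigenvectors of the path-graph Laplacian, which is one standard construction. This page does not confirm that the paper uses exactly this construction, and the names `path_graph_laplacian`, `path_graph_curve`, `n_frames`, and `latent_dims` are hypothetical.

```python
import numpy as np


def path_graph_laplacian(n):
    """Laplacian L = D - A of a path graph with n nodes (assumed: one node per frame)."""
    A = np.zeros((n, n))
    idx = np.arange(n - 1)
    A[idx, idx + 1] = 1.0  # edge between consecutive frames
    A[idx + 1, idx] = 1.0
    return np.diag(A.sum(axis=1)) - A


def path_graph_curve(n, dims):
    """Map the n nodes to points on a curve in `dims` dimensions.

    Coordinate k of node i is cos(pi * k * (i + 0.5) / n) for k = 1..dims,
    which is the k-th non-constant eigenvector of the path-graph Laplacian.
    """
    i = np.arange(n)[:, None]            # node (frame) index, shape (n, 1)
    k = np.arange(1, dims + 1)[None, :]  # eigenvector index, shape (1, dims)
    return np.cos(np.pi * k * (i + 0.5) / n)


# Illustrative sizes only; they are not taken from the paper.
n_frames, latent_dims = 20, 3
L = path_graph_laplacian(n_frames)
curve = path_graph_curve(n_frames, latent_dims)

# Known eigenvalues of the path-graph Laplacian: 2 - 2 cos(pi * k / n).
eigenvalues = 2.0 - 2.0 * np.cos(np.pi * np.arange(1, latent_dims + 1) / n_frames)

# Check that every column v of `curve` satisfies L v = lambda v.
residual = np.abs(L @ curve - curve * eigenvalues).max()
print(curve.shape, residual < 1e-9)
```

Under these assumptions, consecutive frames land on neighbouring points of a smooth low-dimensional curve, which is the kind of structural information a prior over latent variables could encode.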

Publication date: 2014-1
