DISCRIMINATIVE STREAM-WEIGHT TRAINING FOR MANDARIN AUDIO-VISUAL SPEECH RECOGNITION

Wu, Guanyong*; Zhu, Jie

doi:10.1080/02533839.2010.9671667

Abstract

Two discriminative stream-weight training methods are proposed to improve the robustness and reduce the word error rate of a large-vocabulary audio-visual speech recognition system. State-dependent stream weights are trained by lattice rescoring under the minimum phone error (MPE) criterion and the boosted maximum mutual information (boosted MMI) criterion, respectively, using the extended Baum-Welch algorithm. Experimental results show considerable error reductions over systems that use global stream weights. The proposed methods also outperform stream-weight training based on minimum classification error (MCE).
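To make the distinction between global and state-dependent stream weights concrete, the sketch below shows how such weights are commonly applied in a multi-stream HMM: the per-state score is a weighted sum of the audio and visual log-likelihoods. This is an illustration under stated assumptions, not the authors' implementation. The abstract does not give the combination formula, so the constraint that the two weights sum to 1, the function names, and all numeric values are hypothetical. The discriminative training step itself (MPE or boosted MMI with extended Baum-Welch updates) is not shown.

```python
# Illustrative only: the weighting scheme and every value here are assumptions.

def combined_log_likelihood(log_audio, log_visual, audio_weight):
    """Weighted sum of per-stream log-likelihoods for one HMM state.

    Assumes the audio and visual weights sum to 1, a common convention
    that the abstract does not confirm.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    return audio_weight * log_audio + (1.0 - audio_weight) * log_visual


# Hypothetical global weight shared by all states.
GLOBAL_WEIGHT = 0.7

# Hypothetical state-dependent weights, keyed by made-up state names.
STATE_WEIGHTS = {"s1": 0.9, "s2": 0.4}


def score_state(state, log_audio, log_visual, state_weights=None,
                global_weight=GLOBAL_WEIGHT):
    """Score one state, falling back to the global weight when the state
    has no weight of its own."""
    weight = (state_weights or {}).get(state, global_weight)
    return combined_log_likelihood(log_audio, log_visual, weight)
```

With a global weight, every state trusts the audio stream equally. With state-dependent weights, a state whose audio model is more reliable can lean on audio while another leans on the visual stream, which is the extra freedom that the trained weights exploit.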

Publication date: 2010-07
Affiliation: Shanghai Jiao Tong University


