Novel Two-Stage Audiovisual Speech Filtering in Noisy Environments

Abel Andrew<sup>*</sup>; Hussain Amir

doi:10.1007/s12559-013-9231-2

Abstract

In recent years, the established link between the various domains of human communication production has become more widely utilised in speech processing. In this work, we build on our previous work and present a novel two-stage audiovisual speech enhancement system that makes use of audio-only beamforming, automatic lip tracking, and pre-processing with visually derived Wiener speech filtering. Initial results demonstrate that this two-stage multimodal approach can produce positive results with noisy speech mixtures that conventional audio-only beamforming struggles to cope with, such as very noisy environments with a very low signal-to-noise ratio, or noise of a type that is difficult for audio-only beamforming to process.

Publication date: 2014-06

