An iterative model-based approach to cochannel speech separation

Hu Ke<sup>*</sup>; Wang DeLiang

doi:10.1186/1687-4722-2013-14

摘要

Cochannel speech separation aims to separate two speech signals from a single mixture. In a supervised scenario, the identities of two speakers are given, and current methods use pre-trained speaker models for separation. One issue in model-based methods is the mismatch between training and test signal levels. We propose an iterative algorithm to adapt speaker models to match the signal levels in testing. Our algorithm first obtains initial estimates of source signals using unadapted speaker models and then detects the input signal-to-noise ratio (SNR) of the mixture. The input SNR is then used to adapt the speaker models for more accurate estimation. The two steps iterate until convergence. Compared to search-based SNR detection methods, our method is not limited to given SNR levels. Evaluations demonstrate that the iterative procedure converges quickly in a considerable range of SNRs and improves separation results significantly. Comparisons show that the proposed system performs significantly better than related model-based systems.

出版日期2013

全文

访问全文

收藏分享被引(4) 浏览

更新时间：2019-03-28 08:44

An iterative model-based approach to cochannel speech separation

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友