Deep Learning Based Binaural Speech Separation in Reverberant Environments

Zhang, Xueliang<sup>*</sup>; Wang, DeLiang

doi:10.1109/TASLP.2017.2687104

摘要

Speech signal is usually degraded by room reverberation and additive noises in real environments. This paper focuses on separating target speech signal in reverberant conditions from binaural inputs. Binaural separation is formulated as a supervised learning problem, and we employ deep learning to map from both spatial and spectral features to a training target. With binaural inputs, we first apply a fixed beamformer and then extract several spectral features. A new spatial feature is proposed and extracted to complement the spectral features. The training target is the recently suggested ideal ratio mask. Systematic evaluations and comparisons show that the proposed system achieves very good separation performance and substantially outperforms related algorithms under challenging multisource and reverberant environments.

出版日期2017-5
单位内蒙古大学

全文

访问全文

收藏分享被引(87) 浏览

更新时间：2024-05-12 10:06

Deep Learning Based Binaural Speech Separation in Reverberant Environments

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友