A generalized time-frequency subtraction method for robust, speech   enhancement based on wavelet filter banks modeling of human auditory   system

Shao Yu; Chang Chip Hong

doi:10.1109/TSMCB.2007.895365

摘要

We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to-noise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and, improve the Intelligibility of speech. The proposed method is a generalized time-frequency subtraction algorithm, which advantageously exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time-frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Through rigorous objective and subjective evaluations, it is shown that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments and the overall performance is superior to several competitive methods.

出版日期2007-8
单位南阳理工学院

全文

访问全文

收藏分享被引(9) 浏览

更新时间：2018-08-02 16:31

A generalized time-frequency subtraction method for robust, speech enhancement based on wavelet filter banks modeling of human auditory system

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友