Auditory motivated front-end for noisy speech using spectro-temporal modulation filtering

Ganapathy Sriram<sup>*</sup>; Omar Mohamed

doi:10.1121/1.4896406

摘要

The robustness of the human auditory system to noise is partly due to the peak preserving capability of the periphery and the cortical filtering of spectro-temporal modulations. In this letter, a robust speech feature extraction scheme is developed that emulates this processing by deriving a spectrographic representation that emphasizes the high energy regions. This is followed by a modulation filtering step to preserve only the important spectro-temporal modulations. The features derived from this representation provide significant improvements for speech recognition in noise and language identification in radio channel speech. Further, the experimental analysis shows congruence with human psychophysical studies.

出版日期2014-11
单位IBM

全文

访问全文

收藏分享被引(2) 浏览

更新时间：2021-04-20 23:11

Auditory motivated front-end for noisy speech using spectro-temporal modulation filtering

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友