Selective Gammatone Envelope Feature for Robust Sound Event Recognition

Authors: Leng Yi Ren*; Huy Dat Tran; Kitaoka Norihide; Li Haizhou
Source: IEICE Transactions on Information and Systems, 2012, E95D(5): 1229-1237.
DOI: 10.1587/transinf.E95.D.1229

Abstract

Conventional features for Automatic Speech Recognition and Sound Event Recognition, such as Mel-Frequency Cepstral Coefficients (MFCCs), have been shown to perform poorly in noisy conditions. We introduce an auditory feature based on the gammatone filterbank, the Selective Gammatone Envelope Feature (SGEF), for robust Sound Event Recognition, in which channel selection and the filterbank envelope are used to reduce the effect of noise in specific noise environments. In experiments with Hidden Markov Model (HMM) recognizers, we show that our feature significantly outperforms MFCCs in four different noisy environments at various signal-to-noise ratios.
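
The abstract does not specify how the envelope is computed or how channels are chosen, so the following is only a rough illustrative sketch of the general idea, not the authors' SGEF implementation. It assumes a fourth-order gammatone impulse response with the commonly used Glasberg and Moore ERB bandwidth, a frame-averaged log power envelope per channel, and a purely hypothetical selection rule that keeps the channels carrying the least energy in a noise-only recording. All function names, frame sizes, and centre frequencies are assumptions made for this example.

```python
import numpy as np

def erb(fc):
    # Equivalent rectangular bandwidth in Hz (Glasberg and Moore approximation).
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def gammatone_ir(fc, fs, duration=0.05, order=4):
    # Truncated gammatone impulse response, normalised to unit energy.
    t = np.arange(int(duration * fs)) / fs
    b = 1.019 * erb(fc)
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.sqrt(np.sum(g ** 2))

def envelope_features(x, fs, centre_freqs, frame_len=400, hop=160):
    # Log of the frame-averaged power at the output of each gammatone channel.
    # Returns an array of shape (n_frames, n_channels).
    feats = []
    for fc in centre_freqs:
        y = np.convolve(x, gammatone_ir(fc, fs), mode="same")
        power = y ** 2
        n_frames = 1 + (len(power) - frame_len) // hop
        env = np.array([power[i * hop:i * hop + frame_len].mean()
                        for i in range(n_frames)])
        feats.append(np.log(env + 1e-10))
    return np.stack(feats, axis=1)

def select_channels(noise_feats, keep):
    # Hypothetical criterion (not from the paper): keep the `keep` channels
    # with the lowest mean log energy in a noise-only recording.
    return np.sort(np.argsort(noise_feats.mean(axis=0))[:keep])

fs = 16000
t = np.arange(fs) / fs
centre_freqs = [250.0, 1000.0, 4000.0]

event = np.sin(2 * np.pi * 1000.0 * t)   # toy "sound event" at 1 kHz
noise = np.sin(2 * np.pi * 250.0 * t)    # toy low-frequency "noise"

event_feats = envelope_features(event, fs, centre_freqs)
noise_feats = envelope_features(noise, fs, centre_freqs)
selected = select_channels(noise_feats, keep=2)
selected_feats = event_feats[:, selected]  # features restricted to kept channels
```

In this toy setup the noise energy sits in the lowest channel, so that channel is dropped and the recogniser would see only the remaining two. A real system would use many more channels and a selection rule tuned to each noise environment.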

  • Publication date: 2012-5