Acoustic feature extraction method for robust speaker identification

Li, Zuoqiang<sup>*</sup>; Gao, Yong

doi:10.1007/s11042-015-2660-z

Abstract

When the acoustic training environment and the testing environment are mismatched, the performance of automatic speaker identification systems degrades significantly. This paper therefore proposes a robust feature extraction method for speaker recognition based on the gammatone filter. In place of the traditional triangular filter banks, gammatone filter banks are used to simulate the auditory model of the human cochlea. Cube root compression, equal loudness weighting, and relative spectral (RASTA) filtering are incorporated into the feature extraction process. A simulation experiment is conducted using the Gaussian mixture model (GMM) recognition algorithm. The experimental results indicate that the proposed feature parameters are more robust and represent the characteristics of the speaker better than the conventional mel-frequency cepstrum coefficient (MFCC), cochlear cepstrum coefficient (CFCC), and relative spectra-perceptual linear predictive (RASTA-PLP) parameters.
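The abstract does not give the filter parameters, so the following is only a minimal sketch of two of the named steps (gammatone band analysis and cube root compression), not the authors' implementation. It assumes a fourth-order gammatone impulse response, the Glasberg and Moore ERB formula, a bandwidth factor of 1.019, and 32 bands, all of which are common textbook choices rather than values taken from the paper. The function names are hypothetical, and the equal loudness and RASTA stages are omitted.

```python
import numpy as np

def erb(fc):
    """Equivalent rectangular bandwidth in Hz (Glasberg and Moore formula, assumed)."""
    return 24.7 * (4.37 * fc / 1000.0 + 1.0)

def erb_space(f_low, f_high, n_bands):
    """Center frequencies spaced uniformly on the ERB-rate scale."""
    to_rate = lambda f: 21.4 * np.log10(4.37 * f / 1000.0 + 1.0)
    rates = np.linspace(to_rate(f_low), to_rate(f_high), n_bands)
    return (10.0 ** (rates / 21.4) - 1.0) * 1000.0 / 4.37

def gammatone_ir(fc, fs, duration=0.025, order=4, b=1.019):
    """Peak-normalized gammatone impulse response (order and b are assumptions)."""
    t = np.arange(int(duration * fs)) / fs
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * erb(fc) * t) * np.cos(2 * np.pi * fc * t)
    return g / np.max(np.abs(g))

def gammatone_band_energies(frame, fs, n_bands=32, f_low=50.0, f_high=None):
    """Mean energy of one frame in each gammatone band."""
    if f_high is None:
        f_high = 0.45 * fs  # stay below the Nyquist frequency
    centers = erb_space(f_low, f_high, n_bands)
    energies = np.empty(n_bands)
    for i, fc in enumerate(centers):
        y = np.convolve(frame, gammatone_ir(fc, fs), mode="same")
        energies[i] = np.mean(y ** 2)
    return centers, energies

def cube_root_compress(energies):
    """Cube root compression, used here in place of the log in MFCC-style pipelines."""
    return np.cbrt(energies)

# Usage: analyze a synthetic 1 kHz tone sampled at 16 kHz.
fs = 16000
t = np.arange(int(0.1 * fs)) / fs
tone = np.sin(2 * np.pi * 1000.0 * t)
centers, energies = gammatone_band_energies(tone, fs)
features = cube_root_compress(energies)
```

In a full front end, the compressed band energies would typically be decorrelated (for example with a DCT) before being passed to the GMM classifier. That step is likewise an assumption here, since the abstract does not describe it.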

Publication date: 2016-6
Affiliation: Sichuan University

