摘要

A method of speaker recognition which uses feature vectors of pole distribution derived from piecewise linear predictive coefficients obtained by bagging CAN2 (competitive associative net 2) is presented and analyzed. The CAN2 is a neural net for learning efficient piecewise linear approximation of nonlinear function, and the bagging CAN2 (bootstrap aggregating version of CAN2) is used to obtain statistically stable multiple linear predictive coefficients. From the coefficients, the present method obtains a number of poles which are supposed to reflect the shape of the speaker's vocal tract. Then, the pole distribution is used as a feature vector for speaker recognition. The effectiveness is analyzed and validated using real speech data.

  • 出版日期2010