Deep feature for text-dependent speaker verification

Liu, Yuan; Qian, Yanmin<sup>*</sup>; Chen, Nanxin; Fu, Tianfan; Zhang, Ya; Yu, Kai

doi:10.1016/j.specom.2015.07.003

摘要

Recently deep learning has been successfully used in speech recognition, however it has not been carefully explored and widely accepted for speaker verification. To incorporate deep learning into speaker verification, this paper proposes novel approaches of extracting and using features from deep learning models for text-dependent speaker verification. In contrast to the traditional short-term spectral feature, such as MFCC or PLP, in this paper, outputs from hidden layer of various deep models are employed as deep features for text-dependent speaker verification. Fours types of deep models are investigated: deep Restricted Boltzmann Machines, speech-discriminant Deep Neural Network (DNN), speaker-discriminant DNN, and multi-task joint-learned DNN. Once deep features are extracted, they may be used within either the GMM-UBM framework or the identity vector (i-vector) framework. Joint linear discriminant analysis and probabilistic linear discriminant analysis are proposed as effective back-end classifiers for identity vector based deep features. These approaches were evaluated on the RSR2015 data corpus. Experiments showed that deep feature based methods can obtain significant performance improvements compared to the traditional baselines, no matter if they are directly applied in the GMM-UBM system or utilized as identity vectors. The EER of the best system using the proposed identity vector is 0.10%, only one fifteenth of that in the GMM-UBM baseline.

出版日期2015-10
单位上海交通大学

全文

访问全文

收藏分享被引(112) 浏览

更新时间：2024-04-20 19:09

Deep feature for text-dependent speaker verification

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友