Identification of a Distant Speaker and Its Robustness

Jiang Ye<sup>*</sup>; Tang Zhenmin; Wang Longbiao

摘要

Robust speaker identification is presented for speech recorded by distant microphones. Three compensation approaches are investigated to improve the robustness of speaker identification in such environments. The first approach applies spectral subtraction before feature extraction to reduce the late-reverberation effect. The second approach makes use of feature warping as feature compensation in distant speaker identification under mismatched training-testing conditions. The third approach employs a novel method of initializing Gaussian mixture model parameters: combined division and k-means clustering. The experiment results show that, relative to the baseline system based on CMN, the channel-average recognition rates for the compensated system were 11.4%, 15.4%, 17.0%, and 17.8% higher for the TIMIT database and 6.8%, 6.4%, 9.3%, and 14.0% higher for the JNAS database for four different environments. In addition, the results show that the combination of the three approaches has better performance than the use of a single compensation method.

出版日期2011-4
单位南京大学

收藏分享被引(1) 浏览

更新时间：2018-08-02 13:40

Identification of a Distant Speaker and Its Robustness

摘要

产品服务

站内浏览

服务支持

联系方式

科研之友