A novel method for rapid speaker adaptation based on support speaker weighting

Authors: Cai, T*; Zhu, J
Source: 30th IEEE International Conference on Acoustics, Speech, and Signal Processing, United States, 2005-03-19 to 2005-03-23.
DOI: 10.1109/ICASSP.2005.1415283

Abstract

In this paper we propose a novel model-based speaker adaptation method called Support Speaker Weighting (SSW), which adapts by combining the models of a selected subset of speakers. These speakers, who are acoustically close to the test speaker, are selected from the reference speakers using support vector machines (SVM). Compared with GMM/HMM-based speaker selection, the proposed method can quickly obtain a better speaker subset because the selection is determined dynamically according to the distribution of reference speakers around the test speaker. Experimental results for a large-vocabulary task show that this method is both cheaper in terms of memory and more effective than Reference Speaker Weighting (RSW) for tiny amounts of adaptation data. A relative error rate reduction of 4.1% is achieved when only one adaptation sentence is available.
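The select-then-combine idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how the SVM selects speakers or how the combination weights are estimated, so the sketch below substitutes plain nearest-neighbour selection for the SVM step and inverse-distance weights for weights estimated from adaptation data. All function names (`select_speakers`, `inverse_distance_weights`, `combine_models`) and the representation of each speaker as a single mean vector are assumptions made for illustration.

```python
import math


def select_speakers(test_vec, ref_vecs, k):
    """Pick the k reference speakers closest to the test speaker.

    ASSUMPTION: Euclidean nearest-neighbour selection stands in for the
    SVM-based selection named in the abstract.
    """
    dists = [math.dist(test_vec, ref) for ref in ref_vecs]
    order = sorted(range(len(ref_vecs)), key=lambda i: dists[i])
    return order[:k], dists


def inverse_distance_weights(dists, selected, eps=1e-9):
    """Turn distances into weights that sum to 1.

    ASSUMPTION: a placeholder for weights estimated from adaptation data.
    """
    raw = [1.0 / (dists[i] + eps) for i in selected]
    total = sum(raw)
    return [r / total for r in raw]


def combine_models(ref_vecs, selected, weights):
    """Adapted mean vector = weighted sum of the selected speakers' means."""
    dim = len(ref_vecs[0])
    return [
        sum(w * ref_vecs[i][d] for i, w in zip(selected, weights))
        for d in range(dim)
    ]


def adapt(test_vec, ref_vecs, k):
    """Run selection, weighting, and combination in sequence."""
    selected, dists = select_speakers(test_vec, ref_vecs, k)
    weights = inverse_distance_weights(dists, selected)
    return selected, weights, combine_models(ref_vecs, selected, weights)
```

Calling `adapt(test_vec, ref_vecs, k)` with a list of reference mean vectors returns the chosen speaker indices, their weights, and the combined vector. Because only the `k` selected speakers enter the combination, the remaining reference models need not be kept in memory for that step, which loosely mirrors the memory advantage over RSW claimed in the abstract.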