A Novel Iterative Speaker Model Alignment Method from Non-Parallel Speech for Voice Conversion

Song, Peng*; Zheng, Wenming; Zhang, Xinran; Jin, Yun; Zha, Cheng; Xin, Minghai

doi:10.1587/transfun.E98.A.2178

Abstract

Most current voice conversion methods rely on parallel speech, which is not easily obtained in practice. In this letter, a novel iterative speaker model alignment (ISMA) method is proposed to address this problem. First, the source and target speaker models are each trained from the background model using the maximum a posteriori (MAP) algorithm. Then, the ISMA method is applied to align and transform the spectral features. Finally, the proposed ISMA approach is combined with a Gaussian mixture model (GMM) to further improve the conversion performance. A series of objective and subjective experiments is carried out on the CMU ARCTIC dataset, and the results demonstrate that the proposed method significantly outperforms the state-of-the-art approach.

Publication date: 2015-10
Affiliations: Yantai University; Southeast University

