A modified algorithm for voice conversion using compressed sensing

Jian Zhihua<sup>*</sup>; Wang Xiangwen

Abstract

This paper proposes a voice conversion algorithm that uses compressed sensing to exploit the information shared between consecutive speech frames. Because the vector formed by concatenating several consecutive Line Spectral Pair (LSP) vectors is sparse in the discrete cosine transform domain, compressed sensing is used to extract a compressed vector from the concatenated LSPs, and this compressed vector serves as the feature vector for training the conversion function. Evaluation results show that, when an appropriate number of speech frames is chosen, the approach improves performance by 3.21% on average over the conventional algorithm based on weighted frequency warping. The experimental results also show that taking full advantage of inter-frame information improves the performance of a voice conversion system, because this information helps the converted speech retain the more stable acoustic properties that are inherent across frames.
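The feature extraction step described above can be illustrated with a minimal sketch. The abstract does not give the measurement matrix, the number of concatenated frames, the LSP order, or the number of measurements, so every value and name below (`stack_frames`, `compress`, a Gaussian random measurement matrix, 3 frames of order-10 LSPs, 12 measurements) is an assumption chosen for illustration and not the authors' implementation. The LSP data is synthetic.

```python
import numpy as np


def dct_matrix(n):
    """Return the orthonormal DCT-II matrix of size n x n."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    mat[0, :] /= np.sqrt(2.0)
    return mat


def stack_frames(lsp, num_frames):
    """Concatenate each run of num_frames consecutive LSP vectors.

    lsp has shape (total_frames, order); the result has shape
    (total_frames - num_frames + 1, num_frames * order).
    """
    total = lsp.shape[0]
    count = total - num_frames + 1
    return np.stack([lsp[t:t + num_frames].reshape(-1) for t in range(count)])


def compress(stacked, num_measurements, seed=0):
    """Project each concatenated vector with a random Gaussian matrix.

    The Gaussian matrix is an assumed choice; the abstract does not
    say which measurement matrix the paper uses.
    """
    rng = np.random.default_rng(seed)
    dim = stacked.shape[1]
    phi = rng.standard_normal((num_measurements, dim)) / np.sqrt(num_measurements)
    return stacked @ phi.T, phi


# Synthetic stand-in for real LSP data: 50 frames of order-10 LSPs,
# sorted ascending within (0, pi) as LSP frequencies are.
rng = np.random.default_rng(1)
lsp = np.sort(rng.uniform(0.0, np.pi, size=(50, 10)), axis=1)

stacked = stack_frames(lsp, num_frames=3)            # shape (48, 30)
coeffs = stacked @ dct_matrix(stacked.shape[1]).T    # DCT-domain view of each vector
features, phi = compress(stacked, num_measurements=12)  # shape (48, 12)
```

In this sketch `features` plays the role of the compressed feature vectors that would be used to train a conversion function, and `coeffs` is only there to inspect how concentrated the concatenated vectors are in the DCT domain. Recovering LSPs from converted features would additionally require a sparse reconstruction step, which is not shown.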

Publication date: 2014
Affiliation: Shanghai University


