Automating manual user strategies for precise voice source analysis

Kane John*; Gobl Christer

doi:10.1016/j.specom.2012.12.004

Abstract

A large part of the research carried out at the Phonetics and Speech Laboratory is concerned with the role of the voice source in the prosody of spoken language, including its linguistic and expressive dimensions. Because automatic voice source analysis methods lack robustness, we have tended to use labour-intensive methods that require pulse-by-pulse manual optimisation, which has limited the feasibility of analysing large volumes of data. To address this, a new method is proposed for automatic parameterisation of the deterministic component of the voice source by simulating the strategies used in the manual optimisation approach. The method combines exhaustive search, dynamic programming and optimisation methods, with settings derived from previous manual voice source analyses. A quantitative evaluation showed that the proposed method yields model parameter values clearly closer to our reference values than those of a standard time domain-based approach and a phase minimisation method. A complementary qualitative analysis produced broadly similar findings on voice source dynamics under various placements of focus when the proposed algorithm was compared with a previous study that employed the manual optimisation approach.

Publication date: 2013-03
