A model for the prediction of breathiness in vowels

Shrivastav Rahul<sup>*</sup>; Camacho Arturo; Patel Sona; Eddins David A

doi:10.1121/1.3543993

摘要

The perception of breathiness in vowels is cued by multiple acoustic cues, including changes in aspiration noise (AH) and the open quotient (OQ) [Klatt and Klatt, J. Acoust. Soc. Am. 87(2), 820 857 (1990)]. A loudness model can be used to determine the extent to which AH masks the harmonic components in voice. The resulting "partial loudness" (PL) and loudness of AH ["noise loudness" (NL)] have been shown to be good predictors of perceived breathiness [Shrivastav and Sapienza, J. Acoust. Soc. Am. 114(1), 2217-2224 (2003)]. The levels of AH and OQ were systematically manipulated for ten synthetic vowels. Perceptual judgments of breathiness were obtained and regression functions to predict breathiness from the ratio of NL to PL (eta) were derived. Results show that breathiness can be modeled as a power function of eta. The power parameter of this function appears to be affected by the fundamental frequency of the vowel. A second experiment was conducted to determine if the resulting power function could estimate breathiness in a different set of voices. The breathiness of these stimuli, both natural and synthetic, was determined in a listening test. The model estimates of breathiness were highly correlated with perceptual data but the absolute predicted values showed some discrepancies.

出版日期2011-3

全文

访问全文

收藏分享被引(7) 浏览

更新时间：2018-02-09 22:49

A model for the prediction of breathiness in vowels

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友