Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination

Irino Toshio*; Aoki Yoshie; Kawahara Hideki; Patterson Roy D

doi:10.1016/j.specom.2012.04.002

Abstract

A recent series of studies has examined the interaction of glottal pulse rate (GPR) and mean-formant-frequency (MFF) in the perception of speaker characteristics and in speech recognition. This paper extends that research by comparing the recognition and discrimination performance achieved with voiced words to that achieved with whispered words. The recognition experiment shows that performance with whispered words is slightly worse than with voiced words at all MFFs when the GPR of the voiced words is in the middle of the normal range. However, as GPR decreases below this range, voiced-word performance decreases and eventually becomes worse than whispered-word performance. The discrimination experiment shows that the just noticeable difference (JND) for MFF is essentially independent of the mode of vocal excitation; the JND is close to 5% for both voiced and whispered words for all speaker types. The interaction between GPR and vocal tract length (VTL) is interpreted in terms of the stability of the internal representation of speech, which improves with GPR across the range of values used in these experiments.

Publication date: 2012-11
