An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech

作者:Taal Cees H*; Hendriks Richard C; Heusdens Richard; Jensen Jesper
来源:Journal of the Acoustical Society of America, 2011, 130(5): 3013-3027.
DOI:10.1121/1.3641373

摘要

Existing objective speech-intelligibility measures are suitable for several types of degradation, however, it turns out that they are less appropriate in cases where noisy speech is processed by a time-frequency weighting. To this end, an extensive evaluation is presented of objective measure for intelligibility prediction of noisy speech processed with a technique called ideal time frequency (TF) segregation. In total 17 measures are evaluated, including four advanced speech-intelligibility measures (CSII, CSTI, NSEC, DAU), the advanced speech-quality measure (PESQ), and several frame-based measures (e.g., SSNR). Furthermore, several additional measures are proposed. The study comprised a total number of 168 different TF-weightings, including unprocessed noisy speech. Out of all measures, the proposed frame-based measure MCC gave the best results (rho = 0.93). An additional experiment shows that the good performing measures in this study also show high correlation with the intelligibility of single-channel noise reduced speech.

  • 出版日期2011-11