摘要

A new index is introduced in this article to measure the degree of normality in the speech. The proposed parameter has demonstrated to be correlated with the perceived hoarseness, giving an indication of the degree of normality. The calculation of such a parameter is based on a statistical model developed to represent normal and pathological voices. The modeling is built around Gaussian mixture models and Mel frequency cepstral coefficients. The proposed index has been named pathological likelihood index (PLI). PLI is compared with other aperiodicity features (such as jitter and shimmer), and measurements sensitive to additive noise (such as harmonics-to-noise ratio (HNR), cepstrum-based HNR, normalized noise energy, and glottal-to-noise excitation ratio). The proposed parameter is revealed to be a good estimator of the presence of pathology, showing lower correlation with noise, frequency, and amplitude perturbation parameters than these classical features among them.

  • 出版日期2010-11