Articulatory Knowledge in the Recognition of Dysarthric Speech

Author: Rudzicz Frank*
Source: IEEE Transactions on Audio, Speech, and Language Processing, 2011, 19(4): 947-960.
DOI: 10.1109/TASL.2010.2072499

Abstract

Disabled speech is not compatible with modern generative and acoustic-only models of speech recognition (ASR). This work considers the use of theoretical and empirical knowledge of the vocal tract for atypical speech in labeling segmented and unsegmented sequences. These combined models are compared against discriminative models such as neural networks, support vector machines, and conditional random fields. Results show significant improvements in accuracy over the baseline through the use of production knowledge. Furthermore, although the statistics of vocal tract movement do not appear to be transferable between regular and disabled speakers, transforming the space of the former given knowledge of the latter before retraining gives high accuracy. This work may be applied within components of assistive software for speakers with dysarthria.

  • Publication date: 2011-5