Machine learning techniques for semantic analysis of dysarthric speech: An experimental study

Despotovic Vladimir; Walter Oliver; Haeb-Umbach Reinhold

doi:10.1016/j.specom.2018.04.005

Abstract

We present an experimental comparison of seven state-of-the-art machine learning algorithms for the task of semantic analysis of spoken input, with a special emphasis on applications for dysarthric speech. Dysarthria is a motor speech disorder, which is characterized by poor articulation of phonemes. In order to cater for these non-canonical phoneme realizations, we employed an unsupervised learning approach to estimate the acoustic models for speech recognition, which does not require a literal transcription of the training data. Even for the subsequent task of semantic analysis, only weak supervision is employed, whereby the training utterance is accompanied by a semantic label only, rather than a literal transcription. Results on two databases, one of them containing dysarthric speech, are presented showing that Markov logic networks and conditional random fields substantially outperform other machine learning approaches. Markov logic networks have proved to be especially robust to recognition errors, which are caused by imprecise articulation in dysarthric speech.

Publication date: 2018-05
