Acoustic Analysis for Automatic Speech Recognition

O'Shaughnessy, Douglas*

doi:10.1109/JPROC.2013.2251592

Abstract

As a pattern recognition application, automatic speech recognition (ASR) requires the extraction of useful features from its input signal, speech. To help determine relevance, human speech production and acoustic aspects of speech perception are reviewed, to identify acoustic elements likely to be most important for ASR. Common methods of estimating useful aspects of speech spectral envelopes are reviewed, from the point of view of efficiency and reliability in mismatched conditions. Because many speech inputs for ASR have noise and channel degradations, ways to improve robustness in speech parameterization are analyzed. While the main focus in ASR is to obtain spectral envelope measures, human speech communication efficiently exploits the manipulation of one's vocal-cord vibration rate [fundamental frequency (F0)], and so F0 extraction and its integration into ASR are also reviewed. For the acoustic analysis reviewed here for ASR, this work presents modern methods as well as future perspectives on important aspects of speech information processing.

Publication date: 2013-05

Last updated: 2017-04-24 18:30
