A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English

Authors: Silva Diego Furtado*; Alves de Souza Vinicius Mourao; Prado Alves Batista Gustavo Enrique de Almeida
Source: Acta Scientiarum - Technology, 2013, 35(4): 621-628.
DOI: 10.4025/actascitechnol.v35i4.19825

Abstract

Recognition of isolated spoken digits is the core procedure for many applications that rely solely on speech for data exchange, such as telephone-based services for dialing, airline reservation, bank transactions, and price quotation. Spoken digit recognition is generally challenging because the signals are short and some digits are acoustically very similar to others. This paper investigates the use of machine learning algorithms for spoken digit recognition and announces the free availability, to the scientific community, of a database of digits pronounced in English and Portuguese. Since machine learning algorithms depend on predictive attributes to build accurate classifiers, we believe that the most important task for successfully recognizing spoken digits is feature extraction. In this work, we show that Line Spectral Frequencies (LSF) provide a set of highly predictive coefficients. We evaluated our classifiers in different settings by altering the sampling rate to simulate low-quality channels and by varying the number of coefficients.
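
The abstract does not say how the LSF coefficients are computed, so the following is only a minimal sketch of one textbook route, not the authors' pipeline. It assumes a frame has already been reduced to linear prediction coefficients [1, a1, ..., ap] of a stable filter, forms the usual symmetric and antisymmetric polynomials, and locates their zeros on the unit circle by a grid search refined with bisection. The function name `lpc_to_lsf` and the `grid` parameter are hypothetical, chosen for illustration.

```python
import math


def lpc_to_lsf(a, grid=2048):
    """Convert LPC coefficients [1, a1, ..., ap] to p LSFs (radians, ascending).

    Assumes A(z) = 1 + a1*z^-1 + ... + ap*z^-p is minimum phase, so the
    zeros of P and Q lie on the unit circle. `grid` (hypothetical parameter)
    sets the search resolution; zeros closer than pi/grid may be missed.
    """
    ext = list(a) + [0.0]            # A(z) padded to order p + 1
    rev = [0.0] + list(a)[::-1]      # z^-(p+1) * A(1/z)
    p_sym = [x + y for x, y in zip(ext, rev)]   # symmetric polynomial P
    q_anti = [x - y for x, y in zip(ext, rev)]  # antisymmetric polynomial Q
    half = (len(ext) - 1) / 2.0

    # On the unit circle, P and Q reduce to real functions of the angle w
    # once a common linear-phase factor is removed.
    def f_p(w):
        return sum(c * math.cos((half - k) * w) for k, c in enumerate(p_sym))

    def f_q(w):
        return sum(c * math.sin((half - k) * w) for k, c in enumerate(q_anti))

    def zeros(f):
        # Search strictly inside (0, pi): the trivial zeros at z = 1 and
        # z = -1 sit on the endpoints and are skipped on purpose.
        found = []
        lo = math.pi / grid
        f_lo = f(lo)
        for i in range(2, grid):
            hi = math.pi * i / grid
            f_hi = f(hi)
            if f_hi == 0.0:
                found.append(hi)
            elif f_lo * f_hi < 0:
                left, right, f_left = lo, hi, f_lo
                for _ in range(60):  # bisection refinement
                    mid = 0.5 * (left + right)
                    f_mid = f(mid)
                    if f_left * f_mid <= 0:
                        right = mid
                    else:
                        left, f_left = mid, f_mid
                found.append(0.5 * (left + right))
            lo, f_lo = hi, f_hi
        return found

    return sorted(zeros(f_p) + zeros(f_q))


# Example with a small stable second-order predictor (illustrative values).
lsf = lpc_to_lsf([1.0, -1.2, 0.5])
print(lsf)
```

The zeros of the two polynomials interlace for a stable filter, which is why the sorted list can be used directly as a feature vector; the number of coefficients varied in the study corresponds to the LPC order p here.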

  • Publication date: 2013
