An Improved Speech/Nonspeech Classification Based on Feature Combination for Audio Indexing

Authors: Keum Ji Soo*; Lee Hyon Soo; Hagiwara Masafumi
Source: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2010, E93A(4): 830-832.
DOI: 10.1587/transfun.E93.A.830

Abstract

In this letter, we propose an improved speech/nonspeech classification method to effectively classify a multimedia source. To improve performance, we introduce a feature based on spectral duration analysis, and combine recently proposed features such as high zero crossing rate ratio (HZCRR), low short time energy ratio (LSTER), and pitch ratio (PR). According to the results of our experiments on speech, music, and environmental sounds, the proposed method obtained high classification results when compared with conventional approaches.
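
As a rough illustration of two of the features named in the abstract, the sketch below computes HZCRR and LSTER over a window of frames. It assumes the definitions commonly used in the audio classification literature: HZCRR is the fraction of frames whose zero crossing rate exceeds 1.5 times the window average, and LSTER is the fraction of frames whose short time energy falls below 0.5 times the window average. The function names, the non-overlapping framing, and the threshold factors are assumptions for illustration and are not taken from this letter; the spectral duration feature and PR are not sketched because the abstract does not define them.

```python
def frame_signal(samples, frame_len):
    """Split samples into non-overlapping frames, dropping an incomplete tail."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        raise ValueError("frame needs at least two samples")
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    if not frame:
        raise ValueError("frame is empty")
    return sum(x * x for x in frame) / len(frame)


def hzcrr(frames, factor=1.5):
    """Fraction of frames whose ZCR is above factor * average ZCR (assumed 1.5)."""
    if not frames:
        raise ValueError("no frames given")
    zcrs = [zero_crossing_rate(f) for f in frames]
    avg = sum(zcrs) / len(zcrs)
    return sum(1 for z in zcrs if z > factor * avg) / len(zcrs)


def lster(frames, factor=0.5):
    """Fraction of frames whose energy is below factor * average energy (assumed 0.5)."""
    if not frames:
        raise ValueError("no frames given")
    energies = [short_time_energy(f) for f in frames]
    avg = sum(energies) / len(energies)
    return sum(1 for e in energies if e < factor * avg) / len(energies)
```

Under these assumed definitions, speech tends to give higher values of both ratios than music, because it alternates between voiced, unvoiced, and silent frames. A typical call would frame one window of audio with `frame_signal(samples, frame_len)` and pass the result to `hzcrr` and `lster`.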

  • Publication date: 2010-4
