Applying the Bi-level HMM for Robust Voice-activity Detection

作者:Hwang Yongwon; Jeong Mun Ho*; Oh Sang Rok; Kim Il Hwan
来源:Journal of Electrical Engineering and Technology, 2017, 12(1): 373-377.
DOI:10.5370/JEET.2017.12.1.373

摘要

This paper presents a voice-activity detection (VAD) method for sound sequences with various SNRs. For real-time VAD applications, it is inadequate to employ a post-processing for the removal of burst clippings from the VAD output decision. To tackle this problem, building on the bilevel hidden Markov model, for which a state layer is inserted into a typical hidden Markov model (HMM), we formulated a robust method for VAD not requiring any additional post-processing. In the method, a forward-inference-ratio test was devised to detect the speech endpoints and Mel-frequency cepstral coefficients (MFCC) were used as the features. Our experiment results show that, regarding different SNRs, the performance of the proposed approach is more outstanding than those of the conventional methods.

  • 出版日期2017-1

全文