Audio segmentation-by-classification approach based on factor analysis in broadcast news domain

Castan Diego*; Ortega Alfonso; Miguel Antonio; Lleida Eduardo

doi:10.1186/s13636-014-0034-5

Abstract

This paper studies a novel audio segmentation-by-classification approach based on factor analysis. The proposed technique compensates for within-class variability by using class-dependent factor loading matrices and obtains its scores by computing, over fixed-length windows, the log-likelihood ratio of the class model to a non-class model. These scores are then smoothed by different back-end systems to yield longer contiguous segments of the same class. Unlike previous solutions, the proposal does not make use of specific acoustic features and does not need a hierarchical structure. The method is applied to segment and classify audio from TV shows into five acoustic classes: speech, music, speech with music, speech with noise, and others. Compared with a hierarchical system that uses specific acoustic features, the technique achieves a significant error reduction.
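The scoring-then-smoothing flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the factor analysis models are not reproduced, the per-window log-likelihood ratios are assumed to be already computed, and a plain moving average stands in for the back-end systems, which the abstract does not specify. The names `smooth_scores` and `segment`, the 1-second window length, and the `half_width` parameter are all hypothetical.

```python
def smooth_scores(scores, half_width=2):
    """Moving average of one class's per-window scores.

    Assumption: a moving average is only a stand-in for the paper's
    back-end systems. Windows near the edges use a shorter span.
    """
    out = []
    for i in range(len(scores)):
        lo = max(0, i - half_width)
        hi = min(len(scores), i + half_width + 1)
        out.append(sum(scores[lo:hi]) / (hi - lo))
    return out


def segment(llr, window_seconds=1.0, half_width=2):
    """Turn per-window scores into (start, end, label) segments.

    llr maps a class name to a list of per-window log-likelihood
    ratios (class model vs. non-class model); all lists must have
    the same length. window_seconds is an assumed window length.
    """
    smoothed = {c: smooth_scores(s, half_width) for c, s in llr.items()}
    n = len(next(iter(smoothed.values())))
    # Each window takes the class with the highest smoothed score.
    labels = [max(smoothed, key=lambda c: smoothed[c][i]) for i in range(n)]
    # Merge runs of identical labels into contiguous segments.
    segments, start = [], 0
    for i in range(1, n + 1):
        if i == n or labels[i] != labels[start]:
            segments.append((start * window_seconds, i * window_seconds, labels[start]))
            start = i
    return segments
```

For example, with hypothetical scores `{"speech": [2, 2, -1, 2, 2, -2, -2, -2], "music": [-2, -2, 1, -2, -2, 2, 2, 2]}` and `half_width=1`, the isolated "music" window at index 2 is absorbed into the surrounding speech, giving one speech segment followed by one music segment.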

Publication date: 2014-8-28

