Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion

Butko Taras<sup>*</sup>; Nadeu Climent

doi:10.1186/1687-4722-2011-1

摘要

Recently, audio segmentation has attracted research interest because of its usefulness in several applications like audio indexing and retrieval, subtitling, monitoring of acoustic scenes, etc. Moreover, a previous audio segmentation stage may be useful to improve the robustness of speech technologies like automatic speech recognition and speaker diarization. In this article, we present the evaluation of broadcast news audio segmentation systems carried out in the context of the Albayzin-2010 evaluation campaign. That evaluation consisted of segmenting audio from the 3/24 Catalan TV channel into five acoustic classes: music, speech, speech over music, speech over noise, and the other. The evaluation results displayed the difficulty of this segmentation task. In this article, after presenting the database and metric, as well as the feature extraction methods and segmentation techniques used by the submitted systems, the experimental results are analyzed and compared, with the aim of gaining an insight into the proposed solutions, and looking for directions which are promising.

出版日期2011

全文

访问全文

收藏分享被引(20) 浏览

更新时间：2024-04-14 08:13

Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友