A multimodal alignment framework for spoken documents

Mekhaldi Dalila<sup>*</sup>; Lalanne Denis; Ingold Rolf

doi:10.1007/s11042-011-0842-x

摘要

We present a multimodal document alignment framework, which highlights existing alignment relationships between documents that are discussed and recorded during multimedia events such as meetings. These relationships that should help indexing the archives of these events are detected using various techniques from natural language processing and information retrieval. The main alignment strategies studied are based on thematic, quotation and reference relationships. At the analysis level, the alignment framework was applied at several levels of granularity of documents, requiring specific document segmentation techniques. Our framework that is language independent was evaluated on corpora in French and English, including meetings and scientific presentations. The satisfactory evaluation results obtained at several stages show the importance of our approach in bridging the gap between meeting documents, independently from the language and domain. They highlight also the utility of the multimodal alignment in advanced applications, e.g. multimedia document browsing, content-based / temporal-based searching, etc.

出版日期2012-11

全文

访问全文

收藏分享被引(2) 浏览

更新时间：2018-04-10 15:45

A multimodal alignment framework for spoken documents

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友