A similarity measure between videos using alignment, graphical and speech features

Authors: Fuentes D*; Bardeli R; Ortega J A; Gonzalez Abril L
Source: Expert Systems with Applications, 2012, 39(11): 10278-10282.
DOI: 10.1016/j.eswa.2012.02.169

Abstract

A novel video similarity measure is proposed that uses visual features, alignment distances and speech transcripts. First, each video file is represented as a sequence of segments, each of which contains colour histograms, a starting time, and a set of phonemes. Next, textual, alignment and visual features are extracted from these segments. Then, bipartite matching and statistical features are applied to find correspondences between segments. Finally, a similarity between the videos is calculated. Experiments have been carried out, and promising results have been obtained.
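The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Segment` class, the histogram-intersection and Jaccard scores, the equal weighting `w_visual=0.5`, and the normalisation by the longer video are all assumptions made for illustration. The alignment features and statistical features mentioned in the abstract are omitted, and the bipartite matching is solved by brute force, which is only practical for a handful of segments.

```python
from dataclasses import dataclass
from itertools import permutations


@dataclass
class Segment:
    """Hypothetical segment record: start time, colour histogram, phonemes."""
    start: float      # starting time in seconds (unused in this sketch)
    histogram: list   # normalised colour histogram (bins sum to 1)
    phonemes: set     # phonemes recognised in the segment


def histogram_intersection(h1, h2):
    # Visual score in [0, 1] for normalised histograms of equal length.
    return sum(min(a, b) for a, b in zip(h1, h2))


def jaccard(a, b):
    # Textual score in [0, 1]; two empty phoneme sets count as identical.
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def segment_similarity(s, t, w_visual=0.5):
    # Assumed combination: weighted mean of the visual and textual scores.
    visual = histogram_intersection(s.histogram, t.histogram)
    textual = jaccard(s.phonemes, t.phonemes)
    return w_visual * visual + (1.0 - w_visual) * textual


def video_similarity(video_a, video_b):
    # Maximum-weight bipartite matching by exhaustive search: every segment
    # of the shorter video is paired with a distinct segment of the longer one.
    short, long_ = sorted((video_a, video_b), key=len)
    if not short:
        return 0.0
    best = max(
        sum(segment_similarity(s, long_[j]) for s, j in zip(short, perm))
        for perm in permutations(range(len(long_)), len(short))
    )
    # Dividing by the longer length penalises unmatched segments.
    return best / len(long_)


video_1 = [
    Segment(0.0, [0.5, 0.5], {"a", "b"}),
    Segment(4.0, [1.0, 0.0], {"k"}),
]
video_2 = [Segment(0.0, [0.0, 1.0], {"s", "t"})]
video_3 = [Segment(0.0, [1.0, 0.0], {"k"})]

print(video_similarity(video_1, video_1))
print(video_similarity(video_2, video_3))
```

For longer videos, the exhaustive search would need to be replaced by a polynomial-time assignment solver such as the Hungarian algorithm.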

  • Publication date: 2012-9-1