Abstract

We propose a framework for the automatic grouping and alignment of unedited multi-camera User-Generated Videos (UGVs) within a database. The proposed framework analyzes the audio to match and cluster UGVs that capture the same spatio-temporal event, and estimates their relative time shifts to temporally align them. We design a descriptor derived from the pairwise matching of audio chroma features of UGVs. The descriptor facilitates the definition of a classification threshold for automatic query-by-example event identification. We evaluate the proposed identification and synchronization framework on a database of 263 multi-camera recordings of 48 real-world events and compare it with state-of-the-art methods. Experimental results show the effectiveness of the proposed approach in the presence of various audio degradations.
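As a rough illustration of the chroma-based pairwise matching described above, the following is a minimal sketch assuming librosa for chroma extraction and NumPy cross-correlation; it approximates the general technique and is not the authors' exact descriptor or synchronization method. The hop size, sample rate, and helper names are illustrative assumptions.

```python
# Sketch: estimate the relative time shift between two UGV audio tracks
# by cross-correlating their chroma feature sequences. Illustrative only;
# not the paper's exact descriptor.
import numpy as np
import librosa

HOP = 512  # analysis hop size in samples (assumed parameter)

def chroma(path, sr=22050):
    """Load an audio clip and return its chroma matrix (12 x frames)."""
    y, sr = librosa.load(path, sr=sr)
    return librosa.feature.chroma_stft(y=y, sr=sr, hop_length=HOP), sr

def estimate_shift(path_a, path_b):
    """Estimate the lag (in seconds) of clip B within clip A by summing
    the cross-correlation of each of the 12 chroma bins over time and
    picking the lag with the highest aggregate score."""
    ca, sr = chroma(path_a)
    cb, _ = chroma(path_b)
    score = np.zeros(ca.shape[1] + cb.shape[1] - 1)
    for bin_a, bin_b in zip(ca, cb):  # correlate matching chroma bins
        score += np.correlate(bin_a - bin_a.mean(),
                              bin_b - bin_b.mean(), mode="full")
    lag = np.argmax(score) - (cb.shape[1] - 1)  # peak index -> frame lag
    return lag * HOP / sr, score.max()

# shift, peak = estimate_shift("ugv_a.wav", "ugv_b.wav")  # hypothetical files
```

In the spirit of the paper's query-by-example identification, the peak correlation score could be thresholded to decide whether two recordings capture the same event, while the estimated lag provides the temporal alignment.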

  • Publication date: 2015-05-01