A multimodal temporal panorama approach for moving vehicle detection, reconstruction and classification

Wang Tao<sup>*</sup>; Zhu Zhigang; Taylor Clark N

doi:10.1016/j.cviu.2013.02.011

Abstract

Moving vehicle detection and classification using multimodal data is challenging: data collection, audio-visual alignment, data labeling and feature selection must all be carried out in uncontrolled environments with occlusions, motion blur, varying image resolutions and perspective distortions. In this work, we propose an effective multimodal temporal panorama (MTP) approach for moving vehicle detection and classification using a novel long-range audio-visual sensing system. A new audio-visual vehicle (AVV) dataset is created, which features automatic vehicle detection and audio-visual alignment, accurate vehicle extraction and reconstruction, and efficient data labeling. In particular, the visual images of vehicles are reconstructed once detected in order to remove most of the occlusions, motion blur and variations in perspective view. Multimodal audio-visual features are extracted, including global geometric features (aspect ratios, profiles), local structure features (HOGs), as well as various audio features (MFCCs, etc.). Using SVMs with radial basis function kernels, the effectiveness of integrating these multimodal features is thoroughly and systematically studied. The concept of MTP is not necessarily limited to visual, motion and audio modalities; it could also be applicable to other sensing modalities that can obtain data in the temporal domain.

Publication date: 2013-12

Last updated: 2017-06-29 15:59
