A Mahout Based Image Classification Framework for Very Large Dataset

作者:He, Jun*; Xue, Zhi-Yun; Gao, Ming-Wei; Wu, Hao
来源:International Conference on Cloud Computing and Internet of Things, Changchun, PEOPLES R CHINA, 2014-12-13 To 2014-12-14.
DOI:10.1109/cciot.2014.7062518

摘要

In this paper, we present a distributed computing framework for image classification towards the current challenge of image big data due to enormous streaming image data sources, such as image sharing over online social network and massive video surveillance streams from Ubiquitous cameras all over our daily life. The proposed framework consists of four modules aiming at feature extraction, dimension reduction, bag of feature modeling, and supervised learning respectively. This distributed computing framework is implemented on Hadoop with Mahout support. We apply the framework for classifying whether a person is on calling or not in a surveillance video to verify the correctness and scalability.

全文