Data clustering of road transportation information system based on attribute dimension partition and MapReduce

Authors: Zheng, Xiao-Feng; Xu, Jian-Min; Lu, Kai
Source: Journal of South China University of Technology (Natural Science Edition), 2014, 42(8): 122-128, 135.
DOI: 10.3969/j.issn.1000-565X.2014.08.019

Abstract

To address the shortcomings of DBSCAN (Density-Based Spatial Clustering of Applications with Noise), this paper introduces the concept of attribute dimension partition, which combines domain knowledge with the idea of partitioning the dataset. It then presents the principles of cluster merging and pruning computation. Finally, it proposes an optimized DBSCAN method built on MapReduce, the cloud computing programming model, and verifies the method by clustering data from a real road transport information system. The results show that partitioning the dataset facilitates concurrent computation and that the proposed optimization method is superior to common statistical methods.
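The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that records are partitioned by a single categorical attribute (a hypothetical vehicle type), that a textbook DBSCAN runs independently on each partition, and that the merge step is reduced to giving each local cluster a globally unique label. The paper's cluster merging and pruning rules are not reproduced, and the map and reduce phases are simulated in plain Python rather than on a real MapReduce framework. All function names and parameters are illustrative assumptions.

```python
from collections import defaultdict
from math import dist

NOISE = -1


def dbscan(points, eps, min_pts):
    """Textbook DBSCAN; returns one label per point (NOISE for outliers)."""
    labels = [None] * len(points)
    cluster = -1

    def neighbors(i):
        # Brute-force eps-neighborhood; includes the point itself.
        return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]

    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE
            continue
        cluster += 1
        labels[i] = cluster
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # former noise becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_pts:  # only core points expand the cluster
                queue.extend(nbrs)
    return labels


def map_phase(records):
    """Emit (attribute, (record_id, point)) pairs and group them by attribute."""
    partitions = defaultdict(list)
    for rec_id, (attr, point) in enumerate(records):
        partitions[attr].append((rec_id, point))
    return partitions


def reduce_phase(partitions, eps, min_pts):
    """Cluster each partition on its own; label clusters as (attribute, local_id)."""
    result = {}
    for attr, items in partitions.items():
        ids, pts = zip(*items)
        for rec_id, label in zip(ids, dbscan(list(pts), eps, min_pts)):
            result[rec_id] = NOISE if label == NOISE else (attr, label)
    return result


def cluster_by_attribute(records, eps, min_pts):
    """Run the simulated map and reduce phases end to end."""
    return reduce_phase(map_phase(records), eps, min_pts)


if __name__ == "__main__":
    # Hypothetical records: (vehicle type, (x, y) position).
    records = [
        ("truck", (0.0, 0.0)),
        ("truck", (0.0, 1.0)),
        ("truck", (1.0, 0.0)),
        ("bus", (0.0, 0.5)),
        ("bus", (10.0, 10.0)),
    ]
    print(cluster_by_attribute(records, eps=1.5, min_pts=3))
```

Because each partition is processed without reference to the others, the per-partition calls could in principle run concurrently. A partition along a numeric attribute would additionally need a real merge step for clusters that straddle partition boundaries, which this sketch deliberately leaves out.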

Full text