Non-binarized high attribute dimensional sparse continuous data clustering algorithm using principle of granularity

Jie, Zhao; Zhen-Ning, Dong; Sha-Qing, Zhang; Wen-Hong, Wei; Nan-Feng, Xiao

doi:10.4156/jdcta.vol5.issue5.6

摘要

Currently a majority of high attribute dimensional sparse clustering algorithms can only handle binarized data, thresholds are set subjectively and lack of evaluation method for clustering results, which brings great limits to applications. To solve these problems, this paper proposes a clustering algorithm based on principle of granularity. Considering the characteristic of high attribute dimensional sparse continuous data, dimensional similarity threshold is designed without transforming continuous data to binarized data. Then dimensional equivalence granules are sought discontinuously according to sampled dimensional similarity thresholds. Then a new method is designed to calculate the sparse similarity, and a re-clustering model based on indiscernibility degree is designed to refine the result, so the algorithm gains noise-immune ability. The last but not the least a new clustering quality evaluation model is proposed. The experimental results on both real world and synthesis datasets demonstrate that our algorithm is more efficient than the existing ones, and the clustering results reflect the data characteristics more precisely.

出版日期2011-5
单位东莞理工学院; 华南理工大学

全文

访问全文

收藏分享被引浏览

更新时间：2023-06-28 08:01

Non-binarized high attribute dimensional sparse continuous data clustering algorithm using principle of granularity

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友