摘要

Many clustering validity functions have been proposed, especially those based on the geometrical structure of data set, such as Dunn';s index and Xie-Beni index. Xie-Beni index decreases with the number of partitions increasing. It is difficult to choose the optimal partition of data when the number of clusters is large. From the point of view of the compactness and the separation of clustering, a novel clustering validity function is proposed, which is based on the improved Huber Γ statistic combined with the separation of clustering. The function has the only maximum with the number of clusters increasing. The experiment indicates that the function is simply, precise and robust, can be used as the optimal index for choosing the optimal partition of data.

全文