Iterative Discovery of Multiple Alternative Clustering Views

作者:Niu Donglin*; Dy Jennifer G; Jordan Michael I
来源:IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(7): 1340-1353.
DOI:10.1109/TPAMI.2013.180

摘要

Complex data can be grouped and interpreted in many different ways. Most existing clustering algorithms, however, only find one clustering solution, and provide little guidance to data analysts who may not be satisfied with that single clustering and may wish to explore alternatives. We introduce a novel approach that provides several clustering solutions to the user for the purposes of exploratory data analysis. Our approach additionally captures the notion that alternative clusterings may reside in different subspaces (or views). We present an algorithm that simultaneously finds these subspaces and the corresponding clusterings. The algorithm is based on an optimization procedure that incorporates terms for cluster quality and novelty relative to previously discovered clustering solutions. We present a range of experiments that compare our approach to alternatives and explore the connections between simultaneous and iterative modes of discovery of multiple clusterings.