Approximate Clustering of Time-Series Datasets using k-Modes Partitioning

Aghabozorgi Saeed<sup>*</sup>; Wah Teh Ying

Abstract

Data in various systems, such as those in finance, healthcare, and business, are stored as time series, and interest in time-series mining in these areas has surged accordingly. Clustering is performed as a pre-processing or exploratory step in many data mining tasks. Time-series data sets are often too large to fit in main memory for clustering, and dimensionality reduction is a common solution. However, the cost of reduction is relatively high: information is discarded in the process, which leads to low-quality clustering. In this paper, we propose a new approach that uses discretization to improve the accuracy of approximate clustering on dimensionality-reduced time series. A new distance measure is first introduced. Thereafter, the partitional algorithm that best matches the representation method is proposed.
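To make the pipeline above concrete, the following is a minimal sketch of discretizing time series into symbol strings and clustering them with k-modes. It is not the authors' method: the abstract does not specify the representation or the new distance measure, so the sketch assumes a SAX-style discretization (z-normalization, piecewise averaging, equiprobable Gaussian breakpoints) and plain Hamming distance as stand-ins. The function names `paa`, `discretize`, `hamming`, and `k_modes` are hypothetical.

```python
import random
from collections import Counter
from statistics import NormalDist, mean, pstdev


def paa(series, n_segments):
    """Piecewise aggregate approximation: mean of each of n_segments chunks.

    Assumes len(series) >= n_segments so that no chunk is empty.
    """
    n = len(series)
    return [
        mean(series[i * n // n_segments:(i + 1) * n // n_segments])
        for i in range(n_segments)
    ]


def discretize(series, n_segments=4, alphabet="abcd"):
    """SAX-style word (an assumption, not the paper's representation)."""
    mu, sigma = mean(series), pstdev(series)
    z = [(x - mu) / sigma if sigma > 0 else 0.0 for x in series]
    k = len(alphabet)
    # Breakpoints that split the standard normal into k equiprobable bins.
    cuts = [NormalDist().inv_cdf(i / k) for i in range(1, k)]
    return "".join(alphabet[sum(v > c for c in cuts)] for v in paa(z, n_segments))


def hamming(a, b):
    """Number of mismatching positions (stand-in for the paper's measure)."""
    return sum(x != y for x, y in zip(a, b))


def k_modes(words, k, n_iter=20, seed=0):
    """Plain k-modes on equal-length words; needs at least k distinct words."""
    rng = random.Random(seed)
    modes = rng.sample(sorted(set(words)), k)
    labels = []
    for _ in range(n_iter):
        # Assignment step: each word goes to its nearest mode.
        labels = [min(range(k), key=lambda j: hamming(w, modes[j])) for w in words]
        # Update step: the new mode is the most frequent symbol per position.
        new_modes = []
        for j in range(k):
            members = [w for w, lab in zip(words, labels) if lab == j]
            if not members:
                new_modes.append(modes[j])  # keep the old mode for an empty cluster
                continue
            new_modes.append(
                "".join(Counter(col).most_common(1)[0][0] for col in zip(*members))
            )
        if new_modes == modes:
            break
        modes = new_modes
    return labels, modes


# Usage: discretize a few rising and falling series, then cluster the words.
rising = [[0, 1, 2, 3, 4, 5, 6, 7], [1, 2, 3, 4, 5, 6, 7, 9]]
falling = [list(reversed(s)) for s in rising]
words = [discretize(s) for s in rising + falling]
labels, modes = k_modes(words, k=2)
```

Because clustering operates on short symbol strings rather than raw series, only the discretized words need to be held in memory; the price is the approximation error that the paper's distance measure is meant to reduce.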

Publication date: 2015-1


Last updated: 2017-04-25 19:28
