An improved data characterization method and its application in classification algorithm recommendation

Wang, Guangtao*; Song, Qinbao; Zhu, Xiaoyan

doi:10.1007/s10489-015-0689-3

Abstract

Selecting appropriate classification algorithms for a given data set is important and useful in practice. One of the most challenging issues in algorithm selection is how to characterize different data sets. In earlier work, we extracted the structural information of a data set and used it to characterize that data set. Although such characteristics work well for identifying similar data sets and recommending appropriate classification algorithms, the extraction method can only be applied to binary data sets and its performance is not high. This paper therefore proposes an improved data set characterization method that addresses both problems. To evaluate the effectiveness of the improved method for algorithm recommendation, the unsupervised learning method EM is employed to build the algorithm recommendation model. Extensive experiments with 17 different types of classification algorithms are conducted on 84 public UCI data sets; the results demonstrate the effectiveness of the proposed method.

Publication date: 2015-12
Affiliation: Xi'an Jiaotong University

