A clustering-based feature selection via feature separability

Jiang, Shengyi; Wang, Lianxi<sup>*</sup>

doi:10.3233/JIFS-169022

摘要

With the extensive increase of the amount of data, such as text categorization, genomic microarray data, bioinformatics and digital images, there are more and more challenges in feature selection. Recently, feature selection has been widely studied in supervised learning, but there is significantly less work in unsupervised learning because of the absence of class information and explicit search criteria. In this work, we introduce a new measure to assess the importance of features in terms of feature separability. A clustering-based feature selection algorithm is then introduced to conduct the feature selection. The proposed algorithm with nearly linear time complexity selects final feature subset through a ranking procedure based on the separabilities of features and it is applicable to datasets of mixed nature. Experimental results on UCI datasets show that our method, by retaining relevant features, can obtain similar or even better results of classification and clustering for most datasets, and it outperforms other traditional supervised and unsupervised feature selection methods in terms of dimensionality reduction and classification accuracy.

出版日期2016
单位广东外语外贸大学; 中山大学

全文

访问全文

收藏分享被引(1) 浏览

更新时间：2021-08-05 21:22

A clustering-based feature selection via feature separability

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友