A tree-based algorithm for attribute selection

作者:Baranauskas Jose Augusto*; Netto Oscar Picchi; Nozawa Sergio Ricardo; Macedo Alessandra Alaniz
来源:Applied Intelligence, 2018, 48(4): 821-833.
DOI:10.1007/s10489-017-1008-y

摘要

This paper presents an improved version of a decision tree-based filter algorithm for attribute selection. This algorithm can be seen as a pre-processing step of induction algorithms of machine learning and data mining tasks. The filter was evaluated based on thirty medical datasets considering its execution time, data compression ability and AUC (Area Under ROC Curve) performance. On average, our filter was faster than Relief-F but slower than both CFS and Gain Ratio. However for low-density (high-dimensional) datasets, our approach selected less than 2% of all attributes at the same time that it did not produce performance degradation during its further evaluation based on five different machine learning algorithms.

  • 出版日期2018-4