A peek into the black box: exploring classifiers by randomization

作者:Henelius Andreas*; Puolamaki Kai; Bostrom Henrik; Asker Lars; Papapetrou Panagiotis
来源:Data Mining and Knowledge Discovery, 2014, 28(5-6): 1503-1529.
DOI:10.1007/s10618-014-0368-8

摘要

Classifiers are often opaque and cannot easily be inspected to gain understanding of which factors are of importance. We propose an efficient iterative algorithm to find the attributes and dependencies used by any classifier when making predictions. The performance and utility of the algorithm is demonstrated on two synthetic and 26 real-world datasets, using 15 commonly used learning algorithms to generate the classifiers. The empirical investigation shows that the novel algorithm is indeed able to find groupings of interacting attributes exploited by the different classifiers. These groupings allow for finding similarities among classifiers for a single dataset as well as for determining the extent to which different classifiers exploit such interactions in general.

  • 出版日期2014-9