Mining incomplete data with singleton, subset and concept probabilistic approximations

Clark Patrick G; Grzymala Busse Jerzy W<sup>*</sup>; Rzasa Wojciech

doi:10.1016/j.ins.2014.05.007

摘要

Rough set theory provides a very useful idea of lower and upper approximations for inconsistent data. For incomplete data these approximations are not unique. In this paper we investigate properties of three well-known generalizations of approximations: singleton, subset and concept. These approximations were recently further generalized as to include an additional parameter alpha, interpreted as a probability. In this paper we report novel properties of singleton, subset and concept probabilistic approximations. Additionally, we validated such approximations experimentally. Our main objective was to test which of the singleton, subset and concept probabilistic approximations are the most useful for data mining. Our conclusion is that, for a given incomplete data set, all three approaches should be applied and the best approach should be selected as a result often-fold cross validation. Finally, we conducted experiments on complexity of rule sets and the total number of singleton, subset and concept approximations.

出版日期2014-10-1

全文

访问全文

收藏分享被引(28) 浏览

更新时间：2024-05-05 16:46

Mining incomplete data with singleton, subset and concept probabilistic approximations

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友