摘要

In this paper we present results of experiments on 166 incomplete data sets using three probabilistic approximations: lower, middle, and upper. Two interpretations of missing attribute values were used: lost and %26quot;do not care%26quot; conditions. Our main objective was to select the best combination of an approximation and a missing attribute interpretation. We conclude that the best approach depends on the data set. The additional objective of our research was to study the average number of distinct probabilities associated with characteristic sets for all concepts of the data set. This number is much larger for data sets with %26quot;do not care%26quot; conditions than with data sets with lost values. Therefore, for data sets with %26quot;do not care%26quot; conditions the number of probabilistic approximations is also larger.

  • 出版日期2013

全文