摘要

The potential applications and challenges of uncertain data mining have recently attracted interests from researchers. Most uncertain data mining algorithms consider aleatory (random) uncertainty of data, i.e. these algorithms require that exact probability distributions or confidence values are attached to uncertain data. However, knowledge about uncertainty may be incomplete in the case of epistemic (incomplete) uncertainty of data, i.e. probabilities of uncertain data may be imprecise, coarse, or missing in some applications. The paper focuses on uncertain data which miss probabilities, specially, value-uncertain discrete objects which miss probabilities (for short uncertain objects). On the other hand, classification is one of the most important tasks in data mining. But, to the best of our knowledge, there is no method to learn Naive Bayesian classifier from uncertain objects. So the paper studies Naive Bayesian classification of uncertain objects. Firstly, the paper defines interval probabilities of uncertain objects from probabilistic cardinality point of view, and bridges the gap between uncertain objects and the theory of interval probability by proving that interval probabilities are F-probabilities. Secondly, based on the theory of interval probability, the paper defines conditional interval probabilities including the intuitive concept and the canonical concept, and the conditional independence of the intuitive concept. Further, the paper gives a formula to effectively compute the intuitive concept. Thirdly, the paper presents a Naive Bayesian classifier with interval probability parameters which can handle both uncertain objects and certain objects. Finally, experiments with uncertain objects based on UCI data show satisfactory performances.