A LoOP based outlier detection method for high dimensional fuzzy data set

Jahromi Alireza Fakharzadeh<sup>*</sup>; Zarei Fateme

doi:10.3233/JIFS-151447

摘要

Regarding the magnificent ability of the fuzzy logic in explanation and analyzing the real phenomena, using fuzzy data is growing rapidly. But the tools for doing data mining in a fuzzy data set; especially in high dimension, are not sufficient. Considering the remarkable ability of the LoOP (Local Outlier Probabilities) method in outlier detecting in high dimensional crisp data set, like supplying for each data object, determining outlier probability as scar which is easily explainable competitor on various synthetic and real world data sets, the aim of this article is to present an extended version of this method for using in high dimensional fuzzy database. In order to provide a completely fuzzy strategy, first we propose a new approach to assigning distance between fuzzy high dimension data based on Hausdorff and Vertex distance. After proving that this distance measuring is a metric, since LoOP is a distance based method, we redefined the LoOP algorithm step by step based on the new meter. Then we allocate an outlier score in the range of [0,1] to each data based on LoOP idea, to identify the outliers. The presented numerical tests show the efficiency of the new method, for random cases of knowing and unknowing the outlier and a real case. The results shows also the stability and robustness of the presented method more that its accuracy.

出版日期2017

全文

访问全文

收藏分享被引浏览

更新时间：2021-01-21 21:38

A LoOP based outlier detection method for high dimensional fuzzy data set

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友