A precise ranking method for outlier detection

Ha Jihyun; Seok Seulgi; Lee Jong Seok<sup>*</sup>

doi:10.1016/j.ins.2015.06.030

摘要

Recent research studies on outlier detection have focused on examining the nearest neighbor structure of a data object to measure its outlierness degree. This leads to two weaknesses: the size of nearest neighborhood, which should be predetermined, greatly affects the final detection results, and the outlierness scores produced by existing methods are not sufficiently diverse to allow precise ranking of outliers. To overcome these problems, in this research paper, a novel outlier detection method involving an iterative random sampling procedure is proposed. The proposed method is inspired by the simple notion that outlying objects are less easily selected than inlying objects in blind random sampling, and therefore, more inlierness scores are given to selected objects. We develop a new measure called the observability factor (OF) by utilizing this idea. In order to offer a heuristic guideline to determine the best size of nearest neighborhood, we additionally propose using the entropy of OF scores. An intensive numerical evaluation based on various synthetic and real-world datasets shows the superiority and effectiveness of the proposed method.

出版日期2015-12-10

全文

访问全文

收藏分享被引(39) 浏览

更新时间：2024-04-16 02:58

A precise ranking method for outlier detection

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友