Abstract

Recently, the problem of imbalanced data classification has drawn a significant amount of interest from academia, industry, and government funding agencies. The fundamental issue is that imbalanced data degrades the performance of most standard learning algorithms, which assume or expect a balanced class distribution or equal misclassification costs. Boosting is a meta-technique applicable to most learning algorithms. This paper reviews boosting methods for imbalanced data classification, denoted IDBoosting (Imbalanced-data boosting), in which conventional learning algorithms can be integrated without further modification. The main focus is on the intrinsic mechanisms rather than implementation details. Existing methods are catalogued, and each category is described in detail in terms of design criteria, typical algorithms, and performance analysis. The essence of two IDBoosting methods is examined and supported by experimental evidence, and useful reference points for future research are given.
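To illustrate the general idea of plugging an unmodified base learner into a boosting loop that accounts for class imbalance, the following is a minimal sketch only: the AdaBoost-style reweighting, the class-frequency-based initial weights, and the use of decision stumps from scikit-learn are illustrative assumptions, not the specific IDBoosting algorithms surveyed in this paper.

# Minimal sketch: cost-sensitive boosting with an unmodified base learner.
# The initial sample weights are set inversely proportional to class frequency
# (an illustrative choice); the update rule is standard AdaBoost with y in {-1, +1}.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def id_boost(X, y, n_rounds=50):
    counts = {c: np.sum(y == c) for c in np.unique(y)}
    w = np.array([1.0 / counts[c] for c in y])   # minority class starts with larger weight
    w /= w.sum()

    learners, alphas = [], []
    for _ in range(n_rounds):
        stump = DecisionTreeClassifier(max_depth=1)
        stump.fit(X, y, sample_weight=w)         # base learner used as-is, no modification
        pred = stump.predict(X)
        err = np.sum(w[pred != y])
        if err == 0 or err >= 0.5:                # stop when no useful weak learner remains
            break
        alpha = 0.5 * np.log((1 - err) / err)     # standard AdaBoost learner weight
        w *= np.exp(-alpha * y * pred)            # up-weight misclassified examples
        w /= w.sum()
        learners.append(stump)
        alphas.append(alpha)
    return learners, alphas

def id_boost_predict(learners, alphas, X):
    votes = sum(a * clf.predict(X) for a, clf in zip(alphas, learners))
    return np.sign(votes)

The point of the sketch is only that the base learner is treated as a black box: the imbalance is handled entirely through the sample weights, which is the sense in which conventional learning algorithms can be integrated without further modification.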

Full text