ACOSampling: An ant colony optimization-based undersampling method for classifying imbalanced DNA microarray data

Yu, Hualong<sup>*</sup>; Ni, Jun; Zhao, Jing

doi:10.1016/j.neucom.2012.08.018

摘要

In DNA microarray data, class imbalance problem occurs frequently, causing poor prediction performance for minority classes. Moreover, its other features, such as high-dimension, small sample, high noise etc., intensify this damage. In this study, we propose ACOSampling that is a novel undersampling method based on the idea of ant colony optimization (ACO) to address this problem. The algorithm starts with feature selection technology to eliminate noisy genes in data. Then we randomly and repeatedly divided the original training set into two groups: training set and validation set. In each division, one modified ACO algorithm as a variant of our previous work is conducted to filter less informative majority samples and search the corresponding optimal training sample subset. At last, the statistical results from all local optimal training sample subsets are given in the form of frequence list, where each frequence indicates the importance of the corresponding majority sample. We only extracted those high frequency ones and combined them with all minority samples to construct the final balanced training set. We evaluated the method on four benchmark skewed DNA microarray datasets by support vector machine (SVM) classifier, showing that the proposed method outperforms many other sampling approaches, which indicates its superiority.

出版日期2013-2-4
单位哈尔滨工程大学; 江苏科技大学

全文

访问全文

收藏分享被引(65) 浏览

更新时间：2021-07-17 15:07

ACOSampling: An ant colony optimization-based undersampling method for classifying imbalanced DNA microarray data

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友