A novel approach to feature extraction from classification models based on information gene pairs

Li J<sup>*</sup>; Tang X; Liu J; Huang J; Wang Y

doi:10.1016/j.patcog.2007.11.019

Abstract

Microarray experiments are now performed in many laboratories, resulting in the rapid accumulation of microarray data in public repositories. A major challenge in analyzing microarray data is how to extract and select effective features for accurate cancer classification. Here we introduce a new feature extraction and selection method based on information gene pairs that change significantly between different tissue samples. Experimental results on five public microarray data sets demonstrate that the feature subset selected by the proposed method performs well and achieves higher classification accuracy with several classifiers. We extensively compare the features selected by the proposed method with those selected by other methods, using different evaluation methods and classifiers. The results confirm that the proposed method performs as well as other methods on the acute lymphoblastic-acute myeloid leukemia, adenocarcinoma, and breast cancer data sets while using fewer information genes, and that it significantly improves classification accuracy on the colon cancer and diffuse large B-cell lymphoma data sets.

Publication date: 2008-6
Affiliation: Harbin Institute of Technology


