摘要

Gene expression data is used to find significant genes related to specific disease, such as lung cancer. These significant genes can be used as biomarkers to diagnose disease, and data mining techniques are useful in finding such biomarkers. Feature selection and classification schemes are extensively used for this purpose. Researchers should test various combinations of data mining schemes to find the best biomarker since there is no ultimate scheme for every case of datasets. Thus, the process is tedious and requires effort. In this study, we propose a software library that finds biomarker genes based on microarray datasets. The proposed library contains procedural steps to identify and test biomarker genes and is implemented as an R library for general use. This library with feature selection algorithm, helps to save time and effort in analysing and combining codes to test their research ideas.

  • 出版日期2016

全文