Multinomial event naive Bayesian modeling for SAGE data classification

作者:Jin Xin; Zhou Wengang; Bie Rongfang*
来源:Computational Statistics, 2007, 22(1): 133-143.
DOI:10.1007/s00180-007-0029-0

摘要

Recently developed SAGE technology enables us to simultaneously quantify the expression levels of thousands of genes in a population of cells. SAGE data is helpful in classification of different types of cancers. However, one main challenge in this task is the availability of a smaller number of samples compared to huge number of genes, many of which are irrelevant for classification. Another main challenge is that there is a lack of appropriate statistical methods that consider the specific properties of SAGE data. We propose an efficient solution by selecting relevant genes by information gain and building a multinomial event model for SAGE data. Promising results, in terms of accuracy, were obtained for the model proposed.

全文