摘要

Sequence tag count-based gene expression analysis is potent for the identification of candidate genes relevant to the cancerous phenotype. With the public availability of count-based data, the computational approaches for differentially expressed genes, which are mainly based on Binomial or beta-Binomial distribution, become practical and important in cancer biology. It remains a permanent need to select a Proper statistical model for these methods. In this Study, we developed a novel Bayesian algorithm-based method, Electronic Differential Gene Expression Screener (EDGES), in which a statistical model was determined by geometric averaging of 12 common housekeeping genes. EDGES identified a set of differentially expressed genes in king, breast and colorectal cancers by using publically available Serial Analysis of Gene Expression (SAGE) and Expressed Sequence Tag (EST data). Gene expression microarray analysis and quantitative reverse transcription real-time PCR demonstrated the effectiveness of this procedure. We conclude that current normalization of calibrators provides a new insight into count-based digital subtraction in cancer research.