A rank-based transcriptional signature for predicting relapse risk of stage II colorectal cancer identified with proper data sources

作者:Zhao, Wenyuan; Chen, Beibei; Guo, Xin; Wang, Ruiping; Chang, Zhiqiang; Dong, Yu; Song, Kai; Wang, Wen; Qi, Lishuang; Gu, Yunyan; Wang, Chenguang; Yang, Da*; Guo, Zheng*
来源:Oncotarget, 2016, 7(14): 19060-19071.
DOI:10.18632/oncotarget.7956

摘要

The irreproducibility problem seriously hinders the studies on transcriptional signatures for predicting relapse risk of early stage colorectal cancer (CRC) patients. Through reviewing recently published 34 literatures for the development of CRC prognostic signatures based on gene expression profiles, we revealed a surprising phenomenon that 33 of these studies analyzed CRC samples with and without adjuvant chemotherapy together in the training and/or validation datasets. This data misuse problem could be partially attributed to the unclear and incomplete data annotation in public data sources. Furthermore, all the signatures proposed by these studies were based on risk scores summarized from gene expression levels, which are sensitive to experimental batch effects and risk compositions of the samples analyzed together. To avoid the above-mentioned problems, we carefully selected three qualified large datasets to develop and validate a signature consisting of three pairs of genes. The within-sample relative expression orderings of these gene pairs could robustly predict relapse risk of stage II CRC samples assessed in different laboratories. The transcriptional and functional analyses provided clear evidence that the high risk patients predicted by the proposed signature represent patients with micro-metastases.