摘要

Gene networks are beneficial to identify functional genes that are highly relevant to clinical outcomes. Most of the current methods require information about the interaction of genes or proteins to construct genetic network connection. However, the conclusion of these methods may be bias because of the current incompleteness of human interactome. In this paper, we propose an efficient strategy to use gene expression data and gene mutation data for identifying cancer-related key genes based on graph entropy (iKGGE). Firstly, we construct a gene network using only gene expression data based on the sparse inverse covariance matrix, then, cluster genes use the algorithm of parallel maximal cliques for quickly obtaining a series of subgraphs, and at last, we introduce a novel metric that combine graph entropy and the influence of upstream gene mutations information to measure the impact factors of genes. Testing of the three available cancer datasets shows that our strategy can effectively extract key genes that may play distinct roles in tumorigenesis, and the cancer patient risk groups are well predicted based on key genes.