A network flow model for biclustering via optimal re-ordering of data matrices

作者:DiMaggio Peter A Jr; McAllister Scott R; Floudas Christodoulos A*; Feng Xiao Jiang; Rabinowitz Joshua D; Rabitz Herschel A
来源:Journal of Global Optimization, 2010, 47(3): 343-354.
DOI:10.1007/s10898-008-9349-z

摘要

The analysis of large-scale data sets using clustering techniques arises in many different disciplines and has important applications. Most traditional clustering techniques require heuristic methods for finding good solutions and produce suboptimal clusters as a result. In this article, we present a rigorous biclustering approach, OREO, which is based on the Optimal RE-Ordering of the rows and columns of a data matrix. The physical permutations of the rows and columns are accomplished via a network flow model according to a given objective function. This optimal re-ordering model is used in an iterative framework where cluster boundaries in one dimension are used to partition and re-order the other dimensions of the corresponding submatrices. The performance of OREO is demonstrated on metabolite concentration data to validate the ability of the proposed method and compare it to existing clustering methods.

  • 出版日期2010-7