An information-based network approach for protein classification

Authors: Wan, Xiaogeng*; Zhao, Xin*; Yau, Stephen S. T.*
Source: PLoS ONE, 2017, 12(3): e0174386.
DOI: 10.1371/journal.pone.0174386

Abstract

Protein classification is one of the critical problems in bioinformatics. Early studies classified proteins using geometric distances and phylogenetic trees, which represent the classification as a binary tree. In this paper, we propose a new protein classification method that uses information theory and network theory to classify the multivariate relationships among proteins. The protein universe is modeled as an undirected network, in which proteins are classified according to their connections. Our method is unsupervised, multivariate, and alignment-free, and it can be applied to the classification of both protein sequences and structures. Nine examples are used to demonstrate the efficiency of the new method.
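
The abstract does not specify the information measure or the rule for drawing connections, so the following is only a minimal sketch of the general idea (an information-theoretic, alignment-free similarity used to build an undirected network whose connected groups serve as classes), not the authors' algorithm. Everything in it is an assumption made for illustration: the k-mer frequency representation, the Jensen-Shannon divergence as the information measure, the fixed `threshold`, and the function names `kmer_distribution`, `js_divergence`, and `classify`. The input strings are toy data, not real proteins.

```python
from collections import Counter
from itertools import combinations
from math import log2


def kmer_distribution(seq, k=2):
    """Relative frequencies of overlapping k-mers (an alignment-free summary)."""
    if len(seq) < k:
        raise ValueError("sequence is shorter than k")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence in bits; 0 for identical, 1 for disjoint support."""
    div = 0.0
    for key in set(p) | set(q):
        pi, qi = p.get(key, 0.0), q.get(key, 0.0)
        mi = (pi + qi) / 2
        if pi > 0:
            div += 0.5 * pi * log2(pi / mi)
        if qi > 0:
            div += 0.5 * qi * log2(qi / mi)
    return div


def classify(sequences, k=2, threshold=0.5):
    """Link sequences whose divergence is below `threshold` (assumed rule)
    and return the connected components of the resulting undirected network."""
    names = list(sequences)
    dists = {name: kmer_distribution(sequences[name], k) for name in names}
    parent = {name: name for name in names}  # union-find forest

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in combinations(names, 2):
        if js_divergence(dists[a], dists[b]) < threshold:
            parent[find(a)] = find(b)  # undirected edge: merge the two groups

    groups = {}
    for name in names:
        groups.setdefault(find(name), set()).add(name)
    return list(groups.values())


if __name__ == "__main__":
    toy = {"a": "ACACACAC", "b": "ACACAC", "c": "GTGTGTGT"}  # toy strings
    print(classify(toy))
```

In this sketch no alignment or labels are needed, which mirrors the "unsupervised" and "alignment-free" properties claimed in the abstract. The pairwise threshold rule is a simplification, however, and does not capture the multivariate relationships the paper refers to.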