Multi Label Learning for Prediction of Human Protein Subcellular Localizations

Zhu, Lin; Yang, Jie; Shen, Hong Bin<sup>*</sup>

doi:10.1007/s10930-009-9205-0

摘要

Predicting protein subcellular locations has attracted much attention in the past decade. However, one of the most challenging problems is that many proteins were found simultaneously existing in, or moving between, two or more different cell components in a eukaryotic cell. Seldom previous predictors were able to deal with such multiplex proteins although they have extremely important implications in future drug discovery in terms of their specific subcellular targeting. Approximately 20% of the human proteome consists of such multiplex proteins with multiple sample labels. In order to efficiently handle such multiplex human proteins, we have developed a novel multi-label (ML) learning and prediction framework called ML-PLoc, which decomposes the multi-label prediction problem into multiple independent binary classification problems. ML-PLoc is constructed based on support vector machine (SVM) and sequential evolution information. Experimental results show that ML-PLoc can achieve an overall accuracy 64.6% and recall ratio 67.2% on a benchmark dataset consisting of 14 human subcellular locations, and is very powerful for dealing with multiplex proteins. The current approach represents a new strategy to deal with the multi-label biological problems. ML-PLoc software is freely available for academic use at: http://www.csbio.sjtu.edu.cn/bioinf/ML-PLoc.

出版日期2009-12
单位上海交通大学

全文

访问全文

收藏分享被引(26) 浏览

更新时间：2019-09-13 13:12

Multi Label Learning for Prediction of Human Protein Subcellular Localizations

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友