An approach to improve kernel-based Protein-Protein Interaction extraction by learning from large-scale network data

Li Lishuang<sup>*</sup>; Guo Rui; Jiang Zhenchao; Huang Degen

doi:10.1016/j.ymeth.2015.03.026

摘要

Protein-Protein Interaction extraction (PPIe) from biomedical literatures is an important task in biomedical text mining and has achieved desirable results on the annotated datasets. However, the traditional machine learning methods on PPIe suffer badly from vocabulary gap and data sparseness, which weakens classification performance. In this work, an approach capturing external information from the web-based data is introduced to address these problems and boost the existing methods. The approach involves three kinds of word representation techniques: distributed representation, vector clustering and Brown clusters. Experimental results show that our method outperforms the state-of-the-art methods on five publicly available corpora. Our code and data are available at: http://chaoslog.com/improving-kernel-based-protein-protein-interaction-extraction-by-unsupervised-word-representation-codes-and-data.html.

出版日期2015-7-15
单位大连理工大学

全文

访问全文

收藏分享被引(7) 浏览

更新时间：2019-02-20 05:56

An approach to improve kernel-based Protein-Protein Interaction extraction by learning from large-scale network data

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友