Unsupervised corpus distillation for represented indicator measurement on focus species detection

Wei Chih Hsuan; Kao Hung Yu<sup>*</sup>

doi:10.1504/IJDMB.2013.056615

登录

免费注册

赞收藏引用

科研之友

微信

新浪微博

Facebook

分享链接

Unsupervised corpus distillation for represented indicator measurement on focus species detection

作者：Wei Chih Hsuan; Kao Hung Yu^*

来源：International Journal of Data Mining and Bioinformatics, 2013, 8(4): 413-426.

DOI：10.1504/IJDMB.2013.056615

摘要

The gene ambiguity with the highest dimension is the species with which an entity is associated in biomedical text mining. Furthermore, one of the bottlenecks in gene normalisation is focus species detection. This study presents a method which is robust for all types of articles, particularly those without explicit species mentions. Since our method requires a training corpus, we developed an iterative distillation method to extend the corpus. Unsupervised corpus is therefore helpful for the detection of focus species. In experiments, the proposed method achieved a high accuracy of 85.64% and 84.32% in datasets with and without species mentions respectively.

出版日期2013

全文

访问全文

收藏分享被引(1) 浏览

更新时间：2019-05-20 19:28

相似论文
引用论文
参考文献

产品服务

科研之友科研之友机构版科创云

站内浏览

科研成果科研人员科研机构

服务支持

帮助中心隐私政策服务条款

联系方式

在线客服：【立即咨询】客户热线：400-1616-289 电子邮箱：support@scholarmate.com

微信公众号