Resolving confusion of tongues in statistics and machine learning: A primer for biologists and bioinformaticians

van Iterson Maarten*; van Haagen Herman H H B M; Goeman Jelle J

doi:10.1002/pmic.201100395

Abstract

Bioinformatics is the field where computational methods from various domains have come together for the analysis of biological data. Each domain has introduced its own specific jargon. However, in closely related domains, e.g. machine learning and statistics, both concordant and discordant terminology occurs, and the latter can lead to confusion. This article aims to help resolve the confusion of tongues arising from these two closely related domains, which are frequently used in bioinformatics. We provide a short summary of the most commonly applied machine learning and statistical approaches to data analysis in bioinformatics, i.e. classification and statistical hypothesis testing. We explain differences and similarities in common terminology used in various domains, such as precision, recall, sensitivity and true positive rate. This primer can serve as a guide to the terminology used in these fields.

Publication date: 2012-02
