Fault-prone module detection using large-scale text features based on spam filtering

Hata Hideaki; Mizuno Osamu<sup>*</sup>; Kikuno Tohru

doi:10.1007/s10664-009-9117-9

摘要

This paper proposes an approach using large-scale text features for fault-prone module detection inspired by spam filtering. The number of every text feature in the source code of a module is counted and used as data for training detection models. In this paper, we prepared a naive Bayes classifier and a logistic regression model as detection models. To show the effectiveness of our approaches, we conducted experiments with five open source projects and compared them with a well-known metrics set, thereby achieving higher detection results. The results imply that large-scale text features are useful in constructing practical detection models, and measuring sophisticated metrics is not always necessary for detecting fault-prone modules.

出版日期2010-4

全文

访问全文

收藏分享被引(5) 浏览

更新时间：2019-11-25 21:12

Fault-prone module detection using large-scale text features based on spam filtering

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友