Analysis of Naive Bayes' assumptions on software fault data: An empirical study

Turhan Burak<sup>*</sup>; Bener Ayse

doi:10.1016/j.datak.2008.10.005

Abstract

Software defect prediction is important for reducing test times by allocating testing resources effectively. In predicting software defects, Naive Bayes outperforms a wide range of other methods. However, Naive Bayes assumes the 'independence' and 'equal importance' of attributes. In this work, we analyze these assumptions using public software defect data from NASA. Our analysis shows that the independence assumption is not harmful for software defect data with PCA pre-processing. Our results also indicate that assigning weights to static code attributes may increase prediction performance significantly, while removing the need for feature subset selection.
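To make the 'equal importance' assumption concrete, the following is a minimal sketch of a Gaussian Naive Bayes classifier in which each attribute's log-likelihood is multiplied by a weight. The abstract does not state which weighting scheme the authors use, so the multiplicative log-likelihood weighting here is an assumption, as are the function names, the two toy attributes (lines of code and cyclomatic complexity), and the toy values. With all weights equal to 1 the sketch reduces to standard Naive Bayes.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y):
    """Estimate a class prior and per-attribute (mean, variance) for each class."""
    rows_by_class = defaultdict(list)
    for row, label in zip(X, y):
        rows_by_class[label].append(row)
    model = {}
    for label, rows in rows_by_class.items():
        stats = []
        for column in zip(*rows):
            mean = sum(column) / len(column)
            # Small constant keeps the variance strictly positive.
            var = sum((v - mean) ** 2 for v in column) / len(column) + 1e-9
            stats.append((mean, var))
        model[label] = (len(rows) / len(X), stats)
    return model


def log_posterior(model, x, weights=None):
    """Unnormalized log-posterior per class with per-attribute weights.

    weights=None means every attribute gets weight 1.0 (standard Naive Bayes).
    The weighting scheme is an illustrative assumption, not the paper's method.
    """
    if weights is None:
        weights = [1.0] * len(x)
    scores = {}
    for label, (prior, stats) in model.items():
        score = math.log(prior)
        for w, v, (mean, var) in zip(weights, x, stats):
            log_lik = -0.5 * math.log(2 * math.pi * var) - (v - mean) ** 2 / (2 * var)
            score += w * log_lik
        scores[label] = score
    return scores


def predict(model, x, weights=None):
    """Return the class with the highest (weighted) log-posterior."""
    scores = log_posterior(model, x, weights)
    return max(scores, key=scores.get)


# Hypothetical toy data: [lines of code, cyclomatic complexity]; 1 = defective.
X = [[10, 1], [12, 2], [11, 1], [50, 9], [55, 8], [52, 10]]
y = [0, 0, 0, 1, 1, 1]
model = fit_gaussian_nb(X, y)

equal_importance = predict(model, [53, 1])
complexity_only = predict(model, [53, 1], weights=[0.0, 1.0])
```

In this toy setting a large but simple module is labeled defective under equal weights because of its size, whereas giving the size attribute zero weight leaves the decision to complexity alone. A zero weight acts like removing the attribute, which is one way to read the abstract's remark that weighting can remove the need for feature subset selection.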

Publication date: 2009-02

