An empirical study for software change prediction using imbalanced data

Malhotra Ruchika; Khanna Megha

doi:10.1007/s10664-016-9488-7

Abstract

Software change prediction is crucial for planning resource allocation efficiently during the testing and maintenance phases of software. Moreover, correctly identifying change-prone classes in the early phases of the software development life cycle helps in developing cost-effective, good-quality, and maintainable software. An effective software change prediction model should recognize both change-prone and not change-prone classes with equally high accuracy. In practice this is often not the case, because software practitioners have to deal with imbalanced data sets in which instances of one class far outnumber instances of the other. In such a scenario, the minority class is predicted with low accuracy, leading to strategic losses. This study evaluates a number of techniques for handling imbalanced data sets, using various data sampling methods and MetaCost learners on six open-source data sets. The results advocate the use of the resample with replacement sampling method for effective imbalanced learning.
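As a rough illustration of the sampling idea the abstract names, the sketch below draws instances with replacement until both classes are equally represented. It is an assumption about what such a method can look like, not the procedure used in the study: the function name `resample_with_replacement`, the equal per-class target, and the toy data are all hypothetical.

```python
import random
from collections import Counter


def resample_with_replacement(rows, labels, seed=0):
    """Return a class-balanced sample drawn with replacement.

    Hypothetical helper: the output has roughly the same size as the
    input, with an equal number of instances per class.
    """
    rng = random.Random(seed)

    # Group the instances by their class label.
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)

    # Assumed target: split the original size evenly across classes.
    per_class = len(rows) // len(by_class)

    new_rows, new_labels = [], []
    for label, members in by_class.items():
        # random.choices samples with replacement, so a minority
        # instance may appear several times in the result.
        new_rows.extend(rng.choices(members, k=per_class))
        new_labels.extend([label] * per_class)
    return new_rows, new_labels


# Toy data (made up): 2 change-prone classes and 8 not change-prone ones.
rows = [[i] for i in range(10)]
labels = ["changed"] * 2 + ["unchanged"] * 8

balanced_rows, balanced_labels = resample_with_replacement(rows, labels)
print(Counter(balanced_labels))
```

Sampling of this kind should be applied to the training split only, so that duplicated minority instances do not leak into the data used for evaluation.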

Publication date: 2017-12

Cited by: 44 (page last updated 2024-04-05 22:25)
