• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Software defect prediction based on
cost-sensitive support vector machine
 

REN Shengbing,LIAO Xiangdang   

  1. (School of Software,Central South University,Changsha 410075,China)
  • Received:2017-11-08 Revised:2018-01-24 Online:2018-10-25 Published:2018-10-25

Abstract:

Software defect prediction is a typical unbalanced learning problem. We propose a CCS-SVM software defect prediction model based on cost sensitive SVM algorithm improved by the CSSVM and clustering algorithm. In the CCSSVM prediction model, we combine SVM and the cost of class misclassification, take unbalanced data evaluation index as the objective function, and optimize the misclassification cost factor so as to enhance the recognition rate of the minority class samples. We find the center point of each sample through clustering, define the class confidence for each sample according to the distance of the sample to its center point, assign different misclassification cost factors to different samples, and introduce the class confidence of each sample to the optimization problem of cost sensitive SVM, and improve the robustness of the algorithm and classification performance of SVM. To enhance the generalization ability of the model, we use the genetic algorithm to optimize feature selection and model parameters. Experimental results of the NASA Metric Data Program (MDP) dataset show that our method is  significantly improved in the Gmean and Fmeasure value for model evaluation.
 

Key words: software defect prediction, cost sensitivity, support vector machine, unbanlanced data classification, parameter selection, genetic algorithm