• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Software defect prediction based on
deep autoencoder networks

ZHOU Mo,XU Ling,YANG Mengning,LIAO Shengping,YAN Meng   

  1. (School of Big Data & Software Engineering,Chongqing University,Chongqing 401331,China)
  • Received:2017-09-07 Revised:2018-03-20 Online:2018-10-25 Published:2018-10-25

Abstract:

Software defect prediction is an effective way for improving the quality of software, and the effect of software defect prediction is closely related to data sets’own characteristics. In regard of feature information redundancy and large dimension of data sets, combining with the powerful learning feature ability of deep learning, we propose a software defect prediction method based on deep autoencoder networks. This method firstly uses an unsupervised learning sampling method to do  sampling for 6 open source projects data sets to solve class imbalance problem of datasets. We then build a deep autoencoder network model through training, which can reduce the dimension of data sets. The model uses three classifiers for connection and employs the training sets with reduced dimension to train the classifiers. Finally, we use the test sets to do prediction. Experimental results show that the proposed method outperforms the basic software defect prediction model and the software defect prediction model based on existing feature extraction methods under the circumstance of the data sets with large dimension and redundant feature information. Besides, it is adaptive to different classifiers.
 

Key words: software defect prediction, feature dimension reduction, deep autoencoder network, class imbalance