• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (10): 1952-1960.

• 论文 • Previous Articles     Next Articles

A fault localization approach using
multivariate Logistic regression model   

JU Xiaolin1, 2,JIANG Shujuan1,CHEN Xiang2,CAO Heling1,WANG Xingya1   

  1. (1.School of Computer Science and Technology,China University of Mining and Technology,Xuzhou 221116;
    2.School of Computer Science and Technology,Nantong University,Nantong 226019,China)
  • Received:2014-06-13 Revised:2014-08-15 Online:2014-10-25 Published:2014-10-25

Abstract:

Fault localization plays an important role in software development. Combining both construction features and behavior characteristics of program can benefit fault locating. A framework based on multivariate logistic regress model for locating fault in evolving software is proposed. Firstly, the feature data set is constructed by statically analyzing and tracing the program that runs with a set of designed metrics of program construction features and behavior characteristics. Meanwhile, the fault information of old version is obtained from the bug tracking system. Secondly, a univariate analysis is performed to select the metrics that are significantly related to fault, and then we train the multivariate Logistic model on the selected metrics with the constructed feature data set and the tracked fault information. Finally, based on the trained Logistic model, we conduct the multivariate logistic analysis on the feature data set of a new version of evolved software, and predict the faulty class methods. We also conduct an empirical study on a set of benchmarks. The results indicate that the multivariate Logistic model can improve the effectiveness of fault localization.

Key words: fault localization;multivariate logistic analysis;software measurement;software testing