• Official journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science

An improved ID3 algorithm based on correlation coefficients

WU Sibo,CHEN Zhigang,HUANG Rui   

  1. (School of Software,Central South University,Changsha 410075,China)
  • Received:2016-07-16 Revised:2016-09-01 Online:2016-11-25 Published:2016-11-25

Abstract:

The ID3 algorithm is the most influential algorithm in decision tree construction. However,
it has several disadvantages: it is time-consuming and inefficient, and its attribute
selection is biased toward multi-valued attributes. We propose an improved ID3 algorithm
based on the Spearman rank correlation coefficient, which remedies the inadequacy of the
attribute selection criterion. By using correlation coefficients, we overcome the main
drawback of the ID3 algorithm, namely its tendency to select attributes with more values,
and thereby improve classification accuracy. We also reduce the time spent on logarithm
calculations by applying related mathematical theory. Finally, we conduct several
experiments to verify the feasibility of the improved algorithm. The results show that it
outperforms the ID3 algorithm in both accuracy and efficiency.

Key words: decision tree, ID3 algorithm, information entropy, Spearman Rank, correlation coefficients
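
The multi-valued attribute bias mentioned in the abstract can be illustrated with a small sketch. It is not the authors' algorithm, since the abstract does not give their selection criterion. It only computes the standard ID3 information gain and the standard Spearman rank correlation on a hypothetical toy data set. The names `record_id`, `level`, and `labels`, and all helper functions, are assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(values, labels):
    """Standard ID3 criterion: label entropy minus weighted entropy after the split."""
    groups = defaultdict(list)
    for v, y in zip(values, labels):
        groups[v].append(y)
    n = len(labels)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def average_ranks(xs):
    """Ranks starting at 1; tied values share the mean of their positions."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    ranks = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman rank correlation, computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy) if sx and sy else 0.0


# Hypothetical toy data: record_id is unique per row (many values, no real signal),
# while level takes only three values but tracks the class label closely.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
record_id = [1, 8, 4, 5, 2, 7, 3, 6]
level = [1, 1, 1, 2, 2, 3, 3, 3]

gain_id = information_gain(record_id, labels)
gain_level = information_gain(level, labels)
rho_id = spearman(record_id, labels)
rho_level = spearman(level, labels)

print(f"information gain: record_id={gain_id:.3f}, level={gain_level:.3f}")
print(f"|Spearman rho|:   record_id={abs(rho_id):.3f}, level={abs(rho_level):.3f}")
```

On this toy data, information gain ranks the uninformative `record_id` above `level`, while the absolute Spearman coefficient ranks `level` higher. How the paper combines the coefficient with information gain, and how it shortens the logarithm calculations, is not stated in the abstract and is not modeled here.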