J4 ›› 2015, Vol. 37 ›› Issue (9): 1783-1793.
• 论文 • Previous Articles Next Articles
DONG Yuehua,LIU Li
Received:
Revised:
Online:
Published:
Abstract:
Aiming at the problem of multivalue bias in ID3 algorithm, we propose an optimized algorithm of decision tree based on correlation coefficients. Firstly, the correlation coefficients between the attributes are introduced to improve the ID3 algorithm, and in turn the multivalue bias problem is overcome. Then the properties of Taylor formula and Maclaurin formula are adopted to simplify the information gain formula. The concrete data of examples prove that the optimized ID3 algorithm can overcome multivalue bias problem. Experiments on the standard UCI data sets show that the optimized algorithm of decision tree not only improves the accuracy of average classification, but also reduces the complexity in building decision trees and thus reduces the generation time of decision trees. Besides, the efficiency of the optimized ID3 algorithm increases significantly for large scale samples.
Key words: ID3 algorithm;correlation coefficient;decision tree;Taylor formula;information gain
DONG Yuehua,LIU Li. An optimized algorithm of decision tree based on correlation coefficients [J]. J4, 2015, 37(9): 1783-1793.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2015/V37/I9/1783