• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

Process diversity measurement of
ensemble learning based on information entropy

ZHOU Gang,GUO Fu-liang   

  1. (Naval University of Engineering,Wuhan 430033,China)
  • Received:2018-10-29 Revised:2019-01-08 Online:2019-09-25 Published:2019-09-25

Abstract:

The diversity of base classifiers is an important factor to improve the accuracy and generalization ability of ensemble learning. The traditional post verification diversity measurement method is inefficient in big data environment. We therefore propose a process diversity measurement method based on information entropy. The method uses the gain of each attribute of the base classifier and its tree level to obtain the joint gain of the attribute set, and calculates the entropy distance between the classifiers to evaluate their diversity. It dynamically integrates the learning classifier with the entropy distance in accordance with the K-means method. Compared with traditional methods on watermelon dataset and other classification datasets, it is found that the proposed method has similar accuracy and higher computational efficiency.
 
 

Key words: ensemble learning, process diversity, joint gain, K-means, diversity measurement