• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2012, Vol. 34 ›› Issue (2): 134-138.

• 论文 • Previous Articles     Next Articles

Categorization and Comparison of the Ensemble Pruning Algorithms

ZHAO Qiangli,JIANG Yanhuang,XU Ming   

  1. (School of Computer Science,National University of Defense Technology,Changsha 410073,China)
  • Received:2010-01-06 Revised:2010-04-25 Online:2012-02-25 Published:2012-02-25

Abstract:

Ensemble pruning is an active research direction in the machine learning field. Ensemble pruning is an NPhard problem, most researchers use heuristics to obtain near optimal solutions. There are already many ensemble pruning approaches in literatures, but because of the different perspectives on which those methods are based, it is difficult to understand them clearly. In this paper, the ensemble pruning approaches are divided into four categories according to their pruning strategies: optimizationbased, rankingbased, clustering based and pattern miningbased. Next, the popular algorithms of each category are implemented and tested on 20 datasets from the UCI repository, and compared from three facets: prediction performance, pruning time and the size of the final ensembles. The advantages and disadvantages of each category are analyzed. The paper ends with some conclusions and future work.

Key words: ensemble learning;ensemble pruning;optimization based pruning;ranking based pruning;clustering based pruning;pattern mining based pruning