• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

A parallel multi-classifier fusion approach
based on selective ensemble

TAO Xiao-ling1,2,KANG Rui-nan3,LIU Li-yan3   

  1. (1.Guangxi Collaboration Innovation Center of Cloud Computing and Big Data,
    Guilin University of Electronic Technology,Guilin 541004;
    2.Guangxi Key Laboratory of Cryptography and Information Security,
    Guilin University of Electronic Technology,Guilin 541004;
    3.College of Information and Communication,Guilin University of Electronic Technology,Guilin 541004,China)
  • Received:2016-11-16 Revised:2017-03-30 Online:2018-05-25 Published:2018-05-25

Abstract:

In order to solve the problem of large time and low accuracy in the process of multi-classifier fusion,a Parallel Multi-Classifier Fusion Approach based on Selective Ensemble (PMCF-SE) is proposed by adopting both the improved Baggingmethod and MapReduce technique. Our approach is based on the MapReduce parallel computing framework.In the Map phase,the base classifiers with better classification performance are selected. In the Reduce phase,the base classifiers of greater diversity are selected, and then the selected base classifiers are fused with the D-S evidence theory. Experimental results show that, compared with the stand-alone environment, the execution efficiency of the classification model in the cluster environment is improved. PMCF-SE has higher classification accuracy than the Bagging algorithm under different numbers of base classifiers.
 

Key words: multi-classifier fusion, selective ensemble, D-S evidence theory, MapReduce, parallelization