• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

Computer Engineering & Science

Previous Articles     Next Articles

A Spark based parallel genetic algorithm
solving multimodal function extremums
 

LIU Peng1,2,YE Shuai3,MENG Lei1,2,WANG Can4   

  1. (1.Internet of Things Perception Mine Research Center,China University of Mining and Technology,Xuzhou 221008;
    2.National and Local Joint Engineering Laboratory of Internet Application Technology on Mine,Xuzhou 221008;
    3.School of Information and Control Engineering,China University of Mining and Technology,Xuzhou 221116;
    4.Aerospace Products Division,East China Institute of Computing Technology,Shanghai 201808,China)
  • Received:2017-09-02 Revised:2017-11-05 Online:2018-02-25 Published:2018-02-25

Abstract:

The Genetic Algorithm (GA) needs many computation iterations in solving multimodal function extremums, so its running efficiency is too low when dealing with large-scale data, which greatly limits its practical application. The classical parallel platform Hadoop can improve the GA running efficiency to some extent, while the state-of-the-art parallel platform Spark can release much more parallelism of GA by realizing parallel crossover, mutation and other operations on each computing node. For the convenience of comparison, the GA solving multimodal function extremums are designed and implemented on single node, Hadoop and Spark, respectively. Experimental results show that, compared with single node platform and Hadoop platform, the Spark based implementation not only significantly reduces the running time but also effectively avoids the problem of premature convergence because of its powerful randomness, while dealing with large-scale samples.

Key words: genetic algorithm, multimodal function, extremum, parallel computing, Spark, Hadoop