• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2013, Vol. 35 ›› Issue (11): 87-93.

• Papers •

A novel MapReduce parallel model in hybrid computing environment

TANG Bing¹, HE Hai-wu²

  1. School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan 411201, China
  2. Laboratoire de l’Informatique du Parallélisme, Ecole Normale Supérieure de Lyon, 69364 Lyon Cedex 07, France
  • Received: 2013-08-03   Revised: 2013-10-09   Online: 2013-11-25   Published: 2013-11-25

Abstract:

A novel MapReduce computation model for hybrid computing environments is proposed. With this model, high-performance cluster nodes and heterogeneous desktop PCs on the Internet or an intranet can be integrated into a single hybrid computing environment in which MapReduce tasks are executed to process large-scale datasets. In this way, the computation and storage capacity of large numbers of desktop PCs can be fully utilized. Similar to the design of Hadoop, the model consists of a storage layer and a task layer. The paper briefly introduces the architecture of the model and describes its core components, HybridDFS and the MapReduce algorithms. A prototype system is then designed and implemented, and its performance is evaluated. The evaluation results show that the proposed hybrid computation model not only achieves reliable MapReduce computation but also reduces computation cost, making it a potentially effective computation model.

Key words: hybrid computing environment; MapReduce; volunteer computing; fault tolerance; distributed file system