• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2011, Vol. 33 ›› Issue (3): 146-151. doi: 10.3969/j.issn.1007130X.2011.

• Papers •

A Method for Solving the Congestion Issue During the Single Node Recovering Based on the MapReduce Model

ZHANG Zhaoning,PENG Yuxing   

  1. (National Laboratory for Parallel and Distributed Processing, Changsha 410073, China)
  • Received: 2009-10-21 Revised: 2010-01-09 Online: 2011-03-25 Published: 2011-03-25

Abstract:

The MapReduce model provides strong support for data-intensive supercomputing as a fundamental application platform. It uses a single-node task scheduler, whose simple architecture makes the worker nodes easy to control but which introduces a single point of failure. Released versions of Hadoop (an open-source MapReduce implementation) offer three different recovery mechanisms: synchronization on demand, recovery from the history log, and dropping. This paper analyzes the data congestion, result errors, and efficiency decline that arise in these three methods, and then proposes a method that delivers task-dependency information to solve these problems.

Key words: MapReduce; Hadoop; task scheduling; single node error recovery; task dependency