J4 ›› 2011, Vol. 33 ›› Issue (3): 129-135.doi: 10.3969/j.issn.1007130X.2011.
• 论文 • Previous Articles Next Articles
LI Chenghua,ZHANG Xinfang,JIN Hai,XIANG Wen
Received:
Revised:
Online:
Published:
Abstract:
MapReduce is a programming model introduced by Google for writing applications that rapidly process vast amounts of data in parallel on large clusters of computing nodes. The model is inspired by map and reduce functions commonly used in functional programming. A Map/Reduce job usually splits the input dataset into independent chunks which are processed by the map tasks in a completely parallel manner. The reduce tasks merge all intermediate values generated by the map tasks. Users only devote themselves to how to specify the map functions and reduce functions. The details of partitioning the input data, scheduling the program’s execution across a set of machines, handling machine failures, and managing the required intermachine communication are taken care of by the runtime system of MapReduce. MapReduce will be widely adopted on the cloud computing platform. Several aspects of the Hadoop MapReduce contributed by Apache remain to be perfected.
Key words: MapReduce;distributed parallel computing;cloud computing
LI Chenghua,ZHANG Xinfang,JIN Hai,XIANG Wen. MapReduce:a New Programming Model for Distributed Parallel Computing[J]. J4, 2011, 33(3): 129-135.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/10.3969/j.issn.1007130X.2011.
http://joces.nudt.edu.cn/EN/Y2011/V33/I3/129