Adaptive Combiner for MapReduce on cloud computing

Authors: Huang Tzu Chi*; Chu Kuo Chih; Lee Wei Tsong; Ho Yu Sheng
Source: Cluster Computing, 2014, 17(4): 1231-1252.
DOI: 10.1007/s10586-014-0362-3

Abstract

MapReduce is a programming model for processing massive amounts of data in cloud computing. It processes data in two phases and must transfer intermediate data among computers between the phases. MapReduce allows programmers to aggregate intermediate data with a function named a combiner before transferring it. Because it leaves the choice of using a combiner to programmers, MapReduce risks performance degradation: aggregating intermediate data benefits some applications but harms others. MapReduce can now work with our proposal, the Adaptive Combiner for MapReduce (ACMR), to automatically and smartly obtain better performance without any intervention from programmers. In experiments on seven applications, MapReduce with ACMR achieves performance comparable to that of the system that is optimal for each application.
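
The trade-off described in the abstract can be illustrated with a minimal Python sketch. It is a toy word count, not the ACMR algorithm and not Hadoop code; every name in it (`map_words`, `combine`, `reduce_counts`, `run`) and both sample inputs are assumptions made for illustration. It counts how many intermediate pairs would be transferred between phases with and without a combiner.

```python
from collections import Counter, defaultdict


def map_words(line):
    """Map phase: emit one (word, 1) pair per word in the line."""
    return [(word, 1) for word in line.split()]


def combine(pairs):
    """Combiner: sum the values per key locally, before the transfer."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return list(totals.items())


def reduce_counts(all_pairs):
    """Reduce phase: sum the values per key across all mappers."""
    totals = defaultdict(int)
    for key, value in all_pairs:
        totals[key] += value
    return dict(totals)


def run(splits, use_combiner):
    """Return (word counts, number of intermediate pairs transferred)."""
    transferred = []
    for lines in splits:  # each split stands for one mapper's input
        pairs = [pair for line in lines for pair in map_words(line)]
        if use_combiner:
            pairs = combine(pairs)
        transferred.extend(pairs)
    return reduce_counts(transferred), len(transferred)


# Hypothetical inputs: one with many repeated keys, one with none.
repetitive = [["a a a a", "a a b"], ["b b b", "a b"]]
distinct = [["a b c", "d e"], ["f g", "h i j"]]

for name, splits in (("repetitive", repetitive), ("distinct", distinct)):
    _, without = run(splits, use_combiner=False)
    _, with_combiner = run(splits, use_combiner=True)
    print(name, "pairs transferred:", without, "->", with_combiner)
```

On the repetitive input the combiner shrinks the transferred data, while on the distinct input it does the aggregation work without removing a single pair. That second case is the kind of application for which a combiner adds cost without benefit.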

  • Publication date: 2014-12