Makespan minimization for MapReduce systems with different servers

作者:Jiang, Yiwei*; Zhu, Yuqing; Wu, Weili; Li, Deying
来源:Future Generation Computer Systems-The International Journal of eScience, 2017, 67: 13-21.
DOI:10.1016/j.future.2016.07.012

摘要

In this paper we study MapReduce scheduling on n parallel machines (servers) with different speeds v(1) >= v(2) >= ... >= v(n). Each job contains two kinds of tasks: map tasks and reduce tasks. The job's reduce tasks can only be processed after finishing all its map tasks. We assume that the map tasks are parallelized, i.e., it can be arbitrarily split and parts of the same task can be processed on different machines in parallel, while the reduce tasks are non-parallelizable. We consider both the offline and online scheduling problems. On offline version, if the reduce tasks are non-preemptive, we design an approximation algorithm whose worst case ratio is at most max{1 + Delta/2 - 1/n, Delta}, where Delta = v(1)/v(n) is the ratio of the fastest speed to the slowest speed. If the reduce tasks are preemptive, we provide an approximation algorithm with worst case ratio of 2. On online version where jobs arriving over time, we design two heuristics for non-preemptive and preemptive reduce tasks respectively. In experiment, we verify the advantage of our algorithms comparing with the state-of-the-art.