A New Algorithm for Intermediate Dataset Storage in a Cloud-Based Dataflow

Cheng Jie; Zhu Daming<sup>*</sup>; Zhu Binhai

doi:10.1007/978-3-319-19647-3_4

登录

免费注册

赞收藏引用

科研之友

微信

新浪微博

Facebook

分享链接

A New Algorithm for Intermediate Dataset Storage in a Cloud-Based Dataflow

作者：Cheng Jie; Zhu Daming^*; Zhu Binhai

来源：9th International Frontiers of Algorithmics Workshop (FAW), 2015-07-03 to 2015-07-05.

DOI：10.1007/978-3-319-19647-3_4

摘要

Running a dataflow in a cloud environment usually generates many useful intermediate datasets. A strategy for running a dataflow is to decide which datasets should be stored, while the rest of them are regenerated. The intermediate dataset storage (IDS) problem asks to find a strategy for running a dataflow, such that the total cost is minimized. The current best algorithm for linear-structure IDS takes O(n(4)) time, where "linear-structure" means that the structure of the datasets in the dataflow is a pipeline. In this paper, we present a new algorithm for this problem, and improve the time complexity to O(n(3)), where n is the number of datasets in the pipeline.

出版日期2015
单位山东大学

全文

访问全文

收藏分享被引浏览

更新时间：2019-02-19 16:16

相似论文
引用论文
参考文献

产品服务

科研之友科研之友机构版科创云

站内浏览

科研成果科研人员科研机构

服务支持

帮助中心隐私政策服务条款

联系方式

在线客服：【立即咨询】客户热线：400-1616-289 电子邮箱：support@scholarmate.com

微信公众号