Rethink the storage of virtual machine images in clouds

作者:Xu Xiaolin; Jin Hai; Wu Song*; Wang Yihong
来源:Future Generation Computer Systems-The International Journal of eScience, 2015, 50: 75-86.
DOI:10.1016/j.future.2014.10.004

摘要

As one of the most prevalent cloud services, Infrastructure-as-a-Service provides virtual machines (VMs) with high flexibility. How to effectively manage a huge amount of VM images becomes a big challenge. On one hand, images affect VMs' disk IO performance significantly, which is essential to the quality of services, especially for those having intensive disk IO workloads. On the other hand, they consume many storage resources and cause much management cost, which is cared by cloud managers. Current ways to optimize images usually focus on either improving performance or decreasing image size, which unfortunately cannot satisfy the requirements of high IO performance, low storage consumption, and low management cost simultaneously:Typically, high IO performance requires images storing close to VMs, but this increases redundant data and consumes extra storage at the same time. Besides, a closer image means more data stored in local disks rather than a normal shared storage, which increases management cost as well. In this paper, we analyze these requirements and potential tradeoffs among them, and propose Zone-based model to well balance the requirements. We partition computing nodes into many zones, and construct a shared storage in each zone to cache hot data for high IO performance and low storage consumption. In addition, we improve the normal Copy-on-Write and cache mechanisms, providing new image types and cache functions to enhance the eventual effectiveness. The evaluations show that, our solution improves IO performance by more than 100% in general and even 10 times while adopting a friendly VM placement strategy, and gets close or less storage consumption and management cost than the other models at the same time.