A comprehensive approach to optimal software rejuvenation

作者:Zhao, Jing*; Wang, YanBin; Ning, GaoRong; Trivedi, Kishor S.; Matias, Rivalino, Jr.; Cai, Kai-Yuan
来源:Performance Evaluation, 2013, 70(11): 917-933.
DOI:10.1016/j.peva.2013.05.010

摘要

Software aging is caused by resource exhaustion and can lead to progressive performance degradation or result in a crash. We develop experiments that simulate an on-line bookstore application, using the standard configuration of TPC-W benchmark. We study application failures due to memory leaks, using the accelerated life testing (ALT). ALT significantly reduces the time needed to estimate the time to failure at normal level. We then select the Weibull time to failure distribution at normal level, to be used in a semi-Markov model so as to optimize the software rejuvenation trigger interval. Then we derive the optimal rejuvenation schedule interval by fixed point iteration and by an alternative non-parametric estimation algorithm. Finally, we develop a simulation model using importance sampling (IS) to cross validate the ALT experimental results and the semi-Mackay model, and also we apply the non-parametric method to cross validate the optimized trigger intervals by comparing the availabilities obtained from the semi-Markov model and those from IS simulation using the non-parametric method.