ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

DYNAMICALLY PRIORITIZED FAILURE MANAGEMENT ACCORDING TO RELIABILITY MODEL IN LARGE-SCALE DATA CENTER

Journal: IADIS INTERNATIONAL JOURNAL ON WWW/INTERNET (Vol.18, No. 2)

Publication Date:

Authors : ; ;

Page : 101-115

Keywords : ;

Source : Download Find it from : Google Scholarexternal

Abstract

We propose a dynamically prioritized failure management method according to the reliability model that the failure rate of virtual machine varies in its life cycle. When using a combination of server monitoring with ping and network connection check with Ethernet OAM, the system sets higher priorities to the port connected to a long running server, and the port within a certain time after the connection change or virtual machine addition is set. The system then selects the ports from the higher priority port to be monitored by Ethernet OAM. As a result of the evaluation by the simulation, by dynamically selecting the port to be monitored for Ethernet OAM using the proposed method, it was confirmed that more than a third of all failures were detected with Maintenance End Points which number is only a tenth of that of servers in a data center. In the data center for cloud services running many VMs, it is possible to shorten the recovery from VM failure while suppressing the number of objects monitored by Ethernet OAM by using this method.

Last modified: 2022-02-14 22:46:16