How to improve cloud services availability? Investigating the impact of power and IT subsystems failures

  • Daniel Rosendo
  • , Guto Leoni
  • , Demis Gomes
  • , André Moreira
  • , Glauco Gonçalves
  • , Patricia Takako Endo
  • , Judith Kelner
  • , Djamel Sadok
  • , Mozhgan Mahloo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

The cloud data center is a complex system composed of power, cooling, and IT subsystems. The power subsystem is crucial to feed the IT equipment. Power disruptions may result in service unavailability. This paper analyzes the impact of the power subsystem failures on IT services regarding different architecture configurations based on TIA-942 standard such as non-redundant, redundant, concurrently maintainable, and fault tolerant. We model both subsystems, power and IT, through Stochastic Petri Net (SPN). The availability results show that a fault tolerant power and IT configuration reduces the downtime from 54.1 to 34.5 hours/year when compared to a non-redundant architecture. The sensibility analysis results show that the failure and repair rates of the server component in a fault tolerant system present the highest impact on overall data center availability.

Original languageEnglish
Title of host publicationProceedings of the 51st Annual Hawaii International Conference on System Sciences, HICSS 2018
EditorsTung X. Bui
PublisherIEEE Computer Society
Pages1543-1552
Number of pages10
ISBN (Electronic)9780998133119
StatePublished - 2018
Externally publishedYes
Event51st Annual Hawaii International Conference on System Sciences, HICSS 2018 - Big Island, United States
Duration: Jan 2 2018Jan 6 2018

Publication series

NameProceedings of the Annual Hawaii International Conference on System Sciences
Volume2018-January
ISSN (Print)1530-1605

Conference

Conference51st Annual Hawaii International Conference on System Sciences, HICSS 2018
Country/TerritoryUnited States
CityBig Island
Period01/2/1801/6/18

Funding

This work was supported by the RLAM Innovation Center, Ericsson Telecomunicac¸ões S.A., Brazil.

Fingerprint

Dive into the research topics of 'How to improve cloud services availability? Investigating the impact of power and IT subsystems failures'. Together they form a unique fingerprint.

Cite this