Self-management of operational issues for grid computing: The case of the virtual imaging platform

Rafael Ferreira da Silva, Tristan Glatard, Frédéric Desprez

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

6 Scopus citations

Abstract

Science gateways,such as the virtual imaging platform (vip),enable transparent access to distributed computing and storage resources for scientific computations.However,their large scale and the number of middleware systems involved in these gateways lead to many errors and faults.This chapter addresses the autonomic management of workflow executions on science gateways in an online and non-clairvoyant environment,where the platform workload,task costs,and resource characteristics are unknown and not stationary.The chapter describes a general self-management process based on the mape-k loop(monitoring,analysis,planning,execution,and knowledge)to cope with operational incidents of workflow executions.Then,this process is applied to handle late task executions,task granularities,and unfairness among workflow executions.Experimental results show how the approach achieves a fair quality of service by using control loops that constantly perform online monitoring,analysis,and execution of a set of curative actions.

Original languageEnglish
Title of host publicationEmerging Research in Cloud Distributed Computing Systems
PublisherIGI Global
Pages187-221
Number of pages35
ISBN (Electronic)9781466682146
ISBN (Print)1466682132, 9781466682139
DOIs
StatePublished - Mar 31 2015

Keywords

  • Grid computing
  • Scientific gateway
  • Scientific workflow
  • Task grouping
  • Task replication
  • Task resubmission
  • Unfairness among workflow executions

Fingerprint

Dive into the research topics of 'Self-management of operational issues for grid computing: The case of the virtual imaging platform'. Together they form a unique fingerprint.

Cite this