D-factor: A quantitative model of application slow-down in multi-resource shared systems

Seung Hwan Lim, Jae Seok Huh, Youngjae Kim, Galen M. Shipman, Chita R. Das

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

38 Scopus citations

Abstract

Scheduling multiple jobs onto a platform enhances system utilization by sharing resources. The benefits from higher resource utilization include reduced cost to construct, operate, and maintain a system, which often include energy consumption. Maximizing these benefits, while satisfying performance limits, comes at a price - resource contention among jobs increases job completion time. In this paper, we analyze slow-downs of jobs due to contention for multiple resources in a system; referred to as dilation factor. We observe that multiple-resource contention creates non-linear dilation factors of jobs. From this observation, we establish a general quantitative model for dilation factors of jobs in multi-resource systems. A job is characterized by a vector-valued loading statistics and dilation factors of a job set are given by a quadratic function of their loading vectors. We demonstrate how to systematically characterize a job, maintain the data structure to calculate the dilation factor (loading matrix), and calculate the dilation factor of each job. We validated the accuracy of the model with multiple processes running on a native Linux server, virtualized servers, and with multiple MapReduce workloads co-scheduled in a cluster. Evaluation with measured data shows that the D-factor model has an error margin of less than 16%. We also show that the model can be integrated with an existing on-line scheduler to minimize the makespan of workloads.

Original languageEnglish
Title of host publicationSIGMETRICS/Performance 2012 - Proceedings of the 2012 ACM SIGMETRICS/Performance, Joint International Conference on Measurement and Modeling of Computer Systems
Pages271-282
Number of pages12
Edition1 SPEC. ISS.
DOIs
StatePublished - 2012
Event12th Joint International Conference on Measurement and Modeling of Computer Systems, ACM SIGMETRICS/Performance 2012 - London, United Kingdom
Duration: Jun 11 2012Jun 15 2012

Publication series

NamePerformance Evaluation Review
Number1 SPEC. ISS.
Volume40
ISSN (Print)0163-5999

Conference

Conference12th Joint International Conference on Measurement and Modeling of Computer Systems, ACM SIGMETRICS/Performance 2012
Country/TerritoryUnited Kingdom
CityLondon
Period06/11/1206/15/12

Keywords

  • application running time
  • cloud computing
  • performance modeling
  • shared resource management

Fingerprint

Dive into the research topics of 'D-factor: A quantitative model of application slow-down in multi-resource shared systems'. Together they form a unique fingerprint.

Cite this