TY - GEN
T1 - Workflow fairness control on online and non-clairvoyant distributed computing platforms
AU - Da Silva, Rafael Ferreira
AU - Glatard, Tristan
AU - Desprez, Frédéric
PY - 2013
Y1 - 2013
N2 - Fairly allocating distributed computing resources among workflow executions is critical to multi-user platforms. However, this problem remains mostly studied in clairvoyant and offline conditions, where task durations on resources are known, or the workload and available resources do not vary along time. We consider a non-clairvoyant, online fairness problem where the platform workload, task costs and resource characteristics are unknown and not stationary. We propose a fairness control loop which assigns task priorities based on the fraction of pending work in the workflows. Workflow characteristics and performance on the target resources are estimated progressively, as information becomes available during the execution. Our method is implemented and evaluated on 4 different applications executed in production conditions on the European Grid Infrastructure. Results show that our technique reduces slowdown variability by 3 to 7 compared to first-come-first-served.
AB - Fairly allocating distributed computing resources among workflow executions is critical to multi-user platforms. However, this problem remains mostly studied in clairvoyant and offline conditions, where task durations on resources are known, or the workload and available resources do not vary along time. We consider a non-clairvoyant, online fairness problem where the platform workload, task costs and resource characteristics are unknown and not stationary. We propose a fairness control loop which assigns task priorities based on the fraction of pending work in the workflows. Workflow characteristics and performance on the target resources are estimated progressively, as information becomes available during the execution. Our method is implemented and evaluated on 4 different applications executed in production conditions on the European Grid Infrastructure. Results show that our technique reduces slowdown variability by 3 to 7 compared to first-come-first-served.
UR - http://www.scopus.com/inward/record.url?scp=84883198175&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-40047-6_13
DO - 10.1007/978-3-642-40047-6_13
M3 - Conference contribution
AN - SCOPUS:84883198175
SN - 9783642400469
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 102
EP - 113
BT - Euro-Par 2013 Parallel Processing - 19th International Conference, Proceedings
T2 - 19th International Conference on Parallel Processing, Euro-Par 2013
Y2 - 26 August 2013 through 30 August 2013
ER -