TY - GEN
T1 - Model-driven multisite workflow scheduling
AU - Maheshwari, Ketan
AU - Jung, Eun Sung
AU - Meng, Jiayuan
AU - Vishwanath, Venkatram
AU - Kettimuthu, Rajkumar
PY - 2013
Y1 - 2013
N2 - Workflows continue to play an important role in expressing and deploying scientific applications. In recent years, a wide variety of computational sites have emerged with shared access to users. A user may not be able to complete a complex workflow at a single site. It is thus beneficial to run different tasks of a workflow on different sites. For such cases, judicious scheduling strategy is required in order to map tasks in the workflow to resources at multiple sites so that the workload is balanced among sites and the overhead is minimized in data transfer. The key challenge is that the data transfer rate among sites varies based on the network capacity and load. We propose a workflow scheduling technique that tackles the multi-site task distribution challenge by using data movement performance modeling. We applied this technique to schedule an earth observation science workflow over three sites. Executed via the Swift parallel scripting paradigm, we augmented its default schedule and improved the time-to-completion by up to 52%.
AB - Workflows continue to play an important role in expressing and deploying scientific applications. In recent years, a wide variety of computational sites have emerged with shared access to users. A user may not be able to complete a complex workflow at a single site. It is thus beneficial to run different tasks of a workflow on different sites. For such cases, judicious scheduling strategy is required in order to map tasks in the workflow to resources at multiple sites so that the workload is balanced among sites and the overhead is minimized in data transfer. The key challenge is that the data transfer rate among sites varies based on the network capacity and load. We propose a workflow scheduling technique that tackles the multi-site task distribution challenge by using data movement performance modeling. We applied this technique to schedule an earth observation science workflow over three sites. Executed via the Swift parallel scripting paradigm, we augmented its default schedule and improved the time-to-completion by up to 52%.
UR - http://www.scopus.com/inward/record.url?scp=84893545095&partnerID=8YFLogxK
U2 - 10.1109/CLUSTER.2013.6702647
DO - 10.1109/CLUSTER.2013.6702647
M3 - Conference contribution
AN - SCOPUS:84893545095
SN - 9781479908981
T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC
BT - 2013 IEEE International Conference on Cluster Computing, CLUSTER 2013
T2 - 15th IEEE International Conference on Cluster Computing, CLUSTER 2013
Y2 - 23 September 2013 through 27 September 2013
ER -