A scalable non-blocking multicast scheme for distributed dag scheduling

Fengguang Song, Jack Dongarra, Shirley Moore

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents an application-level non-blocking multicast scheme for dynamic DAG scheduling on large-scale distributed-memory systems. The multicast scheme takes into account both network topology and space requirement of routing tables to achieve scalability. Specifically, we prove that the scheme is deadlock-free and takes at most logN steps to complete. The routing table chooses appropriate neighbors to store based on topology IDs and has a small space of O(logN). Although built upon MPI point-to-point operations, the experimental results show that our scheme is significantly better than the simple flat-tree method and is comparable to vendor's collective MPI operations.

Original languageEnglish
Title of host publicationComputational Science - ICCS 2009 - 9th International Conference, Proceedings
Pages195-204
Number of pages10
EditionPART 1
DOIs
StatePublished - 2009
Event9th International Conference on Computational Science, ICCS 2009 - Baton Rouge, LA, United States
Duration: May 25 2009May 27 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume5544 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Computational Science, ICCS 2009
Country/TerritoryUnited States
CityBaton Rouge, LA
Period05/25/0905/27/09

Funding

This material is based upon work supported by the Department of Energy Office of Science under grant No. DE-FC02-06ER25761 and by Microsoft Research.

FundersFunder number
U.S. Department of EnergyDE-FC02-06ER25761
Microsoft Research

    Fingerprint

    Dive into the research topics of 'A scalable non-blocking multicast scheme for distributed dag scheduling'. Together they form a unique fingerprint.

    Cite this